Read Anywhere and on Any Device!

Subscribe to Read | $0.00

Join today and start reading your favorite books for Free!

Read Anywhere and on Any Device!

  • Download on iOS
  • Download on Android
  • Download on iOS

A Factor Analysis Approach to Transcription Regulatory Network Reconstruction Using Gene Expression Data

A Factor Analysis Approach to Transcription Regulatory Network Reconstruction Using Gene Expression Data

陈玮
0/5 ( ratings)
This dissertation, "A Factor Analysis Approach to Transcription Regulatory Network Reconstruction Using Gene Expression Data" by Wei, Chen, 陈玮, was obtained from The University of Hong Kong and is being sold pursuant to Creative Attribution 3.0 Hong Kong License. The content of this dissertation has not been altered in any way. We have altered the formatting in order to facilitate the ease of printing and reading of the dissertation. All rights not granted by the above license are retained by the author.



Reconstruction of Transcription Regulatory Network and Transcription Factor Activity from gene expression data is an important problem in systems biology. Currently, there exist various factor analysis methods for TRN reconstruction, but most approaches have specific assumptions not satisfied by real biological data. Network Component Analysis can handle such limitations and is considered to be one of the most effective methods. The prerequisite for NCA is knowledge of the structure of TRN. Such structure can be obtained from ChIP-chip or ChIP-seq experiments, which however have quite limited applications. In order to cope with the difficulty, we resort to heuristic optimization algorithm such as Particle Swarm Optimization , in order to explore the possible structures of TRN and choose the most plausible one. Regarding the structure estimation problem, we extend classical PSO and propose a novel Probabilistic binary PSO. Furthermore, an improved NCA called FastNCA is adopted to compute the objective function accurately and fast, which enables PSO to run efficiently. Since heuristic optimization cannot guarantee global convergence, we run PSO multiple times and integrate the results. Then GCV-LASSO is performed to estimate TRN. We apply our approach and other factor analysis methods on the synthetic data. The results indicate that the proposed PSOFastNCA-GCV-LASSO algorithm gives better estimation.

In order to incorporate more prior information on TRN structure and gene expression dynamics in the linear factor analysis model for improved estimation of TRN and TFAs, a linear Bayesian framework is adopted. Under the unified Bayesian framework, Bayesian Linear Sparse Factor Analysis Model and Bayesian Linear State Space Model are developed for instantaneous and dynamic TRN, respectively. Various approaches to incorporate partial and ambiguous prior network structure information in the Bayesian framework are proposed to improve performance in practical applications. Furthermore, we propose a novel mechanism for estimating the hyper-parameters of the distribution priors in our BLSFM and BLSSM models, which can significantly improve the estimation compared to traditional ways of hyper-parameter setting. With this development, reasonably good estimation of TFAs and TRN can be obtained even without use of any structure prior of TRN. Extensive numerical experiments are performed to investigate our developed methods under various settings, with comparison to some existing alternative approaches. It is demonstrated that our hyper-parameter estimation method improves the estimation of TFA and TRN in most settings and has superior performance, and that structure priors in general leads to improved estimation performance.

Regarding application to real biological data, we execute the PSO-FastNCAGCV-LASSO algorithm developed in the thesis using E. Coli microarray data and obtain sensible estimation of TFAs and TRN. We apply BLSFM without structure priors of TRN, BLSSM without structure priors as well as with partial structure priors to Yeast S. cerevisiae microarray data and obtain a reasonable estimation of TFAs and TRN.

10.5353/th_b4961778



Factor analysis
Genetic transcription - Regulation - Statistical metho
Language
English
Format
Paperback
Release
January 26, 2017
ISBN 13
9781361305225

A Factor Analysis Approach to Transcription Regulatory Network Reconstruction Using Gene Expression Data

陈玮
0/5 ( ratings)
This dissertation, "A Factor Analysis Approach to Transcription Regulatory Network Reconstruction Using Gene Expression Data" by Wei, Chen, 陈玮, was obtained from The University of Hong Kong and is being sold pursuant to Creative Attribution 3.0 Hong Kong License. The content of this dissertation has not been altered in any way. We have altered the formatting in order to facilitate the ease of printing and reading of the dissertation. All rights not granted by the above license are retained by the author.



Reconstruction of Transcription Regulatory Network and Transcription Factor Activity from gene expression data is an important problem in systems biology. Currently, there exist various factor analysis methods for TRN reconstruction, but most approaches have specific assumptions not satisfied by real biological data. Network Component Analysis can handle such limitations and is considered to be one of the most effective methods. The prerequisite for NCA is knowledge of the structure of TRN. Such structure can be obtained from ChIP-chip or ChIP-seq experiments, which however have quite limited applications. In order to cope with the difficulty, we resort to heuristic optimization algorithm such as Particle Swarm Optimization , in order to explore the possible structures of TRN and choose the most plausible one. Regarding the structure estimation problem, we extend classical PSO and propose a novel Probabilistic binary PSO. Furthermore, an improved NCA called FastNCA is adopted to compute the objective function accurately and fast, which enables PSO to run efficiently. Since heuristic optimization cannot guarantee global convergence, we run PSO multiple times and integrate the results. Then GCV-LASSO is performed to estimate TRN. We apply our approach and other factor analysis methods on the synthetic data. The results indicate that the proposed PSOFastNCA-GCV-LASSO algorithm gives better estimation.

In order to incorporate more prior information on TRN structure and gene expression dynamics in the linear factor analysis model for improved estimation of TRN and TFAs, a linear Bayesian framework is adopted. Under the unified Bayesian framework, Bayesian Linear Sparse Factor Analysis Model and Bayesian Linear State Space Model are developed for instantaneous and dynamic TRN, respectively. Various approaches to incorporate partial and ambiguous prior network structure information in the Bayesian framework are proposed to improve performance in practical applications. Furthermore, we propose a novel mechanism for estimating the hyper-parameters of the distribution priors in our BLSFM and BLSSM models, which can significantly improve the estimation compared to traditional ways of hyper-parameter setting. With this development, reasonably good estimation of TFAs and TRN can be obtained even without use of any structure prior of TRN. Extensive numerical experiments are performed to investigate our developed methods under various settings, with comparison to some existing alternative approaches. It is demonstrated that our hyper-parameter estimation method improves the estimation of TFA and TRN in most settings and has superior performance, and that structure priors in general leads to improved estimation performance.

Regarding application to real biological data, we execute the PSO-FastNCAGCV-LASSO algorithm developed in the thesis using E. Coli microarray data and obtain sensible estimation of TFAs and TRN. We apply BLSFM without structure priors of TRN, BLSSM without structure priors as well as with partial structure priors to Yeast S. cerevisiae microarray data and obtain a reasonable estimation of TFAs and TRN.

10.5353/th_b4961778



Factor analysis
Genetic transcription - Regulation - Statistical metho
Language
English
Format
Paperback
Release
January 26, 2017
ISBN 13
9781361305225

More books from 陈玮

Rate this book!

Write a review?

loader