Porcine transcriptome analysis based on 97 non-normalized cDNA libraries and assembly of 1,021,891 expressed sequence tags
Research output: Contribution to journal › Journal article › Research › peer-review
Final published version, 537 KB, PDF document
Jan Gorodkin, Susanna Cirera, Jakob Hedegaard, Michael J Gilchrist, Frank Panitz, Claus Jørgensen, Karsten Scheibye-Knudsen, Troels Arvin, Steen Lumholdt, Milena Sawera, Trine Green, Bente Nielsen, Jakob H Havgaard, Carina Rosenkilde, Jun Wang, Heng Li, Ruiqiang Li, Bin Liu, Songnian Hu, Wei Dong & 16 others
Results: Using the Distiller package, the ESTs were assembled to roughly 48,000 contigs and 73,000 singletons, of which approximately 25% have a high confidence match to UniProt. Approximately 6,000 new porcine gene clusters were identified. Expression analysis based on the non-normalized libraries resulted in the following findings. The distribution of cluster sizes is scaling invariant. Brain and testes are among the tissues with the greatest number of different expressed genes, whereas tissues with more specialized function, such as developing liver, have fewer expressed genes. There are at least 65 high confidence housekeeping gene candidates and 876 cDNA library-specific gene candidates. We identified differential expression of genes between different tissues, in particular brain/spinal cord, and found patterns of correlation between genes that share expression in pairs of libraries. Finally, there was remarkable agreement in expression between specialized tissues according to Gene Ontology categories.
Conclusion: This EST collection, the largest to date in pig, represents an essential resource for annotation, comparative genomics, assembly of the pig genome sequence, and further porcine transcription studies.
|Number of pages||16|
|Publication status||Published - 2007|
- Animals, Cluster Analysis, Computational Biology, Expressed Sequence Tags, Gene Expression, Gene Expression Profiling, Gene Library, Genomics, Multigene Family, RNA, Messenger, Swine
Number of downloads are based on statistics from Google Scholar and www.ku.dk