HU Songnian


Major Research: Genomics, Molecular Biology and Molecular Genetics

1. Genomics

We perform whole-genome sequencing, including sequencing, assembly, finishing and so on. We also analyze the composition and structure of whole genomes. At present we focus mainly on microbial genomes and on the re-sequencing of important genomes such as human and rice.

2. Functional Genomics

We focus mainly on gene expression and regulation at the genome level, on the reorganization and identification of gene-associated information, and on genome diversity and comparison.

3. Transcriptomics

We perform comprehensive transcriptomic studies using both traditional methods, such as EST and SAGE, and newer methods based on next-generation sequencing (SOLiD), focusing mainly on the detection and comparison of transcriptomes. For example, we carried out a study on the molecular mechanism of heterosis. Based on a comparative analysis between the F1 hybrid and its parental lines, we detected a large number of heterosis-associated genes and analyzed their expression patterns, which gave us a global view of heterosis-associated genes and laid a data foundation for further study.
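The comparison between an F1 hybrid and its parental lines can be illustrated with a minimal sketch. It is not the group's actual pipeline: the function name `classify_expression`, the 10% tolerance and the category labels are all assumptions made for illustration. The sketch simply checks whether a gene's F1 expression lies above the higher parent, below the lower parent, near the mid-parent value, or elsewhere within the parental range.

```python
def classify_expression(p1, p2, f1, tol=0.1):
    """Classify one gene by comparing F1 expression with both parents.

    p1, p2, f1 are non-negative expression values (e.g. normalized tag
    counts). tol is a hypothetical relative tolerance, not a value taken
    from the study.
    """
    mid = (p1 + p2) / 2.0          # mid-parent value
    high, low = max(p1, p2), min(p1, p2)
    if f1 > high * (1 + tol):
        return "above high parent"
    if f1 < low * (1 - tol):
        return "below low parent"
    if abs(f1 - mid) <= tol * mid:
        return "additive (near mid-parent)"
    return "non-additive (within parental range)"


# Hypothetical expression values for four genes: (parent 1, parent 2, F1)
genes = {
    "geneA": (10, 20, 30),
    "geneB": (10, 20, 15),
    "geneC": (10, 20, 19),
    "geneD": (10, 20, 5),
}
for name, (p1, p2, f1) in genes.items():
    print(name, classify_expression(p1, p2, f1))
```

In a real analysis the fixed tolerance would normally be replaced by a statistical test on replicated data.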

4. Bioinformatics

We develop and improve bioinformatic tools and construct the associated databases. For example, in our research on the molecular genetic mechanism of heterosis, we mainly adopt bioinformatics methods: improving the analysis methods for transcriptome data, mining and integrating the data, and using network theory to study the molecular mechanism of heterosis. So far, we have finished improving the SAGE mapping methods and have constructed the “HRGD” database.

5. Others

1. MicroRNAs and their functions

The microRNA research group focuses on miRNAomics in different species, including Trichomonas vaginalis, Apis mellifera, Tribolium castaneum and Bombyx mori. We discovered miRNAs in Protozoa (Trichomonas vaginalis) for the first time, using a combined computational and experimental approach. In our miRNA research on Tribolium castaneum and Bombyx mori, we developed an elaborate computational protocol for the prediction of insect miRNAs and an effective method for miRNA identification. Taking a combined approach, we identified 118 conserved miRNAs and 151 novel miRNA candidates from the silkworm genome sequence. The expression patterns show that the molting stages are hotspots of miRNA expression, in both the variety and the quantity of miRNAs expressed.

2. Data mining of the epigenomics data

With the development of second-generation sequencing technology, a great deal of histone modification and DNA methylation data has become available, e.g. the profiles of 20 histone modifications in human, the comparative modification profiles of human and mouse, and so on. How can these raw data be integrated and mined? We will construct computational models using data mining tools and propose hypotheses on the relationship between epigenetic modification and gene expression.


Group Leader

Prof. HU Songnian received his Ph.D. degree in Plant Molecular Biology from China Agricultural University in 1996. In 1999 he became the program director and a main participant of the International Human Genome Project (Beijing region) at BGI, and set up the largest genome sequencing platform in China. In 2001, he led the team that successfully finished the draft sequence of the rice genome (Oryza sativa L. ssp. indica) within 50 days, and made major innovations in reducing sequencing cost and increasing efficiency. He was awarded the “Outstanding Scientific and Technology achievement award for a research group” of the Qiu Shi Scientific and Technology Foundation in 2002 and the “Outstanding Scientific and Technology achievement award for a research group” of the Chinese Academy of Sciences in 2003 for his outstanding contributions to the “Human Genome Project for China Region” and the “China Super Hybrid Rice Genome Project”, and for his excellent performance in setting up a large-scale DNA sequencing platform. He has also participated in other genome projects carried out at BIG, such as the “Sino-Danish Porcine Genome Project”, the silkworm genome project and the chicken genome variation mapping project. He has published more than 60 papers and two monographs, and has contributed to another two monographs.


Recent Progress:

  Identification and analysis of mouse non-coding RNA using transcriptome data 

Transcripts are expressed in a spatially and temporally regulated manner that is complex, precise and specific; however, most studies have focused on protein-coding genes. Recently, massively parallel cDNA sequencing (RNA-seq) has emerged as a new and promising tool for transcriptome research, and numerous non-coding RNAs (ncRNAs), especially long intergenic non-coding RNAs (lincRNAs), have been identified and characterized as important regulators of diverse biological processes. In this study, we obtained ultra-deep RNA-seq data from 15 mouse tissues, which gave us the opportunity to study the diversity and dynamics of ncRNAs in mouse in depth. Using a workflow that we developed, we identified a total of 16,280 ncRNA genes in mouse. We annotated these ncRNAs by diverse properties and found that, compared with protein-coding genes, ncRNAs are generally shorter, have fewer exons, are expressed at lower levels and are more strikingly tissue-specific. Moreover, these ncRNAs show significant enrichment for transcriptional initiation and elongation signals, including histone modifications (H3K4me3, H3K27me3 and H3K36me3), RNAPII binding sites and CAGE tags. Gene Set Enrichment Analysis (GSEA) revealed several sets of lincRNAs associated with diverse biological processes, such as immune effector process, muscle development and sexual reproduction. Taken together, this study provides a new annotation of mouse non-coding genes and opens the way for future functional and evolutionary studies of mouse ncRNAs.

The basic genomic features of non-coding genes compared with protein-coding genes. (a) Exon number comparison. (b) Gene length comparison. (c) The expression breadth. (d) The expression dynamics across different tissues.
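As an illustration of how expression breadth and tissue specificity can be quantified, the sketch below computes two simple per-gene measures from expression values across tissues. The text above does not state which metrics were used in the study, so both are assumptions: the widely used tau index (0 for uniform expression, 1 for expression in a single tissue) and a breadth count with a hypothetical expression threshold of 1.0. The function names and example values are likewise invented for illustration.

```python
def tau_index(expr):
    """Tissue-specificity index tau for one gene.

    expr holds non-negative expression values, one per tissue.
    Returns 0.0 for uniform expression and 1.0 when the gene is
    expressed in exactly one tissue.
    """
    n = len(expr)
    if n < 2:
        raise ValueError("need expression values for at least two tissues")
    x_max = max(expr)
    if x_max <= 0:
        raise ValueError("tau is undefined for a gene with no expression")
    return sum(1 - x / x_max for x in expr) / (n - 1)


def expression_breadth(expr, threshold=1.0):
    """Number of tissues in which expression reaches the threshold.

    The threshold is a hypothetical cutoff, not a value from the study.
    """
    return sum(1 for x in expr if x >= threshold)


# Hypothetical expression profiles across four tissues
broad_gene = [12.0, 10.0, 11.0, 13.0]    # e.g. a housekeeping-like gene
specific_gene = [0.0, 0.0, 8.0, 0.0]     # e.g. a tissue-specific ncRNA

print(tau_index(broad_gene), expression_breadth(broad_gene))
print(tau_index(specific_gene), expression_breadth(specific_gene))
```

Comparing the distributions of such values between ncRNA genes and protein-coding genes is one way to express the observation that ncRNAs are more tissue-specific.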


