Show simple item record

dc.contributor.authorBradford, James R
dc.contributor.authorHey, Yvonne
dc.contributor.authorYates, Tim
dc.contributor.authorLi, Yaoyong
dc.contributor.authorPepper, Stuart D
dc.contributor.authorMiller, Crispin J
dc.date.accessioned2010-09-18T08:41:24Z
dc.date.available2010-09-18T08:41:24Z
dc.date.issued2010
dc.identifier.citationA comparison of massively parallel nucleotide sequencing with oligonucleotide microarrays for global transcription profiling. 2010, 11:282 BMC Genomicsen
dc.identifier.pmid20444259
dc.identifier.doi10.1186/1471-2164-11-282
dc.identifier.urihttp://hdl.handle.net/10541/111353
dc.description.abstractBACKGROUND: RNA-Seq exploits the rapid generation of gigabases of sequence data by Massively Parallel Nucleotide Sequencing, allowing for the mapping and digital quantification of whole transcriptomes. Whilst previous comparisons between RNA-Seq and microarrays have been performed at the level of gene expression, in this study we adopt a more fine-grained approach. Using RNA samples from a normal human breast epithelial cell line (MCF-10a) and a breast cancer cell line (MCF-7), we present a comprehensive comparison between RNA-Seq data generated on the Applied Biosystems SOLiD platform and data from Affymetrix Exon 1.0ST arrays. The use of Exon arrays makes it possible to assess the performance of RNA-Seq in two key areas: detection of expression at the granularity of individual exons, and discovery of transcription outside annotated loci. RESULTS: We found a high degree of correspondence between the two platforms in terms of exon-level fold changes and detection. For example, over 80% of exons detected as expressed in RNA-Seq were also detected on the Exon array, and 91% of exons flagged as changing from Absent to Present on at least one platform had fold-changes in the same direction. The greatest detection correspondence was seen when the read count threshold at which to flag exons Absent in the SOLiD data was set to t<1 suggesting that the background error rate is extremely low in RNA-Seq. We also found RNA-Seq more sensitive to detecting differentially expressed exons than the Exon array, reflecting the wider dynamic range achievable on the SOLiD platform. In addition, we find significant evidence of novel protein coding regions outside known exons, 93% of which map to Exon array probesets, and are able to infer the presence of thousands of novel transcripts through the detection of previously unreported exon-exon junctions. CONCLUSIONS: By focusing on exon-level expression, we present the most fine-grained comparison between RNA-Seq and microarrays to date. Overall, our study demonstrates that data from a SOLiD RNA-Seq experiment are sufficient to generate results comparable to those produced from Affymetrix Exon arrays, even using only a single replicate from each platform, and when presented with a large genome.
dc.language.isoenen
dc.subjectTumour Cell Lineen
dc.subject.meshAlternative Splicing
dc.subject.meshCell Line, Tumor
dc.subject.meshChromosomes, Human, Y
dc.subject.meshExons
dc.subject.meshGene Expression
dc.subject.meshHumans
dc.subject.meshOligonucleotide Array Sequence Analysis
dc.subject.meshSequence Analysis, DNA
dc.subject.meshTranscription, Genetic
dc.titleA comparison of massively parallel nucleotide sequencing with oligonucleotide microarrays for global transcription profiling.en
dc.typeArticleen
dc.identifier.eissn1471-2164
dc.contributor.departmentApplied Computational Biology and Bioinformatics, Cancer Research UK, Paterson Institute for Cancer Research, The University of Manchester, Wilmslow Road, Manchester M20 4BX UK.en
dc.identifier.journalBMC Genomicsen
html.description.abstractBACKGROUND: RNA-Seq exploits the rapid generation of gigabases of sequence data by Massively Parallel Nucleotide Sequencing, allowing for the mapping and digital quantification of whole transcriptomes. Whilst previous comparisons between RNA-Seq and microarrays have been performed at the level of gene expression, in this study we adopt a more fine-grained approach. Using RNA samples from a normal human breast epithelial cell line (MCF-10a) and a breast cancer cell line (MCF-7), we present a comprehensive comparison between RNA-Seq data generated on the Applied Biosystems SOLiD platform and data from Affymetrix Exon 1.0ST arrays. The use of Exon arrays makes it possible to assess the performance of RNA-Seq in two key areas: detection of expression at the granularity of individual exons, and discovery of transcription outside annotated loci. RESULTS: We found a high degree of correspondence between the two platforms in terms of exon-level fold changes and detection. For example, over 80% of exons detected as expressed in RNA-Seq were also detected on the Exon array, and 91% of exons flagged as changing from Absent to Present on at least one platform had fold-changes in the same direction. The greatest detection correspondence was seen when the read count threshold at which to flag exons Absent in the SOLiD data was set to t<1 suggesting that the background error rate is extremely low in RNA-Seq. We also found RNA-Seq more sensitive to detecting differentially expressed exons than the Exon array, reflecting the wider dynamic range achievable on the SOLiD platform. In addition, we find significant evidence of novel protein coding regions outside known exons, 93% of which map to Exon array probesets, and are able to infer the presence of thousands of novel transcripts through the detection of previously unreported exon-exon junctions. CONCLUSIONS: By focusing on exon-level expression, we present the most fine-grained comparison between RNA-Seq and microarrays to date. Overall, our study demonstrates that data from a SOLiD RNA-Seq experiment are sufficient to generate results comparable to those produced from Affymetrix Exon arrays, even using only a single replicate from each platform, and when presented with a large genome.


This item appears in the following Collection(s)

Show simple item record