dc.contributor.advisor | Langmead, Ben | |
dc.creator | Wilks, Christopher Nathan | |
dc.date.accessioned | 2022-03-23T14:46:07Z | |
dc.date.available | 2022-03-23T14:46:07Z | |
dc.date.created | 2020-12 | |
dc.date.issued | 2020-10-14 | |
dc.date.submitted | December 2020 | |
dc.identifier.uri | http://jhir.library.jhu.edu/handle/1774.2/66892 | |
dc.description.abstract | As more and larger genomics studies appear, there is a growing need for
comprehensive and queryable cross-study summaries. We focus primarily on nearly 20,000 RNA-sequencing studies in human and mouse, consisting of
more than 750,000 sequencing runs, and the coverage summaries derived from their alignment to their respective gnomes. In addition to the summarized
RNA-seq derived data itself we present tools (Snaptron, Monorail, Megadepth,
and recount3) that can be used by downstream researchers both to process their own data into comparable summaries as well as access and query our processed, publicly available data. Additionally we present a related study of errors in the splicing of long read transcriptomic alignments, including comparison
to the existing splicing summaries from short reads already described
(LongTron). | |
dc.format.mimetype | application/pdf | |
dc.language.iso | en_US | |
dc.publisher | Johns Hopkins University | |
dc.subject | big data, RNA-seq, genomics, cloud | |
dc.title | ENABLING EFFICIENT AND STREAMLINED ACCESS TO LARGE SCALE GENOMIC EXPRESSION AND SPLICING DATA | |
dc.type | Thesis | |
thesis.degree.discipline | Bioinformatics | |
thesis.degree.grantor | Johns Hopkins University | |
thesis.degree.grantor | Whiting School of Engineering | |
thesis.degree.level | Doctoral | |
thesis.degree.name | Ph.D. | |
dc.date.updated | 2022-03-23T14:46:08Z | |
dc.type.material | text | |
thesis.degree.department | Computer Science | |
dc.contributor.committeeMember | Schatz, Michael C. | |
dc.contributor.committeeMember | Florea, Liliana | |
dc.publisher.country | USA | |