Long-Read Transcriptome of Equine Bronchoalveolar Cells.
Sophie Elena Sage, Pamela Nicholson, Tosso Leeb, Vinzenz Gerber, Vidhya Jagannathan
Author Information
Sophie Elena Sage: Swiss Institute of Equine Medicine, Department of Clinical Veterinary Medicine, Vetsuisse Faculty, University of Bern, 3001 Bern, Switzerland.
Pamela Nicholson: Next Generation Sequencing Platform, University of Bern, 3001 Bern, Switzerland.
Tosso Leeb: Institute of Genetics, Vetsuisse Faculty, University of Bern, 3001 Bern, Switzerland.
Vinzenz Gerber: Swiss Institute of Equine Medicine, Department of Clinical Veterinary Medicine, Vetsuisse Faculty, University of Bern, 3001 Bern, Switzerland.
Vidhya Jagannathan: Institute of Genetics, Vetsuisse Faculty, University of Bern, 3001 Bern, Switzerland.
We used Pacific Biosciences long-read isoform sequencing to generate full-length transcript sequences from equine bronchoalveolar lavage fluid (BALF) cells. Our dataset comprised 313,563 HiFi reads totaling 805 Mb of polished sequence. The resulting equine BALF transcriptome contained 14,234 full-length transcript isoforms originating from 7017 unique genes, of which 6880 were previously annotated and 137 were novel. We identified 3428 novel transcripts in addition to 10,806 previously known transcripts. The novel transcripts included transcripts absent from existing genome annotations, transcripts mapping to putative novel (unannotated) genes, and fusion transcripts incorporating exons from multiple genes. We provide these transcript-level data for equine BALF cells as a resource for the scientific community.