Identification and characterization of transcript polymorphisms in soybean lines varying in oil composition and content.

Wolfgang Goettel, Eric Xia, Robert Upchurch, Ming-Li Wang, Pengyin Chen, Yong-Qiang Charles An
Author Information
  1. Yong-Qiang Charles An: USDA-ARS, Midwest Area, Plant Genetics Research Unit at Donald Danforth Plant Science Center, 975 N Warson Rd, St. Louis, MO 63132, USA. Yong-Qiang.An@ars.usda.gov.

Abstract

BACKGROUND: Variation in seed oil composition and content among soybean varieties is largely attributed to differences in transcript sequences and/or transcript accumulation of oil-production-related genes in seeds. Discovery and analysis of sequence and expression variations in these genes will accelerate soybean oil quality improvement.
RESULTS: In an effort to identify these variations, we sequenced the transcriptomes of soybean seeds from nine lines varying in oil composition and/or total oil content. Our results showed that 69,338 distinct transcripts from 32,885 annotated genes were expressed in seeds. A total of 8,037 transcript expression polymorphisms and 50,485 transcript sequence polymorphisms (48,792 SNPs and 1,693 small indels) were identified among the lines. Effects of the transcript polymorphisms on their encoded protein sequences and functions were predicted. The study also provided independent evidence that the lack of FAD2-1A gene activity and a non-synonymous SNP in the coding sequence of FAB2C caused elevated oleic acid and stearic acid levels in soybean lines M23 and FAM94-41, respectively.
CONCLUSIONS: As a proof of concept, we developed an integrated RNA-seq and bioinformatics approach to identify and functionally annotate transcript polymorphisms, and demonstrated that it is highly effective for discovering genetic and transcript variations that result in altered oil quality traits. The collection of transcript polymorphisms, coupled with their predicted functional effects, will be a valuable asset for further discovery of genes, gene variants, and functional markers to improve soybean oil quality.

MeSH Terms

Chromosomes, Plant
Cluster Analysis
Gene Expression Profiling
Genotype
INDEL Mutation
Lipid Metabolism
Metabolic Networks and Pathways
Multigene Family
Organ Specificity
Phenotype
Polymorphism, Genetic
Polymorphism, Single Nucleotide
Quantitative Trait Loci
Seeds
Sequence Analysis, RNA
Soybean Oil
Glycine max
Transcriptome

Chemicals

Soybean Oil