GSA_STAT_reads_classification Using STAT in GSA database for short reads classification
Introduction
This package is the pipeline using STAT tools for GSA database to analyze the raw sequence of short reads. We build the local K-mer database which uses Refseq(2023-03 217) and Taxonomy (20230315) . We also revise the script to show the real statistics percentage of "unidentified reads", input of .fastq.gz format file, and optimized the steps of matching species information from hits information.
Publications
No Publication Information
Credits
- Tianhao Huang huangth@big.ac.cn Investigator
National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences / China National Center for Bioinformation, China
Community Ratings
Usability | Efficiency | Reliability | Rated By |
---|---|---|---|
1 users | |||
Sign in to rate | |||
970***3@qq.com (July 4, 2023) |
Summary
Accession | BT007358 |
---|---|
Tool Type | Application |
Category | Unclassified tools |
Platforms | Linux/Unix |
Technologies | BASH, C++, Python3 |
User Interface | Terminal Command Line |
Input Data | FASTA, FASTQ |
Latest Release | 1.0 (May 26, 2023) |
Download Count | 223 |
Country/Region | China |
Submitted By | Tianhao Huang |
Fundings
XDB38050300; 2023YFC2605700