Haplotype

Functions

Uploading sequence mutations and corresponding sample metadata (file format can refer to the help block), haplotype network will be constructed via McAN (minimum-cost arborescence-based haplotype network). Based on constructed haplotype network, community lineages will be determined by Newman’s method.
Currently, it supports the network construction of SARS-CoV-2 and other species. Especially for SARS-CoV-2, you can freely select a specific dataset from Resource for Coronavirus 2019 (RCoV19)) as the background data.

Select Species

Select File Format

Mutation File    Mutation File example
Upload mutation file
Metadata File    Metadata file example
Upload metadata file
Select a specific dataset in RCoV19   
Parameter for constructing haplotype network
Email (Results will be notified via email when the calculating time is long)
Email
 
Subject
Run
Reminder: Running tasks: , Tasks in queue: .
Help
1. Mutation File

We recommend mutation data in genovar format, the genovar format is a text file format. Each row represents a sample, contains sample name, accession ID, and mutations details split by ‘;’, and columns are separated by tab key.

NameAccessionMutations
hCoV-19/human/USA/TX-DSHS-000508/2020EPI_ISL_2264424490(SNP:T->A);3177(SNP:C->T);6040(SNP:C->T);6843(SNP:C->T);8782(SNP:C->T);8950(SNP:C->T);12478(SNP:G->A);18736(SNP:T->C);24034(SNP:C->T);26729(SNP:T->C);26801(SNP:C->T);28077(SNP:G->C);28144(SNP:T->C);28896(SNP:C->G);29451(SNP:C->T);29700(SNP:A->G)
hCoV-19/human/USA/TX-DSHS-000511/2020EPI_ISL_22644323003(SNP:A->T);8782(SNP:C->T);10811(SNP:C->T);10813(SNP:T->A);17747(SNP:C->T);17858(SNP:A->G);18060(SNP:C->T);24694(SNP:A->T);28144(SNP:T->C)
hCoV-19/human/USA/TX-DSHS-000513/2020EPI_ISL_22644343003(SNP:A->T);8782(SNP:C->T);10811(SNP:C->T);10813(SNP:T->A);17747(SNP:C->T);17858(SNP:A->G);18060(SNP:C->T);24694(SNP:A->T);28144(SNP:T->C)
hCoV-19/human/USA/TX-DSHS-000515/2020EPI_ISL_2264437241(SNP:C->T);3037(SNP:C->T);8664(SNP:C->T);14408(SNP:C->T);15026(SNP:C->T);15264(SNP:T->C);23403(SNP:A->G);27575(SNP:C->T)
hCoV-19/human/USA/TX-DSHS-000502/2020EPI_ISL_2264410241(SNP:C->T);1059(SNP:C->T);3037(SNP:C->T);3068(SNP:G->A);9169(SNP:C->T);14408(SNP:C->T);23403(SNP:A->G);25563(SNP:G->T)
2. VCF File

VCF is a text file format. It contains meta-information lines, a header line, and then data lines each containing information about a position in the genome. The format also has the ability to contain genotype information on samples for each position.

##fileformat=VCFv4.1
##reference=SARS-CoV-2
##contig=<ID=MN908947.3,length=29903>
#CHROMPOSIDREFALTQUALFILTERINFOFORMATSmapleASmapleBSmapleC
RCoV2490.TA...GTAATTTT
RCoV23003.AT...GTAATTTT
RCoV28782.CT...GTTTTTTT
RCoV210811.CT...GTCCTTTT
RCoV217747.CT...GTCCTTTT
3. Metadata File

Sample metadata in tabular tab-delimited text file format is need, example like:

AccessionSampling DateSampling Location
EPI_ISL_22494792020-04-13United States
EPI_ISL_22547262020-04-24 Slovakia
EPI_ISL_22700902020-05-11Switzerland
EPI_ISL_22740272020-05-14Haiti
EPI_ISL_22788202020-04-27Sweden