The Retrieval-Augmented Generation search tool (DogRAG) employs a Large Language Model (LLM) based on the knowledge and annotated data in iDog to provide convenient question-answering about dog breeds, diseases, genetic variations, and gene expression.
Dog Visual Classification (DogVC) is a web tool designed to identify the dog breed from the uploaded dog picture. It uses deep learning method to predict the probability of dog breed. Additionally, a detail link to iDog phenotype dataset is also provided, and a detail link to iDog phenotype dataset is also provided, which includes breed general information, registry details, disease and variations.
Dog SNP Browser provides Genomic SNPs, Genes, Methylation data for Canfam4.
The Assembly converter offers the conversion of genome coordinates across four distinct canine genome assemblies based on CrossMap. Both the BED and VCF format are supported.
Haplotype supports haplotype analysis with the SNP/INDEL data of the 1,929 samples and 352 breeds. The haplotype frequency, haplotype were calculated using Haplotyp tools integrated from GVM. The linkage disequilibrium calculated and visualized using LDBlockShow tool.
The fixation index (FST) is a measure used to quantify population differentiation. It is frequently estimated from genetic polymorphism data, such as single-nucleotide polymorphisms (SNP) or microsatellites. Developed as a special case of Wright's F-statistics, it is one of the most commonly used statistics in population genetics. This tool can help calculate the Fst value using VCFTools based on the genomic variations.
GO Enrichment is used to perform GO enrichment analysis on selected genes of Dog (Canfam4) using clusterProfiler, and the analysis results are displayed in the form of tables and pictures.
Pi (π) is a measure of nucleotide diversity within a population. Higher π values indicate greater nucleotide diversity within the population. Lower π values suggest smaller nucleotide diversity within the population. This tool can help calculate the θπ value using VCFTools based on the genomic variations.
Tajima's D is calculated as the difference between two measures of genetic diversity: the mean number of pairwise differences and the number of segregating sites. The measures are expected to be the same in a neutrally evolving population of constant size. This tool can help calculate the Tajima's D value using VCFTools based on the genomic variations.
ExpPattern generates a heatmap of gene-tissue expression patterns for the input genes and estimates the level of tissue specificity. The tissue specificity indices are computed using the Tspex tool.
The Expression Normalization Tool is to normalize the COUNT/TPM/FPKM value using the formula "log2(value + 1)".
The Expression Standardization Tool is to calculate the Z-score.
KEGG Enrichment is used to perform KEGG enrichment analysis on selected genes of Dog (Canfam4) using clusterProfiler, and the analysis results are displayed in the form of tables and pictures. The kegg annotation is obtained using EGGNOG-mapper.