iDentification of pIG breeds (iDIGs)

iDIGs v2.0 includes a SNP dataset of a wide range of pig breeds. The dataset are used for pig breed idetification and panel design. See How to for detail functions.

Reference Samples Distribution

3605 samples

Genome-wide information were used

124 pig breeds

Covering main pig breeds across the world

45 sites

Samples were distributed in 45 sites of 24 countries

PLS distribution of all samples

All sample info

All breed info

Breed Identification analysis

First step: Data upload


Second step: Choose reference breeds (Optional)


Third step: E-mail address and submit

Design a small panel for breed identification of some specfic breeds

First step: Select panel density


Second step: Choose reference breeds


Hints: This files are required if you choose to upload your own breed information, otherwise the information in the database will be used for the panel design.

Third step: E-mail address and submit

Kinship Analysis

Step 1: Select Species and Maximum Relationship Class

Select Species


Maximum Relationship Class

Step 2: Upload Plink Binary Files

Step 3: E-mail Address and Submit

Sample Swap Detection

Step 1: Select Species

Select Species

Step 2: Upload Files

Upload Files

Upload Genomic PLINK Files
Upload Non-Genomic PLINK Files
Upload VCF Files

Hints: Because the original VCF files are large, you can use our scripts to process them and generate genotype-processed VCF files for upload.

Step 3: E-mail Address and Submit

Introduction

Welcome to iDentification of pIG breeds (iDIGs) v2.0 !

iDIGs is a tool for global pig breed identification. Our database has 3,622 samples and 124 breeds worldwide. Now, iDIGs has two mian functions: pig breed identification and panel design for specific breeds identification.

  1. Breed Identification
  2. Panel Design
Function

1. Breed Identification

Using our database to recognize users' pig breed

  • Step1: Choose the version of your reference pig genome (10.2 or 11.1) and upload your PLINK binary file. No missing SNPs are allowed . You can perform imputation using PHARP or Beagle.
  • Step2: Choose reference breeds ( Optional ). If not chose, all breeds in our database will be used as reference.
  • Step3: Input your E-mail address and submit. The results will send to the E-mail address automatically.
  • Results: A table of two columns. The first column is the input sampleID. The second column is the putative breed label.

2. Panel Design

Designing a small panel to distinguish some specific breeds from each other.

  • Step1: Choose the version of your reference pig genome (10.2 or 11.1) and the SNP number in the panel. Note that, the SNP number should be less than 10,000.
  • Step2: choose breeds you want to distinguish from. You can select multiple breed IDs or a single breed ID. If you input multiple breed IDs, markers that can distinguish all input breeds from each other will be selected. If you input a single breed ID, Markers that can distinguish your input breed from all other breeds in our database will be selected.
  • Step3(non-essential):If you want to use the breeds you have for panel design, you can upload your BIM file and frq.strat file .
  • Step4: Input your E-mail address and submit. The results will send to the E-mail address automatically.
  • Results:

  • Marker ID used in panel;
  • Performance estimation of the panel using 5-fold CV ( Only fully use the breed in iDIGs's database ).
  • The preformance can be imporved by increasing the SNP number.

FAQ

Q: What type of data may i uploaded when using iDIGs?

A: We recommand commercial SNP Chips and WGS.

Q: How could i upload big PLINK binary files, such as SNPs from WGS data?

A: Big data (file size > 1 GB), such as WGS data, is hardly upload. Thus, you can download our markerID and only extract overlapped markers to upload, The relevant commands are as follows:

# If using LINUX
chmod +x MakeSNPidL

# Normalize your markerID in bim file and copy files
MakeSNPidL your.bim your.tmp.bim
cp your.bed your.tmp.bed
cp your.fam your.tmp.fam

# Extract overlapped SNPs
plink --bfile your.tmp --extract MarkerID_11.txt --make-bed --out your_filename

Q:How to generate the files(.frq.strat file) required by the panel design module?

A:The relevant commands are as follows:

# Run iDIGs with file.tmp.bed(bim,fam)
# The cluster file is the first two columns of information for your fam file
plink --bfile your_filename --freq --allow-extra-chr --within cluster.txt --out upload

You can also click this specific example when you have problems processing files.

Tips : The principle is to use the user's SNPs that are the same as those in the database samples for panel design,so it is necessary to extract the repeated SNPs.

Contact

Any questions, bug reports and suggestions can be posted to E-mail: pan_idigs@163.com .

If you find the results useful, please also help spread the word by sharing about the iDIGs on Twitter, Reddit, or any other social media platform you regularly use.

Hints

This software is intended for academic use only and not for commercial use. We are not legally responsible for any unauthorized commercial application.