R package:blupADC-Feature 2
Table of Contents
Overview
🤖 Generally, most genotype data need to perform quality control and imputation before applying in animal and plant breeding. For convenience, package blupADC provides genotype_data_QC_Imputation function to perform quality control and imputation by interfacing with software Plink and software Beagle in an easy way (we only need to provide the software path and software name).
👉 Note: For convenience, blupADC has encapsulated software Plink(for quality control) version-1.9 and software Beagle(for imputation) version-5.2. If you want change this version, you should set the related parameters in the part of Advanced parameter.
Example
library(blupADC)
geno_qc_impute(
input_data_hmp=example_data_hmp, #provided hapmap data object
data_analysis_method="QC_Imputation", #analysis method type,QC + imputatoin
output_data_path=getwd(), #output data path
output_data_name="YY_data", #output data name
output_data_type="VCF" #output data format
)
In the process of quality control and imputation,we should provide genotype data,these parameters are the as in geno_format function(see more details).
Parameter
💙Basic
1:data_analysis_method
Method of analyzing data,
characterclass."QC": only perform quality control"Imputation": only perform imputation
"QC_Imputation": first perform quality control, and then perform imputation2:qc_snp_rate
Threshold of SNPs call rate in quality control, numeric class. Default is 0.1
- 3:qc_ind_rate
Threshold of individuals call rate in quality control, numeric class. Default is 0.1
- 4:qc_maf
Threshold of minor allele frequency(MAF) in quality control, numeric class. Default is 0.05
- 5:qc_hwe
Threshold of hardy weinberg equilibrium(HWE) in quality control, numeric class. Default is 1e-7
💜Advanced
- 6:plink_software_path
Path of software Plink , character class.
- 7:plink_software_name
Name of software Plink , character class.
- 8:beagle_software_path
Path of software Beagle , character class.
- 9:beagle_software_name
Name of software Beagle , character class.
- 10:beagle_ref_data_path
File path of reference data in imputation, character class.
- 11:beagle_ref_data_name
File name of reference data in imputation, character class.
- 12:beagle_ped_path
File path of pedigree data in imputation, character class.
- 13:beagle_ped_name
File name of pedigree data in imputation, character class.