Internal functions

Documentation for JWAS.jl's internal (private) interface, which are not available to general users. These internal functions are small blocks that public function build on.

<!–-

Index

JWAS.Packed2BitBackend
DataAPI.describe
JWAS.GWAS
JWAS.GWAS
JWAS.PedModule.get_info
JWAS.PedModule.get_pedigree
JWAS.SSBRrun
JWAS.add_genotypes
JWAS.add_term
JWAS.annotated_bayesr_mt_pattern
JWAS.annotation_binary_bounds!
JWAS.bayesr_nested_step_indicators
JWAS.block_rhs!
JWAS.build_model
JWAS.center!
JWAS.check_marker_memory_guard!
JWAS.decode_marker!
JWAS.estimate_marker_memory
JWAS.finalize_marker_annotation_setup!
JWAS.format_bytes_human
JWAS.getEBV
JWAS.getMCMCinfo
JWAS.getMME
JWAS.get_column_ref
JWAS.get_genotypes
JWAS.gibbs_update_bayesc_binary_annotation_coefficients!
JWAS.gibbs_update_binary_probit_annotation_coefficients!
JWAS.is_unit_weights
JWAS.load_streaming_backend
JWAS.make_incidence_matrices
JWAS.mkDict
JWAS.mkmat_incidence_factor
JWAS.outputEBV
JWAS.outputMCMCsamples
JWAS.output_MCMC_samples
JWAS.output_MCMC_samples_setup
JWAS.output_location_parameters_samples
JWAS.prediction_setup
JWAS.prepare_streaming_genotypes
JWAS.runMCMC
JWAS.sample_annotation_effect_variance
JWAS.sample_binary_annotation_liabilities!
JWAS.sample_nested_annotation_probit_step!
JWAS.set_covariate
JWAS.set_random
JWAS.set_random
JWAS.solve
JWAS.streaming_mul_alpha!
JWAS.transubstrarr
JWAS.update_bayesc_binary_bounds!

Internal interface

DataAPI.describe — Method

describe(model::MME)

Print out model information.

source

JWAS.GWAS — Method

GWAS(model,map_file,marker_effects_file...;
     window_size = "1 Mb",sliding_window = false,
     GWAS = true, threshold = 0.001,
     genetic_correlation = false,
     header = true)

run genomic window-based GWAS

MCMC samples of marker effects are stored in markereffectsfile with delimiter ','.
model is either the model::MME used in analysis or the genotype cavariate matrix M::Array
map_file has the (sorted) marker position information with delimiter ','. If the map file is not provided, i.e., map_file=false, a fake map file will be generated with window_size markers in each 1 Mb window, and each 1 Mb window will be tested.
If two markereffectsfile are provided, and genetic_correlation = true, genomic correlation for each window is calculated.
Statistics are computed for nonoverlapping windows of size window_size by default. If sliding_window = true, those for overlapping sliding windows are calculated.
map file format:

markerID,chromosome,position
m1,1,16977
m2,1,434311
m3,1,1025513
m4,2,70350
m5,2,101135

source

JWAS.GWAS — Method

GWAS(marker_effects_file;header=true)

Compute the model frequency for each marker (the probability the marker is included in the model) using samples of marker effects stored in markereffectsfile.

source

JWAS.SSBRrun — Function

(internal) Incomplete Genomic Data (Single-Step)

reorder in A (pedigree) as ids for genotyped then non-genotyped inds
impute genotypes for non-genotyped individuals
add ϵ (imputation errors) and J as variables in data for non-genotyped inds

source

JWAS.add_genotypes — Function

DEPRECATED!! Please use get_genotypes()

add_genotypes(mme::MME,M::Union{AbstractString,Array{Float64,2},Array{Float32,2},Array{Any,2},DataFrames.DataFrame},G=false;
              header=false,rowID=false,separator=',',
              center=true,G_is_marker_variance=false,df=4)

Get marker informtion from a genotype file or an nxp Matrix M of genotypes (Array or DataFrame), where n is the number of individuals and p is the number of markers. This file/matrix needs to be column-wise sorted by marker positions.
G is the mean for the prior assigned for the genomic variance with degree of freedom df, defaulting to 4.0. If G is not provided, a value is calculated from responses (phenotypes)

If a text file is provided, the file format should be:

Animal,marker1,marker2,marker3,marker4,marker5
S1,1,0,1,1,1
D1,2,0,2,2,1
O1,1,2,0,1,0
O3,0,0,2,1,1

If an nxp Matrix of genotypes (Array or DataFrame) is provided, where n is the number of individuals and p is the number of markers,
- This matrix needs to be column-wise sorted by marker positions.
- rowID is a vector of individual IDs, e.g.,rowID=["a1","b2","c1"]; if it is omitted, IDs will be set to 1:n
- header is a header vector such as ["id"; "mrk1"; "mrk2";...;"mrkp"]. If omitted, marker names will be set to 1:p

source

JWAS.add_term — Method

add to model an extra term: imputation_residual

source

JWAS.annotated_bayesr_mt_pattern — Method

annotated_bayesr_mt_pattern(state)

Map a 2-trait BayesR class state back to the BayesC-style active pattern: class 1 becomes BayesC-style 0, and classes 2:4 become BayesC-style 1.

source

JWAS.annotation_binary_bounds! — Method

annotation_binary_bounds!(lower, upper, response)

Set the truncation bounds for a binary probit step.

For binary indicator z_i ∈ {0, 1}, introduce latent liability l_i such that

z_i = 1(l_i > 0).

Then the full conditional is

l_i | z_i, μ_i ~ N(μ_i, s^2) truncated to:

(-Inf, 0] when z_i = 0
[0, Inf) when z_i = 1

source

JWAS.bayesr_nested_step_indicators — Method

bayesr_nested_step_indicators(delta)

Build the three nested BayesR step-up indicators:

z1_j = 1(δ_j > 1)
z2_j = 1(δ_j > 2)
z3_j = 1(δ_j > 3)

and the corresponding active subsets used by the conditional probit updates.

source

JWAS.block_rhs! — Method

block_rhs!(rhs, Xblock, y, Rinv, unit_weights)

Compute block RHS Xblock' * Diagonal(Rinv) * y in-place without storing Xblock' * Diagonal(Rinv) as a persistent matrix.

source

JWAS.build_model — Function

build_model(model_equations::AbstractString,R=false; df::AbstractFloat=4.0, estimate_variance=true)

Build a model from model equations with the residual variance R. In Bayesian analysis, R is the mean for the prior assigned for the residual variance with degree of freedom df, defaulting to 4.0. If R is not provided, a value is calculated from responses (phenotypes).
By default, all variabels in modelequations are factors (categorical) and fixed. Set variables to be covariates (continuous) or random using functions `setcovariate()orset_random()`.
The argument estimate_variance indicates whether to estimate the residual variance; estimate_variance=true is the default.

#single-trait
model_equations = "BW = intercept + age + sex"
R               = 6.72
models          = build_model(model_equations,R);

#multi-trait
model_equations = "BW = intercept + age + sex
                   CW = intercept + litter";
R               = [6.72   24.84
                   24.84  708.41]
models          = build_model(model_equations,R);

source

JWAS.center! — Method

This function centers columns of the input matrix X by subtracting their means along each column. The function operates in-place by modifying the original matrix X.

Input:
- X::AbstractMatrix: a matrix to be centered

Output:
- col_means::Vector: a vector of mean values for each column in the original matrix, computed before centering.

source

JWAS.check_marker_memory_guard! — Method

check_marker_memory_guard!(; mode, ratio, estimated_bytes, total_memory_bytes, context_string)

Apply guard policy for estimated marker memory usage.

source

JWAS.decode_marker! — Method

decode_marker!(dest, backend, marker_index)

Decode one marker into dest with centering and missing-value mean-imputation.

source

JWAS.estimate_marker_memory — Method

estimate_marker_memory(nObs, nMarkers;
                       element_bytes,
                       has_nonunit_weights=false,
                       block_starts=false,
                       storage_mode=:dense)

Estimate major marker-path memory components in bytes.

source

JWAS.finalize_marker_annotation_setup! — Method

finalize_marker_annotation_setup!(genotypei)

Finalize any annotation state that depends on the model rather than the raw genotype input.

get_genotypes handles annotation validation and stores the raw design information. This finalizer is called from build_model after genotypei.ntraits is known. At that point JWAS can choose the method- and trait-specific annotation structure:

single-trait BayesC uses one binary inclusion step
multi-trait BayesC uses a 3-step tree over 00, 10, 01, and 11 states
multi-trait BayesR uses a 7-step tree over active patterns and BayesR magnitude classes

Single-trait BayesR already has its ordered 4-class annotation structure at genotype loading time, so this finalizer only rebuilds BayesR annotations for the multi-trait case.

source

JWAS.format_bytes_human — Method

format_bytes_human(bytes)

Format byte counts into a compact human-readable string.

source

JWAS.getEBV — Method

getEBV(model::MME,traiti)

(internal function) Get breeding values for individuals defined by outputEBV(), defaulting to all genotyped individuals. This function is used inside MCMC functions for one MCMC samples from posterior distributions. e.g., non-NNBayespartial (multi-classs Bayes) : y1=M1α1[1]+M2α2[1]+M3α3[1] y2=M1α1[2]+M2α2[2]+M3α3[2]; NNBayespartial: y1=M1α1[1] y2=M2α2[1] y3=M3*α3[1];

source

JWAS.getMCMCinfo — Method

getMCMCinfo(model::MME)

(internal function) Print out MCMC information.

source

JWAS.getMME — Method

Construct mixed model equations with

incidence matrix: X ; response : ySparse; left-hand side : mmeLhs ; right-hand side : mmeLhs ;

source

JWAS.get_column_ref — Method

get_column_ref(X::Vector{T})

To obtain a vector of views (alias/pointer) for each column of the input matrix WITHOUT COPYING the underlying data. 
input:  a matrix X
output: a vector containing views of each column of the input matrix X

source

JWAS.get_genotypes — Function

get_genotypes(file::Union{AbstractString,Array{Float64,2},Array{Float32,2},Array{Int64,2}, Array{Int32,2}, Array{Any,2}, DataFrames.DataFrame}, G = false;
              ## method:
              method = "BayesC",Pi = 0.0,estimatePi = true, 
              ## variance:
              G_is_marker_variance = false, df = 4.0,
              estimate_variance=true, estimate_scale=false,
              constraint = false, #for multi-trait only, constraint=true means no genetic covariance among traits
              ## format:
              separator=',',header=true,
              ## quality control:
              quality_control=true, MAF = 0.01, missing_value = 9.0,
              ## others:
              center=true,starting_value=false,
              annotations=false,
              storage=:dense)

Get marker informtion from a genotype file/matrix. This file needs to be column-wise sorted by marker positions.
If a text file is provided, the file format should be:

Animal,marker1,marker2,marker3,marker4,marker5
S1,1,0,1,1,1
D1,2,0,2,2,1
O1,1,2,0,1,0
O3,0,0,2,1,1

If a DataFrame is provided, where n is the number of individuals and p is the number of markers,
- This matrix needs to be column-wise sorted by marker positions.
- The first column in the DataFrame should be individual IDs
- The marker IDs can be provided as the header of the DataFrame. If omitted, marker IDs will be set to 1,2,3...
If an nxp Matrix of genotypes (Array) is provided, where n is the number of individuals and p is the number of markers,
- This matrix needs to be column-wise sorted by marker positions.
- Individual IDs will be set to 1:n;
- Marker IDs will be set to 1:p
If quality_control=true, defaulting to true,
- Missing genotypes should be denoted as 9, and will be replaced by column means. Users can also impute missing genotypes before the analysis.
- Minor allele frequency MAF threshold, defaulting to 0.01, is uesd, and fixed loci are removed.
G is the mean for the prior assigned for the genomic variance with degree of freedom df, defaulting to 4.0. If G is not provided, a value is calculated from responses (phenotypes).
Available methods include "conventional (no markers)", "RR-BLUP", "BayesA", "BayesB", "BayesC", "Bayesian Lasso", and "GBLUP".
In Bayesian variable selection methods, Pi for single-trait analyses is a number; Pi for multi-trait analyses is a dictionary such as Pi=Dict([1.0; 1.0]=>0.7,[1.0; 0.0]=>0.1,[0.0; 1.0]=>0.1,[0.0; 0.0]=>0.1), defaulting to all markers have effects (Pi = 0.0) in single-trait analysis and all markers have effects on all traits (Pi=Dict([1.0; 1.0]=>1.0,[0.0; 0.0]=>0.0)) in multi-trait analysis. Pi is estimated if estimatePi = true, , defaulting to false.
Scale parameter for prior of marker effect variance is estimated if estimate_scale = true, defaulting to false.
annotations enables annotation-aware BayesC/BayesR. Supply a numeric matrix with one row per raw marker; JWAS prepends an intercept column internally after marker QC/filtering. get_genotypes validates and stores this raw annotation information. If a method's annotation prior depends on the eventual number of traits (for example annotated multi-trait BayesC), JWAS finalizes that method-specific internal state later in build_model().
multi_trait_sampler controls how multi-trait BayesC chooses between Gibbs sampler I and II: :I is the default, :auto preserves the existing support-based dispatch, and :II forces joint-state updates.
storage=:dense (default) keeps the existing in-memory dense loading behavior. storage=:stream loads an opt-in packed backend prepared by prepare_streaming_genotypes.

source

JWAS.gibbs_update_bayesc_binary_annotation_coefficients! — Method

gibbs_update_bayesc_binary_annotation_coefficients!(ann)

BayesC binary annotation coefficient update.

This uses the same coordinate update as the binary annotated BayesR steps: the intercept has a flat prior, while annotation slopes are shrunken by ann.variance.

source

JWAS.gibbs_update_binary_probit_annotation_coefficients! — Method

gibbs_update_binary_probit_annotation_coefficients!(coeffs, X, latent_residual, coef_prior_var)

Coordinate Gibbs update for one binary probit annotation submodel.

Write the latent regression as

l = Xα + ε, with ε ~ N(0, I).

After sampling the latent liabilities, we work with the residual

r = l - Xα.

Then each coefficient is updated by a standard scalar Gibbs step:

the intercept uses a flat prior
slopes use α_k ~ N(0, σ^2_α)

source

JWAS.is_unit_weights — Method

is_unit_weights(Rinv)

Return true if all residual weights are exactly one.

source

JWAS.load_streaming_backend — Method

load_streaming_backend(path::AbstractString)

Load a packed genotype backend from a prefix (or .jgb2 / .meta path).

source

JWAS.make_incidence_matrices — Method

make_incidence_matrices(mme,df_whole,train_index)

(internal function) Make incidence matrices for effects involved in

calculation of EBV except marker covariates.

Both incidence matrices for non-missing observations (used in mixed model equations)

and individuals of interest (output IDs) are obtained.

source

JWAS.mkDict — Method

mkDict(a::Vector{T}) where T <: Any

Get column index in the incidence matrix for each level of a factor (categorical variable) 
input:  a=["a1","a4","a1","a2"] 
output: d=Dict("a2" => 3, "a1" => 1, "a4" => 2), level_names=["a1","a4","a2"]

note: enumerate(level_names) gives a list of tuples (index, element), reverse() to reverse (index,element) to (element,index)

source

JWAS.mkmat_incidence_factor — Method

mkmatincidencefactor(yID::Vector, uID::Vector) create an incidence matrix Z to reorder uID to yID by yID = ZuID. input: - yID: a vector containing the desired order of IDs - uID: a vector containing the original order of IDs output: - Z: a sparse matrix representing the incidence relationship between yID and uID (yID = ZuID)

source

JWAS.outputEBV — Method

outputEBV(model,IDs::Array)

Output estimated breeding values and prediction error variances for IDs.

source

JWAS.outputMCMCsamples — Method

outputMCMCsamples(mme::MME,trmStrs::AbstractString...)

Get MCMC samples for specific location parameters.

source

JWAS.output_MCMC_samples — Function

output_MCMC_samples(mme,vRes,G0,outfile=false)

(internal function) Save MCMC samples every outputsamplesfrequency iterations to the text file.

source

JWAS.output_MCMC_samples_setup — Function

output_MCMC_samples_setup(mme,nIter,output_samples_frequency,file_name="MCMC_samples")

(internal function) Set up text files to save MCMC samples.

source

JWAS.output_location_parameters_samples — Method

output_location_parameters_samples(mme::MME,sol,outfile)

(internal function) Save MCMC samples for location parameers

source

JWAS.prediction_setup — Method

prediction_setup(mme::MME)

(internal function) Create incidence matrices for individuals of interest based on a usere-defined

prediction equation, defaulting to genetic values including effects defined with genomic and pedigre information. For now, genomic data is always included.

J and ϵ are always included in single-step analysis (added in SSBR.jl)

source

JWAS.prepare_streaming_genotypes — Method

prepare_streaming_genotypes(file::AbstractString;
                            output_prefix=nothing,
                            separator=',',
                            header=true,
                            missing_value=9.0,
                            quality_control=true,
                            MAF=0.01,
                            center=true,
                            conversion_mode=:lowmem,
                            auto_dense_max_bytes=2^30,
                            tmpdir=nothing,
                            cleanup_temp=true,
                            disk_guard_ratio=0.9)

Convert a dense text genotype file to a marker-major 2-bit packed backend.

Conversion backend selection:

conversion_mode=:lowmem uses out-of-core staged conversion (disk-backed).
conversion_mode=:dense uses in-memory conversion.
conversion_mode=:auto chooses between the two based on auto_dense_max_bytes.

Low-memory conversion options:

tmpdir: optional location for temporary conversion files.
cleanup_temp: remove temporary files after successful conversion.
disk_guard_ratio: fail fast when estimated required bytes exceed disk_guard_ratio * available_bytes.

source

JWAS.runMCMC — Method

runMCMC(model::MME,df::DataFrame;
        #Data
        heterogeneous_residuals           = false,
        #MCMC
        chain_length::Integer             = 100,
        starting_value                    = false,
        burnin::Integer                   = 0,
        output_samples_frequency::Integer = chain_length>1000 ? div(chain_length,1000) : 1,
        update_priors_frequency::Integer  = 0,
        #Methods
        single_step_analysis            = false, #parameters for single-step analysis
        pedigree                        = false, #parameters for single-step analysis
        fitting_J_vector                = true,  #parameters for single-step analysis
        categorical_trait               = false,
        censored_trait                  = false,
        causal_structure                = false,
        mega_trait                      = false,
        missing_phenotypes              = true,
        constraint                      = false,
        #Genomic Prediction
        outputEBV                       = true,
        output_heritability             = true,
        prediction_equation             = false,
        #MISC
        seed                            = false,
        printout_model_info             = true,
        printout_frequency              = chain_length+1,
        big_memory                      = false,
        double_precision                = false,
        fast_blocks                     = false,
        independent_blocks              = false,
        memory_guard                    = :error,
        memory_guard_ratio              = 0.80,
        ##MCMC samples (defaut to marker effects and hyperparametes (variance components))
        output_folder                     = "results",
        output_marker_effect_samples      = true,
        output_samples_for_all_parameters = false,
        ##for deprecated JWAS
        methods                         = "conventional (no markers)",
        Pi                              = 0.0,
        estimatePi                      = false)

Run MCMC for Bayesian Linear Mixed Models with or without estimation of variance components.

Markov chain Monte Carlo
- The first burnin iterations are discarded at the beginning of a MCMC chain of length chain_length.
- Save MCMC samples every output_samples_frequency iterations, defaulting to chain_length/1000, to a folder output_folder, defaulting to results. MCMC samples for hyperparametes (variance componets) and marker effects are saved by default. Set output_marker_effect_samples=false to skip writing the large marker-effect sample files while keeping final marker-effect summaries and smaller MCMC sample files such as marker variances and pi. MCMC samples for location parametes can be saved using function output_MCMC_samples(). Note that saving MCMC samples too frequently slows down the computation.
- The starting_value can be provided as a vector for all location parameteres and marker effects, defaulting to 0.0s. The order of starting values for location parameters and marker effects should be the order of location parameters in the Mixed Model Equation for all traits (This can be obtained by getNames(model)) and then markers for all traits (all markers for trait 1 then all markers for trait 2...).
- Miscellaneous Options
  - Priors are updated every update_priors_frequency iterations, defaulting to 0.
Methods
- Single step analysis is allowed if single_step_analysis = true and pedigree is provided.
- Miscellaneous Options
  - Missing phenotypes are allowed in multi-trait analysis with missing_phenotypes=true, defaulting to true.
  - Catogorical Traits are allowed if categorical_trait=true, defaulting to false. Phenotypes should be coded as 1,2,3...
  - Censored traits are allowed if the upper bounds are provided in censored_trait as an array, and lower bounds are provided as phenotypes.
  - If constraint=true, defaulting to false, constrain residual covariances between traits to be zeros.
  - If causal_structure is provided, e.g., causal_structure = [0.0 0.0 0.0;1.0 0.0 0.0;1.0 0.0 0.0] for trait 1 -> trait 2 and trait 1 -> trait 3 (column index affacts row index, and a lower triangular matrix is required), phenotypic causal networks will be incorporated using structure equation models.
Genomic Prediction
- Predicted values for individuals of interest can be obtained based on a user-defined prediction equation prediction_equation, e.g., "y1:animal + y1:age".
For now, genomic data is always included. Genetic values including effects defined with genotype and pedigree information are returned if prediction_equation= false, defaulting to false.
- Individual estimted genetic values and prediction error variances (PEVs) are returned if outputEBV=true, defaulting to true. Heritability and genetic
variances are returned if output_heritability=true, defaulting to true. Note that estimation of heritability is computaionally intensive.
Miscellaneous Options
- Print out the model information in REPL if printout_model_info=true; print out the monte carlo mean in REPL with printout_frequency, defaulting to false.
- If seed, defaulting to false, is provided, a reproducible sequence of numbers will be generated for random number generation.
- If big_memory=true, defaulting to false, a machine with lots of memory is assumed which may speed up the analysis.
- fast_blocks enables block marker updates. It can be false, true, a numeric block size, or a vector of explicit marker block starts such as [1, 501, 1201].
- independent_blocks=false keeps the exact sequential fast-block sweep. Set independent_blocks=true only when fast_blocks != false and you explicitly want the approximate mode that updates blocks from a sweep-level residual snapshot before reconciling all block changes.
- memory_guard controls the marker-memory precheck before MCMC (:error, :warn, :off; default :error).
- memory_guard_ratio sets the allowed fraction of Sys.total_memory() for the precheck (default 0.80).

source

JWAS.sample_annotation_effect_variance — Method

sample_annotation_effect_variance(coeffs)

Update the slope variance for one annotation step using the same scaled inverse-chi-square form as Jian's sbayesrc.R:

σ^2_α = (Σ_{k>1} α_k^2 + 2) / χ^2_{p+1}

where p is the total number of annotation coefficients including the intercept.

source

JWAS.sample_binary_annotation_liabilities! — Method

sample_binary_annotation_liabilities!(liability, mu, lower, upper, response; latent_sd)

Sample the latent liabilities for one binary probit step after the truncation bounds have been determined by annotation_binary_bounds!.

This is the common latent-variable update used by:

annotated BayesC for the single inclusion step
annotated BayesR for each of the nested step-up indicators

source

JWAS.sample_nested_annotation_probit_step! — Method

sample_nested_annotation_probit_step!(ann, step, response, active)

Sample one conditional BayesR annotation step.

For step-specific conditional probabilities (p1, p2, p3), BayesR reconstructs the 4-class per-marker prior as

π_{j1} = 1 - p1_j
π_{j2} = p1_j (1 - p2_j)
π_{j3} = p1_j p2_j (1 - p3_j)
π_{j4} = p1_j p2_j p3_j

This helper updates one binary probit submodel that contributes to p_step.

source

JWAS.set_covariate — Method

set_covariate(model::MME,variables::AbstractString...)

set variables as covariates; model is the output of function build_model().

#After running build_model, variabels age and year can be set to be covariates as
set_covariate(model,"age","year")
#or
set_covariate(model,"age year")

source

JWAS.set_random — Function

set_random(mme::MME,randomStr::AbstractString,ped::Pedigree, G;df=4)

set variables as random polygenic effects with pedigree information ped. and variances G.
G is the mean for the prior assigned for the variance with degree of freedom df, defaulting to 4.0. If G is not provided, a value is calculated from responses (phenotypes).

#single-trait (example 1)
model_equation  = "y = intercept + age + animal"
model           = build_model(model_equation,R)
ped             = get_pedigree(pedfile)
G               = 1.6
set_random(model,"animal", ped, G)

#single-trait (example 2)
model_equation  = "y = intercept + age + animal + animal*age"
model           = build_model(model_equation,R)
ped             = get_pedigree(pedfile)
G               = [1.6   0.2
                   0.2  1.0]
set_random(model,"animal animal*age", ped,G)

#multi-trait
model_equations = "BW = intercept + age + sex + animal
                   CW = intercept + age + sex + animal"
model           = build_model(model_equations,R);
ped             = get_pedigree(pedfile);
G               = [6.72   2.84
                   2.84  8.41]
set_random(model,"animal",ped,G)

source

JWAS.set_random — Function

set_random(mme::MME,randomStr::AbstractString,G;Vinv=0,names=[],df=4)

set variables as random effects, defaulting to i.i.d effects, with variances G.
G is the mean for the prior assigned for the variance with degree of freedom df, defaulting to 4.0. If G is not provided, a value is calculated from responses (phenotypes).
the random effects are assumed to be i.i.d by default and it can be defined with any (inverse of) covariance structure Vinv with its index (row names) provided by names.

#single-trait (i.i.d randome effects)
model_equation  = "y = intercept + litter + sex"
model           = build_model(model_equation,R)
G               = 0.6
set_random(model,"litter",G)

#multi-trait (i.i.d randome effects)
model_equations = "BW = intercept + litter + sex
                   CW = intercept + litter + sex"
model           = build_model(model_equations,R);
G               = [3.72  1.84
                   1.84  3.41]
set_random(model,"litter",G)

#single-trait (randome effects with specific covariance structures)
model_equation  = "y = intercept + litter + sex"
model           = build_model(model_equation,R)
V               = [1.0  0.5 0.25
                   0.5  1.0 0.5
                   0.25 0.5 1.0]
G               = 0.6
set_random(model,"litter",G,Vinv=inv(V),names=[a1;a2;a3])

source

JWAS.solve — Method

solve(mme::MME,df::DataFrame;solver="default",printout_frequency=100,tolerance = 0.000001,maxiter = 5000)

Solve the mixed model equations (no marker information) without estimating variance components.

Available solvers include default, Jacobi, Gauss-Seidel, and Gibbs sampler.

source

JWAS.streaming_mul_alpha! — Method

streaming_mul_alpha!(out, backend, α)

Compute out = X*α from a streaming backend without materializing dense X.

source

JWAS.transubstrarr — Method

transubstrarr(vec)

(internal function) Transpose a column vector of strings (vec' doesn't work here)

source

JWAS.update_bayesc_binary_bounds! — Method

update_bayesc_binary_bounds!(Mi)

Annotated BayesC is the binary inclusion special case. The thresholds remain the current BayesC convention, and this refactor preserves that behavior.

source

JWAS.Packed2BitBackend — Type

Packed2BitBackend

Streaming backend for marker-major 2-bit packed genotypes used by storage=:stream in get_genotypes.

source

JWAS.PedModule.get_info — Method

get_info(pedigree::Pedigree;Ai=false)

Print summary informtion from a pedigree object including number of individulas, sires. dams and founders. Return individual IDs, inverse of numerator relationship matrix, and inbreeding coefficients if Ai=true.

source

JWAS.PedModule.get_pedigree — Method

get_pedigree(pedfile::AbstractString;header=false,separator=',',missingstring=["0"])

Get pedigree informtion from a pedigree file with header (defaulting to false) , separator (defaulting to ,) and missing values (defaulting to ["0"])
Pedigree file format:

a,0,0
c,a,b
d,a,c

source

–>