Proving the Superiority of Systematic Designs in On-Farm Trials

AAGI Annual Science Symposium

Zhanglong Cao (CBADA-CCDM, Curtin University)
Jordan Brown, Mark Gibberd, Julia Easton, Suman Rakshit

November 14, 2024

Puzzles

Life is like a puzzle; we piece together different parts to see the bigger picture. Working on OFE is no different. it’s like solving a giant puzzle, except sometimes you don’t know where to start, and sometimes the pieces are missing, and other times, they don’t fit at all! But when they do, it’s a masterpiece.

The first piece: GWR GWR Piece

For constructing the spatial map, we used Geographically Weighted Regression (GWR) compute the regression coefficients at regular grid of points covering the study region (Rakshit et al. (2020)).

  • Estimation based on the local-likelihood

  • Estimate local regression coefficients

source: Fotheringham, Brunsdon, and Charlton (2003)

The second piece: BHM BHM Piece

  • The Bayesian hierarchical model (BHM) is a powerful tool for spatial data analysis.

  • It is a flexible approach to model spatially correlated data.

  • The BHM is used to estimate the regression coefficients at all grid points simultaneously (Cao et al. (2022)).

  • The BHM is a complementary approach to GWR.

In GWR, the results crucially depends on the bandwidth of the selected kernel function. Although an appropriate bandwidth can be selected using spatial cross validation, it is computationally challenging for large data sets. To estimate the regression parameters for a query location, the neighbouring observations are given more weight than the distant ones in GWR. On the contrary, the proposed Bayesian approach uses all data in one go to produce estimates for all grid point, based on a spatial variance matrix defined for the entire field. The Bayesian inference is affected by the choice of priors and the likelihood. However, the influence of the prior reduces if the amount of data increases. The Bayesian approach in general is more flexible than GWR, as it can be easily extended and applied broadly to other applications.

The second piece: BHM BHM Piece

At location sisi, the model is written as

y(si)=∑m=1lbmxm(si)+∑j=1huj(si)zj(si)+e(si),ui∣θu∼(0,Vu(θu)),e(si)∣σe∼(0,σ2e).y(si)=∑m=1lbmxm(si)+∑j=1huj(si)zj(si)+e(si),ui∣θu∼N(0,Vu(θu)),e(si)∣σe∼N(0,σe2).

In matrix notation,

Y∼(Xb+Zu,e).Y∼N(Xb+Zu,e).

Spatially correlated random parameters

For all u={u1,…,un}u={u1,…,un}, the variance is

Σu=In⊗Vu.Σu=In⊗Vu.
Or
Σu=Vs⊗Vu.Σu=Vs⊗Vu.

VsVs can be AR1⊗AR1AR1⊗AR1 for regular grid data or Matérn covariance function for irregular grid data.

To incorporate spatial correlation amongst the model parameters in our Bayesian hierarchical modelling framework, we investigate here how the variance-covariance matrix of uu can be specified to represent the spatial correlation across all the grid points si,i=1,…,nsi,i=1,…,n. Note that, at location sisi, the covariance matrix of uiui is VuVu.

Σu=In⊗Vu.Σu=In⊗Vu.
If the correlation between grid points is characterised by a spatial variance-covariance matrix VsVs, the variance-covariance matrix of uu is given by
Σu=Vs⊗Vu,Σu=Vs⊗Vu,
where VsVs may be considered either a AR1×AR1AR1×AR1 spatial variance-covariance matrix or a weighted distance matrix.

Spatially correlated random parameters

Using this structure is because that only a single treatment is directly observed in any one position.

The spatial model allows the exploiting of information from neighbouring positions with other treatments (Piepho et al. (2011)).

Bayesian workflow

Bayesian Workflow (Gelman et al. (2020))

Results

Maps for β̂ 0β^0, β̂ 1β^1, and β̂ 2β^2.

Putting two pieces together

 

The third piece: trial designs Simu Piece

  • BHM to generate synthetic OFE data and we know the true values bb and uu.

  • GWR to fit the data and estimate coefficients β̂ β^.

  • Evaluation: Mean Squared Errors

    MSEj=1n∑i=1n(β̂ ij−(bi+ui))2MSEj=1n∑i=1n(β^ij−(bi+ui))2
    where j=0,1j=0,1 or j=0,1,2j=0,1,2 (Cao et al. (2024)).

Different designs

A randomised design

A systematic design.
  • 5 treatments, 4 replicates, 93 rows ×× 20 ranges

each replicate is 93 ×× 5

Other factors

  • Response types: linear and quadratic

    {LinearQuadraticyi=b0+u0i+(b1+u1i)Ni+eiyi=b0+u0i+(b1+u1i)Ni+(b2+u2i)N2i+ei,{Linearyi=b0+u0i+(b1+u1i)Ni+eiQuadraticyi=b0+u0i+(b1+u1i)Ni+(b2+u2i)Ni2+ei,

  • Variance-covariance of the random effects, Vs=IVs=I, Vs=AR1(ρc)⊗AR1(ρr)Vs=AR1(ρc)⊗AR1(ρr) and Matérn covariance

    Vs(d)=σ221−νΓ(ν)(2ν‾‾‾√dr)νKν(2ν‾‾‾√dr).Vs(d)=σ221−νΓ(ν)(2νdr)νKν(2νdr).

  • Bandwidth selection

  • Correlation intensity

Results

MSE of a linear response

MSE of a quadratic response

Putting three pieces together

Putting three pieces together

A new piece

The PE approach PE Piece

  • Pseudo-environments (PE) are created by grouping the grid points based on the spatial correlation structure (Stefanova et al. (2023)).

  • The PE approach is used to estimate the fixed effects.

  • The PE approach is for categorical variables.

The PE approach GWR Piece

Irregular grid

Regular grid

The fifth element

A new piece

The fifth element Simu2 Piece

  • BHM again to generate the synthetic data.

yijk=β0+βEnvi+βEnvi×varietyj+uEnvi×Repk+ϵijkyijk=β0+βEnvi+βEnvi×varietyj+uEnvi×Repk+ϵijk
with:

uEnvi×Repk∼(0,σ2Envi×Repk)uEnvi×Repk∼N(0,σEnvi×Repk2)
and:
ϵi∼(0,Σi)ϵi∼N(0,Σi)

The fifth element Simu2 Piece

A randomised design

A systematic design

The fifth element Simu2 Piece

Evaluation

The Mean Squared Error (MSE) for the fixed effects is calculated as:

MSEfixed=1n∑i=1n(β̂ i−βi)2MSEfixed=1n∑i=1n(β^i−βi)2
where β̂ iβ^i are the estimated fixed effects and βiβi are the true fixed effects.

The MSE for the random effects is calculated as:

MSErandom=1m∑j=1m(û j−uj)2MSErandom=1m∑j=1m(u^j−uj)2
where û ju^j are the estimated random effects and ujuj are the true random effects.

Preliminary results

MSE

Preliminary results

MSE for fixed effects and random effects across different designs over 2000 iterations.
Type Design Mean Median Min Max Q1 Q3
Fixed Effects Randomised 0.998 0.879 0.0605 5.02 0.542 1.33
Fixed Effects Systematic 0.952 0.835 0.0369 4.76 0.503 1.26
Random Effects Randomised 0.402 0.335 0.0102 1.19 0.140 0.637
Random Effects Systematic 0.397 0.327 0.0100 1.17 0.138 0.626

Results

  • The puzzle is incomplete without the fifth piece.
  • The puzzle is still incomplete with the fifth piece.

Puzzles

Conclusion

A systematic design is superior to a randomised design for OFE when

  • spatial variation presents

  • quadratic response is assumed

Additionally,

  • the statement still holds for categorical variables

  • PE-approach shows it’s robustness in estimating fixed effects

Limitations

  • The proposed Bayesian approach is computationally intensive

  • The choice of parameters in the model can affect the results

  • For LMM PE approach, more variables can be incorporated

References

Cao, Zhanglong, Jordan Brown, Mark Gibberd, Julia Easton, and Suman Rakshit. 2024. “Optimal Design for on-Farm Strip Trials—Systematic or Randomised?” Field Crops Research 318: 109594.
Cao, Zhanglong, Katia Stefanova, Mark Gibberd, and Suman Rakshit. 2022. “Bayesian Inference of Spatially Correlated Random Parameters for on-Farm Experiment.” Field Crops Research 281: 108477.
Fotheringham, A Stewart, Chris Brunsdon, and Martin Charlton. 2003. Geographically Weighted Regression: The Analysis of Spatially Varying Relationships. John Wiley & Sons.
Gelman, Andrew, Aki Vehtari, Daniel Simpson, Charles C Margossian, Bob Carpenter, Yuling Yao, Lauren Kennedy, Jonah Gabry, Paul-Christian Bürkner, and Martin Modrák. 2020. “Bayesian Workflow.” arXiv Preprint arXiv:2011.01808.
Piepho, Hans-Peter, Christel Richter, Joachim Spilke, Karin Hartung, and Arndt Kunick. 2011. “Statistical Aspects of on-Farm Experimentation.” Crop & Pasture Science 62: 721–35. https://doi.org/10.1071/cp11175.
Rakshit, Suman, Adrian Baddeley, Katia Stefanova, Karyn Reeves, Kefei Chen, Zhanglong Cao, Fiona Evans, and Mark Gibberd. 2020. “Novel Approach to the Analysis of Spatially-Varying Treatment Effects in on-Farm Experiments.” Field Crops Research 255 (October 2019): 107783. https://doi.org/gg2vv7.
Stefanova, Katia T, Jordan Brown, Andrew Grose, Zhanglong Cao, Kefei Chen, Mark Gibberd, and Suman Rakshit. 2023. “Statistical Analysis of Comparative Experiments Based on Large Strip on-Farm Trials.” Field Crops Research 297: 108945.

Thank you.

39 / 39
Thank you.

  1. Slides

  2. Tools

  3. Close
  • Proving the Superiority of Systematic Designs in On-Farm Trials
  • \[ \newcommand{\E}{\mathrm{E}}...
  • Puzzles
  • The first piece: GWR
  • The second piece: BHM
  • The second piece: BHM
  • Spatially correlated random parameters
  • Spatially correlated random parameters
  • Bayesian workflow
  • Slide 10
  • Results
  • Putting two pieces together
  • The third piece: trial designs
  • Different designs
  • Other factors
  • Slide 16
  • Results
  • Putting three pieces together
  • Putting three pieces together
  • The PE approach
  • The PE approach
  • The fifth element
  • The fifth element
  • The fifth element
  • The fifth element
  • Evaluation
  • Slide 27
  • Preliminary results
  • Preliminary results
  • Results
  • Puzzles
  • Conclusion
  • Limitations
  • Slide 34
  • Slide 35
  • Slide 36
  • Slide 37
  • References
  • Thank you.
  • f Fullscreen
  • s Speaker View
  • o Slide Overview
  • e PDF Export Mode
  • b Toggle Chalkboard
  • c Toggle Notes Canvas
  • d Download Drawings
  • ? Keyboard Help