Study design and sample collection
The idea behind design and sampling
- The samples you choose should be representative of the broader population you wish to study
- The variability between your groups should be due to your research question (and not some other thing)
Types of replicates
- Biological replicates
- Technical replicates
Sources of variation in samples
Experimental (due to our question):
Non-experimental (due to things other than our question)
- Personnel
- Collection materials
- Storage (e.g., freeze-thaw, age)
- Instrument issues
- Run order effects
Ideally, measurement variability <<< sample variability
What can we do to minimize non-experimental variation?
- Consistency!
- Choose appropriate controls
- Ensure sample groups are handled in the same way
- Randomize extraction order
- Randomize run order
- Run in one batch
Quality control (QC) samples
- How do we know if our data looks ‘okay’ if we don’t know what it’s supposed to look like?
- Pooled QCs, commercial QCs, synthetic QCs
QC samples should cluster together
![]()
Fig. adapted from Broadhurst et al., Metabolomics 2018
QC adjustment
![]()
Fig. from Dunn et al., Nature Protocols 2011
Timing of collection
![]()
Fig. from Sato et al., Molecular Metabolism 2018
Collection variables to consider
- Tubes
- Storage
- Freeze-thaw
- Quenching
Standarizing extraction amoung
Compare samples by:
- Mass
- Volume
- Number of cells
- Protein content
- Some other thing?