Biolayne How To Read Research
Biolayne How To Read Research
Independent Variables
Independent variables are what is being manipulated by
the researcher to determine the relationship or affect
it has on another variable. Independent variables are
also known as the experimental or treatment variable,
input, cause or stimulus. For example, an independent
variable could be the type of diet subjects are following
(i.e. high carb, high fat, low carb, etc.). Independent
variables can also have different levels. For example,
if a training study is evaluating high, moderate and
low training volume and muscle hypertrophy, training
volume would be the independent variable with the
different levels being high, moderate and low.
                                                             effects they may have on the relationship between the
Dependent Variables                                          independent and dependent variables 2. The caloric
Dependent variables are measured following a                 intake in a diet study could be viewed as a control
treatment or stimulus. Dependent variables are               variable when comparing two different types of diets.
known as the output or response variable and they are
observed or measured to determine the effect of the          Extraneous Variables
independent variable 2. The dependent variable changes       Extraneous variables are factors that can influence the
as a result of the manipulation of the independent           relationship between the independent and dependent
variable. Examples of dependent variables are body           variables, but it is not identified or controlled in the
composition, strength, resting metabolic rate, blood         study 3. This can cause spurious associations between
hormones, etc. If a study is investigating high fat vs.      variables. There may be an association between the
low fat diet and weight loss, weight loss would be           independent and dependent variables but could be due
considered the dependent variable while the type of          to both variables being affected by a third unknown or
diet would be considered the independent variable.           uncontrolled variable (extraneous). For example, let’s
                                                             assume a study is examining differences in weight loss
Control Variables                                            when following a high carb/low fat diet or a high fat/low
Control variables are factors that could influence the       carb diet and let’s say they don’t equate calories. By not
results and are left out of the study 3. Control variables   having any control over caloric intake that could be an
are not a part of a study and instead controlled by the      extraneous variable because it can impact the changes
researcher to cancel out or neutralize any potential         between groups irrespective of the type of diet.
How To Read Research: A Biolayne Guide                                                                               4
Extraneous variables are usually identified following an     while measuring fat loss or muscle growth would be
experiment when associations between variables have          considered an applied form of research because they
been identified and examined further. They can also          are performed in real world settings with limited control
be identified by researchers during the study design,        over the environment.
but because of lack of resources researchers may be
unable to control or account for a specific variable.        Quantitative Research
Other variables known as confounding variables and           Quantitative research is the most common type
covariates are similar to extraneous variables and           of research you will find in exercise and nutrition
often used synonymously, but slightly different. Just        science. Quantitative research is concerned with
know that extraneous variables, confounding variables        numbers and groups, the aim is to determine the
and covariates are additional, unknown variables
                                                             relationship between variables 4. The relationships
that weren’t identified or controlled in the study and
                                                             between variables are expressed through statistical
have some type of impact on the independent and
                                                             analysis (we’ll cover later). This type of research is
dependent variables.
                                                             objective and tightly follows the scientific method and
                                                             seeks to determine a cause and effect. Studies that
                                                             are classified as quantitative research can be further
                                                             classified into two different study types known as
Types of Research                                            experimental and descriptive (observational).
There are many different types of research to                Experimental - Experimental research involves the
answer different kinds of questions and problems.            manipulation of treatments or interventions. The
The different types and categories of research are           aim of experimental research is to establish cause-
limitless, we will discuss the common types that             and-effect relationships and commonly utilizes
are generally incorporated into exercise and sports          some form of randomization (discussed below) 3.
science research.                                            Experimental studies require diligent control over
                                                             variables and other factors that may impact the
Basic vs. Applied                                            outcomes of a study. Experimental studies are also
Research in exercise and nutrition science can be            known as longitudinal or repeated-measure studies 4.
placed somewhere on a spectrum between basic and             Experimental studies measure subjects before and
applied research 3. Basic research is commonly referred      following treatments or interventions. This type
to as “bench science”. Basic research is difficult and is    of research aims to explain phenomena through
generally done in a laboratory under tightly controlled      controlled manipulation of variables, commonly
conditions. Basic research operates under scientific         viewed as the ‘gold-standard’ for research.
theories and often involves animals, but the relevance
or direct value to practitioners is limited 3. You can       Descriptive - Descriptive research is also known as
think of this type of research as a scientist in a lab       observational research and measures things as they are
with pipettes and cell cultures, studying underlying         without intervening 4. There is no attempt to change or
molecular mechanisms. In contrast, applied research          modify certain behaviors. This type of research doesn’t
is limited in the type of control it offers, but it’s much   attempt to determine cause and effect (although
more practical and carries high ecological validity.         many media outlets and even researchers are guilty
Meaning, it applies to real-world settings/conditions.       of attempting to infer causation from these results)
This type of research involves human subjects and            and instead characterizes phenomena as it exists.
is based on common practice and experiences.                 This type of research is less controlled and utilizes
Comparing different diet and training programs               questionnaires, interviews and observation.
How To Read Research: A Biolayne Guide                                                                               5
Qualitative Research
Qualitative research is concerned with words and
                                                            Study Designs
individuals. Qualitative research is more subjective
and seeks understanding of multiple realities/truths        Animal models
and requires constant comparison and revision.              Animal model research commonly includes rats or
Qualitative research rarely develops hypotheses prior       mice as subjects to perform more intensive and
to the study and instead uses more general questions        controlled experiments. Other species are included
to guide the study 3. Qualitative research has been         in various types of research and many debate the
growing interest in the field of exercise science and       ethical considerations associated with this design.
is now being included more frequently. This type of         Nevertheless, humans share many anatomical and
research has been historically used in social sciences      physiological similarities with different animals, which
like psychology, sociology, and anthropology 5. This type   allows investigation into underlying mechanisms.
of research is concerned with behaviors like attitude,      Animal models allow for testing of novel therapies
beliefs, motivation and perception, all of which are        before applying to humans, although not all results can
becoming popular in the field of exercise science and       be directly translated to humans 6.
sports medicine. Qualitative research is frequently used
to evaluate community and school physical activity          Controlled Trials
programs to understand the less tangible outcomes           Controlled trials include a group that does not receive
like the participants attitudes and experiences about       a specific treatment or intervention. This is called the
a program of interest 5. Qualitative methods of data        control group and either receives nothing at all or a
collection can include open-ended questionnaires,           placebo.
interviews or market research focus groups 5.
                                                            Meta-Analysis
                                                            Meta-analyses include the results of two or more
                                                            studies. Meta-analyses were first introduced in 1976
general format including an introduction, methods,          by Gene Glass and defined as “a technique of literature
results, discussion and conclusions. Original research      review that contains a definitive methodology and
is considered a primary source and includes data and        quantifies the results of various studies to a standard
results that have not been published previously.            metric that allows the use of statistical techniques
                                                            as a means of analysis” 3. Meta-analyses can be
Narrative (Literature) Review                               distinguished from literature reviews because they
Narrative reviews are considered secondary sources          include a definitive methodology for including specific
and provide a review and general consensus on a             studies in the literature analysis, and the results of
specific topic. Authors collect relevant, primary           various studies are quantified to a standard metric
source articles relating to a specific topic and provide    called effect size (which we will cover later) 3. Different
a summary of the most current and relevant evidence         from systematic reviews, they use various statistical
pertaining to that topic. Narrative reviews are different   methods to combine and analyze the data of a number
from systematic reviews in that they are based on           of studies. Meta-regressions are an extension of meta-
the opinion of the authors and lack strict control over     analyses and include a more effective and advanced
which studies to include in the review. You can think of    statistical tool to assess the relationships between
these as an opinion-based article including a collection    variables. Meta-regressions account for covariates or
and summary of original research. These can be helpful      other study characteristics of interest. When carried
when trying to understand concepts, theories or a body      out properly, meta-analyses are considered the highest
of evidence regarding a specific topic but be careful       quality of scientific study.
accepting them as truth since it’s only the opinion of
the researchers who wrote it. These reviews can be
subject to confirmation bias and cherry picking studies
that fit their narrative.
How To Read Research: A Biolayne Guide                                                                                8
                                                          will cover basic statistics and dig into challenges faced
Article 02                                                by researchers in exercise and nutritional science. We
                                                          will finish this section with how to trust studies and
Research
                                                          The author line of publications follows a specific
                                                          order. The first author is the one who coordinated
                                                          and had the largest role or responsibility in the study.
                                                          Generally, if this is a graduate student’s project or
Reading research can be a challenging task for
                                                          thesis their mentor or supervisor will be listed last. The
those who are not experienced and educated to read
                                                          remaining order of authors will be based on their level
scientific publications. Before being able to interpret
                                                          of contribution. The general format for peer-reviewed,
results and findings from research, it’s necessary to
                                                          academic publications include five sections known
understand the layout and how to read a study. Most
                                                          as the introduction, methods, results, discussion, and
peer-reviewed journal publications follow a similar and
                                                          conclusion. The abstract is another section, but it is
general format, with minor differences. Understanding     separate from the actual publication.
the general layout of publications will make it easier
to identify key details of studies and understand the     Abstract
findings and takeaways. This section of the guide         After the study title and author line you will find the
focuses on how to read scientific studies and interpret   abstract. The abstract is a paragraph summary of the
their findings. After we cover the general layout and     study. The abstract includes one to two sentences
briefly describe each section of a published study, we    from each of the sections of the publication. Don’t be
How To Read Research: A Biolayne Guide                                                                             9
an abstract warrior and only read the abstract to report         experiment occur and how often did they observe
what the study found. The details are important, and             and measure changes?
findings are accompanied with caveats.
                                                             •   Instrumentation: which devices and methods
Introduction                                                     were used to collect data. How was body
The introduction is the first section of all publications.
                                                                 fat percentage (BF%) tested? Did they use
The introduction includes a discussion of recent and
                                                                 appropriate equipment for what they were
previous studies that relate to the current study of
interest. Intro’s start with more general background             attempting to test? Were their measurements
information and progress into key details and                    valid and reliable?
publications that apply to the current study. The intro
also discusses any controversies between theories            •   Level of control: were the participants in a tightly
or hypotheses and highlights the importance for the              controlled environment (metabolic ward) or was
current study. The intro includes two key pieces of the          this a free-living experiment? Studies that include
study known as the purpose and the hypothesis:                   supervision for resistance training studies are
                                                                 more tightly controlled than studies that allow
Purpose - The purpose of the study is a one to two
                                                                 subjects to train on their own. Studies that provide
sentence that describes the aim or the reason for why
                                                                 food to subjects during diet studies have more
the study is being carried out.
                                                                 control over studies that rely on self-reported
Hypothesis - Based on previous research and                      nutritional intake. Ethical and diligent researchers
understanding researchers develop what’s known                   will specify their studies strengths and limitations
as a hypothesis, a short explanation of the predicted            in the discussion but paying close attention to the
results. Hypotheses cannot be proven, but when the               details in the methods will allow you to identify the
data backs up the hypothesis it is “supported” and when          level of control in a particular study.
it doesn’t its “rejected” 10.
                                                           Conclusion
                                                           Everyone knows what a conclusion is, but in this
Main Time Effect - This simply explains if there was       short section authors give a final summary of the
a significant change in the dependent variable from        main takeaways and practical recommendations.
baseline to post-testing for all subjects. Referring to    This is a more concise version of the discussion,
the table above, this will tell us if there was a change   short and practical.
in body weight from baseline to post-testing for both
groups (high carb & high fat) combined.
Concepts
                                                           throughout exercise and nutrition research. Again, this
                                                           is a brief section and doesn’t even scratch the surface
                                                           of the broader and more complex statistical methods
                                                           that exist. Statistics operate under a number of as-
                                                           sumptions and rules, if these are violated, they can
Overview of Statistics
                                                           misrepresent the data. Statistics is not our area of ex-
Most people cringe at the word statistics and we un-
                                                           pertise and it’s important to realize that if you don’t fully
derstand why. Math and statistics can be complex and
                                                           understand statistics they can be misused to deceive
difficult to understand. There are various meanings for
                                                           people into believing the data is more appealing than it
the word statistics, which adds to its confusion. With     actually may be.
a mixture of math and logic, statistics is a branch of
mathematics that is concerned with collection, anal-       Percent Change
ysis and interpretation of data . Data are scores and
                                         3
                                                           Very simply, this is the change between two values ex-
values that we obtain from measuring the outcomes          pressed as a percentage. You have to be careful with
(dependent variables) of interest in a study. Collect-     percentage change because it can sometimes appear
ing data is only one piece of the puzzle, if researchers   to be a greater change than it actually is. That’s why you
don’t know what to do with the data and how to properly    also want the raw or true values. For example, if a study
describe the data, then the findings may seem under-       is looking at leptin changes and they have a baseline
whelming. Statistics are a way of describing data char-    value of 0.3ng/mL and a post-test value of 1.0ng/mL,
acteristics and examining the relationships between        the absolute change is 0.7ng/mL, but the percentage
How To Read Research: A Biolayne Guide                                                                                12
change is 233% [(1 - 0.3) / (0.3 x 100)]. While this change   Standard Deviation
is minimal and may not be meaningful, the percentage          The standard deviation is concerned with the variabili-
change can make it appear as if it’s a big deal.              ty or the spread of a data set. As previously mentioned,
                                                              the mean is the central point of a data set and the stan-
Central Tendency                                              dard deviation is an estimate of the variability around
The mean is probably one of the most commonly un-             that central point. In other words, the standard devi-
derstood mathematical terms. The mean describes               ation represents the typical amount that a score devi-
the average value of a group of numbers. In statistics,       ates from the mean. When the standard deviation is low
the mean is a measure of central tendency, which rep-         that means the spread or dispersion of scores is small
resents a central or balance point within a set of data 10.   and more tightly grouped closer to the mean. When the
The mode and median are similar to the mean because           standard deviation is large it signifies a widespread or
they represent centrality, but technically they’re slight-    high variability of scores, when this occurs the mean
ly different. Mode refers to the most frequent value          may not be a good representation of the data. The
that appears in a data set, which may or may not be           mean and standard deviation are forms of descrip-
close to the mean. Median refers to the middle point of       tive statistics which is useful for summarizing the
a data set, in other words 50% of the scores will fall un-    data of a specific group. Meaning, they are only able
der the median. For example, let’s assume the following       to describe the data we have accrued, it cannot tell
10 scores were collected during an experiment:                us if the results we acquired will happen again. Oth-
                                                              er statistical tests can fall under another form known
6 6 6 10 11 12 14 14 16 17                                    as inferential statistics, which can allow (not always)
                                                              for conclusions and generalizations of a sample to the
                                                              larger population.
Mean = 11.2 The average of all scores
(6+6+6+10+11+12+14+14+16+17 **/** 10)
                                                              P-value
                                                              Probability is the underlying concept of p-values,
Median = 11.5 Middle value
                                                              which is the likelihood that something will occur.
(5 scores below and 5 scores above this value)
                                                              P-values reflect the level of significance, and the odds
                                                              that the findings are due to chance, it’s impossible to
Mode = 6 Most frequent score
                                                              have a p-value of 0 3. In exercise science the p-value is
                                                              considered to be ‘significant’ at p < 0.05. Meaning, re-
If the data set had an odd number of values, then the
                                                              searchers believe that the odds of their findings occur-
middle value is simply the median (ex. 1, 2, 3; 2 would be
                                                              ring by chance are 5 in 100 or they are 95% sure the re-
the median). Just remember there are slightly differ-         sults were not by chance and the observed differences
ent ways to describe central tendency, but most often         were a real change. In the results section when chang-
you’ll hear about the mean since mode and median are          es of a specific variable are reported there is a p-value
only reported for certain instances. When evaluating          reported after (e.g., 103.5 ± 15.1 ng/dL (*p* = 0.02)). In
data based on calculated means it’s important to iden-        exercise and nutritional science, if the p-value is great-
tify any outliers or extreme values in the data. Outliers     er than 0.05 the result isn’t deemed to be significant.
and high variability of data can produce inflated or mis-     This is also stated as ‘supporting the null hypothesis’.
leading results because the mean is sensitive to outli-       The null hypothesis states there isn’t a relationship or
ers and extreme values. In contrast, the median is not        difference and instead the findings are due to sam-
sensitive to outliers and extreme values, meaning the         pling error or random chance. Statistical tests are per-
median won’t change if there is a greater spread in the       formed to either support or reject the null hypothesis
data. If the mean is being reported it’s important to also    and anything less than 0.05 rejects the null hypothesis
take note of the standard deviation to account for this.      and accepts the research hypothesis. Statistical sig-
How To Read Research: A Biolayne Guide                                                                                13
nificance is what you should identify when interpreting      for meta-analytic conclusions, and they are commonly
results, but significant differences aren’t the only thing   used for future study planning using a power analysis 11.
you want to focus on. A study might show that one type       Effect sizes can be interpreted based on recommenda-
of diet lost significantly more weight than another type     tions by Cohen 1988, which states that effect sizes can
of diet, but what if it was only by 0.5 lbs? That doesn’t    range from small (d = 0.2), medium (d = 0.5), and large (d
mean much, but how do you determine if significant re-       = 0.8) 12. Larger effect sizes are more significant. Effect
sults are meaningful? While p-values provide statistical     sizes are also commonly used to plan future studies by
significance, effect sizes allow researchers to commu-       predicting the sample size needed to detect a differ-
nicate practical significance of their results 11.           ence, this type of test is known as a power analysis.
Line Graph
Line graphs depict related data points that are con-
nected with a line, sometimes they include symbols
[13]. Line graphs are great when comparing time trials
where there are multiple testing points over a period
of time. For example, comparing the response of two               25 percent of the scores fall under this line.
different supplement treatments over a predefined pe-          2. Median: The median (as described previously) is
riod of time.                                                     the middle value and 50 % of the scores fall under
                                                                  this value.
Box and Whisker Plots
Box and whisker plots (box plots) are used to depict the       3. Q2: is the right side of the rectangle and rep-
distribution of a data set. Once you understand each              resents the 75th percentile, meaning 75% of the
component of a box plot, you’ll realize how simple and            scores fall below this value.
effective they can be at summarizing a set of scores.
Usually box plots are vertical, but we have provided an        4. Whiskers: The whiskers can be found on either
Forest Plots
You will mostly see forest plots in Joe Rogan podcasts
with James Wilks… just kidding. You typically see For-
est Plots in meta-analyses because they depict the           under the figure doesn’t explicitly state what kind they
individual results as well as the pooled results of the      are they can be rather meaningless 25. The standard
meta-analysis. Forest plots will indicate the strength of    deviation (SD) bars represent the typical difference be-
the treatment effect with the y-axis containing a list of    tween the data points and their mean, whereas stan-
the studies included in the analysis and the x-axis will     dard error (SE) bars indicate how variable the mean will
have a distinction of what the studies favor (control vs.    be if you repeat the study over and over, and more sub-
treatment) 13. Each study will have their mean symbol-       jects or samples decrease the SE 15. You’ll notice in the
ized as a data marker and their respective confidence        forest plot above that they included 95% CI error bars,
interval (we will cover next, but generally 95%) repre-      which indicates where the true mean will fall within
sented as a horizontal line 13. The size of the data mark-   that bar on 95% of occasions 15. SE and CI with wider
er generally represents the sample size, or the weight       bars indicate larger error and shorter bars indicate
carried by that particular study in the meat-analysis.       higher precision, as sample sizes increase the bars be-
Diamond markers are generally used to represent the          come shorte 15. Error bars are helpful in visually depict-
overall or pooled result 13. In the example below adapt-     ing the significance in changes between groups. When
ed by Morton et al. (2017), you will find three different    error bars overlap the difference isn’t significant or in
diamonds 14. The first two unfilled diamonds represent       other words, the larger the gap between error bars the
the pooled results of trained vs. untrained samples and      smaller the p-value will be. Error bars can be valuable in
the filled in or dark diamond represents the overall or      justifying the authors conclusions, but like any statistic
total results of the meta-analysis (including trained and    they are only a guide and you should rely upon your own
trained subjects). Oftentimes forest plots will contain a    logic and understanding to determine the meaningful-
clear description of what each marker symbolizes un-         ness in the results being reported 15.
derneath the actual figure.
                                                             Tables
Error bars                                                   Tables are generally self-explanatory and describe the
Elements that you will commonly see on most figures          different symbols in the figure legend/description be-
are error bars. Error bars are lines that represent the      low. This table is from Layne’s PhD thesis where they
variability of the data being reported. There are differ-    examined the time course of plasma amino acid lev-
ent types of error bars and if the legend or description     els in response to ingestion of various protein sources
Lys 608 ± 24 1083 ± 78 a* 593 ± 34 688 ± 62 930 ± 64* 553 ± 28 698 ± 14 933 ± 67 597 ± 55 726 ± 48
Met 49 ± 2 102 ± 6 a* 62 ± 2 a* 80 ± 5 a* 72 ± 3 b* 42 ± 1 b 52 ± 3 b 71 ± 3 b* 44 ± 4 b 46 ± 2 b
Thr 309 ± 9 594 ± 73 * 567 ± 18 a* 554 ± 38 a 383 ± 21 330 ± 18 b 314 ± 22 b 387 ± 12 382 ± 20 ab 308 ± 13 b
   1
     Plasma amino acids express as umol/L.
   2
     Data are mean ± SE<; n = 5-6. Means without a common letter differ between treatments within.
   time-points, P < 0.05.* Indicates different from fasted (P < 0.05).
   3
     12 h food-deprived controls.
Challenges for
Researchers
Research critics will often complain about studies
not performing a specific measurement or failing to
account for some variable. Oftentimes these criticisms
are invalid or unwarranted because of the limits
imposed on researchers. Armchair scientists who
unfairly criticize studies for certain aspects oftentimes
fail to recognize the challenges that researchers in
nutrition and exercise science face. Depending on the
academic institution, labs and universities vary widely
in the equipment and funding they have available for
research. Obviously, larger labs with graduate and
postdoctoral programs are able to attract larger grants
and more funding for projects which leads to more
sophisticated testing instruments and a higher level of
control over testing conditions. While there is growing
interest in exercise and nutritional sciences which
leads to more funding sources, there are still studies       $5,600 and that’s just to test one hormone. That’s not
that can’t be conducted due to lack of resources.            considering other lab supplies you might need, and the
                                                             researcher wouldn’t be able to pay their staff anything
Funding                                                      which means they would need to find students who are
The primary challenge for researchers in exercise            willing to volunteer their time on top of their academic
and nutritional science is funding. There are various        responsibilities. If you’re looking at studies that test
funding sources available such as governmental like the      protein metabolism in rats, the cost of carrying out an
NIH, University grants, industry funding from food or        experiment could be upwards of $50,000. Many studies
supplement companies, organizations such as ACSM,            need to pay subjects to recruit the necessary sample
NSCA, and other private foundations and non-profit           size and if it’s a dieting study that includes supplying
organizations. The unfortunate reality is that even with     food, the cost of food can be astronomical. Nowadays
studies receiving funding, the funding generally isn’t       many supplement companies are becoming more
enough to support the desired level of control to be         interested in having scientifically validated research
considered a high-quality study. To give you an idea of      to support the efficacy of their products for improved
how quickly the costs for a study can add up, here in        marketing. Some studies sponsored by supplement
Florida the cost of performing a blood hormone test like     companies can cost tens of thousands of dollars and
leptin is roughly $70 per blood draw. So, let’s assume you   can even reach upwards of hundreds of thousands of
wanted to test 10 subjects before and after a diet, that’s   dollars when offering to pay subjects to participate. We
two leptin tests per subject which adds up to $1,400         haven’t even discussed the costs associated with the
for only 10 subjects. That’s a small sample size and if      instrumentation necessary to test certain variables in a
you wanted to make it a stronger study you would likely      lab. Generally, departments receive funding from their
need more like 40 people which could cost upwards of         Universities for lab related costs to maintain, repair
How To Read Research: A Biolayne Guide                                                                             20
or replace testing equipment. The amount received         exercise science programs will have a metabolic cart,
yearly for department budgets is generally only enough    treadmills, cycle ergometers, various types of body
to afford maintenance on their current equipment and      composition testing instruments, heart rate and blood
replace regularly used supplies, they can’t afford to     pressure monitors, and some other performance-
buy new equipment or replace machines every year.         based testing equipment, but again this will depend
Most exercise science programs have what’s called         on the university, the region and the faculty’s research
a metabolic cart (which we’ll discuss later) and costs    interest. We will cover some common measurement
upwards of $20,000, that’s not including the costs to     techniques later, but it’s important to understand that
maintain normal functioning or replace certain supplies   very few labs have the most sophisticated testing
needed for regular use. That is why labs are limited by   equipment like a metabolic ward, MRI’s or muscle
funding and the equipment they have available.            biopsy testing, due to funding. Aside from the major
                                                          challenges of funding and lab equipment, researchers
Available lab equipment                                   are governed by their institution to ensure responsible
It should now be no surprise why most exercise science    research conduct.
programs can’t afford to have sophisticated testing
equipment. The type of equipment in a researcher’s lab    IRB/ethics boards
will determine the type of studies they can conduct.      Academic institutions have ethics boards or governing
Some labs are focused on more mechanistic studies         bodies that oversee experimental research. At many
that involve molecular biology experimentation using      universities the governing body is known as the
cells and microscopes, whereas other labs are focused     Institutional Review Board (IRB) for humans and the
                                                                                Institutional Animal Care and
                                                                                Use Committee (IACUC) for
                                                                                animal research . The purpose of
                                                                                these departments is to ensure
                                                                                safe and ethical standards are
                                                                                being followed according to
                                                                                laws and regulations. Before a
                                                                                study can begin recruiting and
                                                                                testing subjects, they must go
                                                                                through a formal review process
                                                                                to obtain study approval. This
                                                                                is one of the most annoying
                                                                                processes involved in research
                                                                                because it’s time consuming
                                                                                and tedious. It’s comparable
                                                                                to filing your taxes, but more
                                                                                detail oriented and time
                                                                                consuming. While necessary,
                                                                                this approval process can take
                                                                                away time from conducting
                                                                                the experiment because most
on more practical and applied research that investigate   academic institutions operate on semester timelines
the effectiveness of a type of training modality.         that may include breaks or holidays that interfere with
Researchers will focus on a specific area of interest     the study timeline. So, if it takes 8 weeks to approve
and build their labs around that focus. The majority of   a study and then another 3 weeks to recruit enough
How To Read Research: A Biolayne Guide                                                                          21
subjects that’s the majority of the semester and only        50 subjects and the training program consists of 3 full
leaves a few weeks to conduct an experiment. This is         body days per week supervised in the lab by research
why you will often see studies that aren’t much longer       staff. Not only will you have to create a schedule for
than 12 weeks in duration. The IRB process includes an       the research staff to supervise each training day, but
informed consent for subjects and a very formal written      you’ll also need to schedule each participant for each
study protocol explaining in detail every aspect of the      training session each week. Not to mention, you’ll have
study, including how you intend to recruit subjects.         to schedule your baseline testing, mid-point testing
                                                             (if there is one) and post-testing. Depending on which
Subject Recruitment                                          measurements will be taken, it could take an hour for
Subject recruitment is the other annoying process            each participant, which means 50 hours per testing
for conducting human research. Recruitment can be            session multiplied by three testing points and that’s
difficult and time consuming for exercise science and        150 hours only for the measurement testing sessions.
nutrition researchers. As mentioned previously, many         That doesn’t account for the hour each subject is
labs don’t have the necessary funding to pay subjects        training in the lab 3 days per week over 12 weeks. The
to participate in their studies. Free protein powder         time requirement researchers ask from their subjects
and supervised training in the lab can be an appealing       can be a lot. This is a good example of why you don’t
incentive to some, but many others don’t want to follow      see many training studies over 12 weeks, it takes a lot
a standardized program for fear of less than optimal         of time and money!
results. This is why you generally see sample sizes less
than 50 in training studies. Even if a researcher is lucky
enough to recruit 50 people you generally have subjects
drop out due to various reasons and can end up losing        Trusting Research
up to 20 subjects or more sometimes depending on
testing or intervention requirements. People have a          How can you trust research and how do you evaluate
hard time following specific instructions, especially if     studies that show conflicting findings? Individuals
it means changing their usual lifestyle to accommodate       without research experience are at a severe
study procedures when there is no incentive to comply.       disadvantage when it comes to being able to tease out
Think about asking college students to follow a specific     the nuances and extrapolate upon results presented
diet and no alcohol on the weekends or asking them           in publications.
to come to the lab early before classes for testing or
training, or how about asking them if it’s ok to stick       Bias
a needle as large as a pencil in their leg for a muscle      We all have our own biases towards certain ideas or
biopsy? Obviously, studies that include animal models        topics, unfortunately most people either fail to admit
don’t have to ‘recruit’ subjects, but they have to pay       or don’t realize they have a bias towards a particular
more for their ‘subjects’.                                   topic. Good scientists recognize and acknowledge
                                                             their bias in an effort to tightly control for them in their
Scheduling and Testing                                       experimental design. Being biased means having an
As mentioned earlier, scheduling and experimental            unbalanced opinion or belief regarding a certain topic or
time frames can be a major issue in conducting               idea. This often leads to being close-minded and failing
experiments, especially if operating under University        to recognize conflicting or contrary evidence, beliefs
semester timelines. Even if studies have the                 or ideas. Scientifically speaking, bias is a systematic
opportunity to occur over multiple semesters or with         deviation between an estimated value and its true value 3.
no time restrictions, scheduling can be a logistical         In other words, it can be used to represent error.
nightmare for research staff. For example, let’s assume      There are a few types of biases that are important to
a study is investigating muscle growth over 12 weeks in      understand to become more critical of research.
How To Read Research: A Biolayne Guide                                                                                22
Confirmation Bias - This is essentially when people          are a serious undertaking and require substantial time,
will cite evidence or report data that fits their bias or    money, and effort to complete them. When the results
belief, while ignoring or failing to provide evidence        turn out to be non-significant it can be crushing to
that says otherwise. You’ll oftentimes see unethical         the researcher and the amount of time and headache
individuals cite one study that supports their argument      they would have to put into getting it published just
while failing to acknowledge five other studies that         isn’t worth it so they store it in a file and forget about
refute their argument. There could also be a scenario        it (“file drawer effect”) 17. There’s also scenarios where
where someone misinterprets or takes very weak               graduate students carry the responsibility of writing
evidence and glorifies it to make it seem stronger           up and submitting their manuscript for publication
than it really is. Politics is a good example, you will      after completing the research project and instead
oftentimes see certain media or news outlets reporting       they either graduate or move on to another program
a story that is misleading or simply untrue. They may        without completing the publication process. Other
use a weak study or twist the narrative of a particular      times researchers still put in the effort to get their
topic to support their side of the story. Sometimes          study published but due to publication bias of journals
you’ll see a news report showing only a piece of an          it may be difficult or impossible to receive acceptance.
interview or press conference where it falsely portrays      However, reasons for researchers being guilty of
an individual’s beliefs to make them look bad and push       publication bias are due to lack of time, low quality or
their own political agenda. In research you may come         an incomplete study, fear of rejection, or insignificant
across a discussion where authors are comparing their        findings 16.
findings to other studies, but they fail to acknowledge
other studies that refute their findings.                    Even though resources, time and effort will go
                                                             wasted when studies aren’t published, there are
Publication Bias - Publication bias is actually a pretty     some consequences of failing to publish studies with
common and unfortunate practice in the scientific            negative results. Before researchers invest time in
community. This type of bias is concerned with               designing a study, they obviously explore journals to
publishing studies that only report significant results.     find publications that are similar to their research
Published studies that support their hypothesis              question or hypothesis and evaluate their findings.
represent 85.9% of published studies in 2007 compared        If a study isn’t published due to negative results and
to studies that reject their hypothesis 16. Let’s face it,   another researcher wants to test the same hypothesis,
studies with stronger findings or significant results        they will be wasting valuable time and resources on a
are more appealing to readers, especially editors and        study that would produce negative results. Therefore,
publishers because they’re more likely to get cited in       even though a study produces negative results it
other research, which leads to higher journal impact         should still be published to inform future research.
factors and more revenue for journals 16. Completing a       Additionally, unpublished data can misguide meta-
study with insignificant findings can pose challenges for    analysis findings and conclusions. If meta-analyses
researchers and leaving them unpublished also poses          are using data that only show significant findings when
a few issues. While the majority of responsibility for       there are unpublished studies to conflict with some
publication bias lies with journal editors and publishers,   studies, they can produce false positives and misguide
researchers can be guilty also. Researchers are busy         recommendations 16. Appropriately performed meta-
and they usually have a research agenda planned out          analysis of clinical trials are the highest quality of
so that once a study is completed, they can begin on         scientific publications and commonly used for health-
the next project, and oftentimes they have multiple          care decision making and therapies 18. One of the more
research projects occurring at the same time. Earlier we     serious consequences of unpublished negative data is
briefly described what goes into designing and carrying      the potential harm to individuals from pharmaceutical
out a research study, it’s obvious that research studies     drugs or even supplements. Publishing these negative
How To Read Research: A Biolayne Guide                                                                               23
results could improve safety and standards of drugs         transparent and acknowledge any potential personal
before they’re released [16, 18]. Maybe a supplement        benefit or gain of the researchers or parties involved.
study is carried out and finds no positive effect of        This should be a clear indication that they aren’t trying
their treatment, but there were some subjects who           to “hide” something or be dishonest, it should represent
reported adverse symptoms or side effects. This             the opposite. If dishonest researchers were attempting
study goes unpublished but could be detrimental to          to conceal some relationship or personal benefit, they
someone’s health.                                           simply would risk not listing a conflict of interest. Earlier
                                                            we mentioned various sources of funding including
Inflation Bias - Commonly referred to as “p-hacking”,       food and supplement companies, governmental
this is when unethical researchers will try a wide          organizations, private companies, etc. When you come
variety of statistical tests and then selectively report    across a supplement company funding a dieting study
the significant results 17. This is essentially when        or a study investigating the effectiveness of a particular
researchers torture their data until they obtain a          supplement, this should raise a red flag, as with any
significant finding. It’s important to understand           type of company funding a study that investigates their
that statistical analyses should be pre-determined          product. But again, it just means you should evaluate
and a part of the study design process. P-hacking           the findings more critically. Before even evaluating the
commonly occurs when researchers conduct a study            results check the study design. Was it a randomized
and after collecting data decide to perform additional      placebo-controlled design? If not, you should be very
or different statistical tests based on the gathered        apprehensive to the findings and results. Randomizing
data. Another common occurrence is when they                and having a placebo-controlled design is essential
simply eliminate outlier data from subjects who didn’t      when comparing treatments.
respond or responded much greater than the rest of
the group. Another situation in which researchers are       Evaluating Conflicting Evidence
guilty of p-hacking is when they manipulate or change       Let’s assume there have only been two studies published
the groups, they established at the beginning of the        on a certain topic and they report contrasting findings.
study to make one group look like they experienced          How do you determine which study is better or which
greater change. Lastly, p-hacking can occur from            study to trust? This is a difficult question to answer
researchers performing data analysis part way through       and involves many considerations, but we will highlight
the duration of the study and discontinuing the study       certain aspects and key details you’ll want to focus on.
based on their results or simply not performing other
statistical tests once they find significance [17].         Results - The level of significance of the results
Ethical researchers will do their best to address and       is important and this is one of the first things you
acknowledge their biases, which sometimes can be            should notice, but as mentioned previously (statistical
unintentional. Unethical researchers obviously make         concepts), how meaningful are the results? Remember,
choices with illintent and biases are irrelevant in those   we want to see a P-value < 0.05 and the higher the effect
situations. Science and peer-reviewed research does a       size value, the more meaningful it is. After evaluating
pretty good job at weeding out the bad apples and part      the statistics, check to see if there is any missing data
of this deals with addressing conflicts of interest.        or if authors also published raw data within the text,
                                                            appendix or supplementary material. A good example
Funding Sources / Conflicts of Interest - Any time          is, if a study is comparing two different types of diets,
there is a conflict of interest listed at the bottom of     they should have a table showing their respective diet
a publication it should be evaluated more critically.       compositions, if not some type of food records or
However, this doesn’t mean you should immediately           nutrition data. If there isn’t any type of nutrition data
discredit or dismiss the study or the findings. Ethical     and it’s a diet study, we would be VERY cautious of the
researchers list their conflicts of interest to be          findings and the conclusions that are drawn. Publishing
How To Read Research: A Biolayne Guide                                                                                24
raw data is not necessary, but it’s a good practice and if    10 subjects it carries a lot less weight than studies
there’s raw data available look it over for yourself to see   with larger cohorts, but they can still be valuable and
if there’s any glaring issues or if some of the numbers       contribute to the body of literature. Case studies are at
don’t add up. Within the results section they obviously       the bottom of the totem pole for study designs, but for
will report the results from statistical analysis for         investigating certain novel topics they can be the only
the primary variables of interest, but they should            appropriate design available. These types of studies
also provide some type of figure or table to visually         should just be interpreted with caution and understand
represent the data. Lastly, do the results of the study       that their ability to draw strong conclusions is severely
agree with previous studies? It’s ok if they don’t, but in    limited. The caveat to this is with studies that are
the discussion the authors should explain conflicting         extremely well controlled but have a small subject
results and if there is a reason why results don’t agree.     number. An example of these types of studies would
                                                              be metabolic ward nutritional studies. In these studies
Study Design / Level of control - How much control            every piece of food is provided to the subjects and
did the researchers have over the independent                 they are housed in a ward that measures their energy
variables? Did they provide food to participants if it        expenditure. These types of studies do not need to
was a diet study? Were they supervising the resistance        have a high subject number in order to be impactful due
training program prescribed to participants? How              to their high degree of control. They are also incredibly
did they control free-living conditions? Obviously,           expensive which is why they typically don’t have a high
there are no mandatory requirements researchers               subject number.
should be following for their study design, this will be
limited by their laboratory techniques and equipment          Study Duration - You will generally encounter training
they have available. But, there are some things you           studies in exercise science with durations around
should be asking yourself when reading through the            12 weeks. This isn’t a bad thing, but the strength of
methods section, how did they test and control for X,
Y and Z. If a study had subjects in a metabolic ward
that’s far more valuable data than any free-living
study. Similarly, if a training study doesn’t mention
anything about supervised training it’s going to carry
more confounding variables and limitations than a
study that included supervised training in the lab for
the duration of the study. The level of control is going
to significantly impact the sample size and the study
duration. Increasing the level of control comes at a
cost, higher control = higher cost and generally leads
to a smaller sample size and shorter study durations
to maintain that level of control. Unlike human model
designs, rodent models offer a high level of control,
longer study duration and a larger sample size at a
smaller cost compared to human subject designs. But
the results aren’t always transferable to humans.
Common
Methods for
Measuring
Variables
Body Water
Deuterium Dilution
Deuterium is a stable isotope of Hydrogen and
deuterium dilution serves as the “gold standard” or
criterion method for total body water assessment.
Researchers use a labeled water that contains a large
quantity of deuterium (“heavy water”) and measure
concentrations in the urine, blood or saliva to measure
total body water. There are other isotopes that can
be used in a similar manner to the deuterium dilution          transmission. Based on fat mass content in your body,
method, but most commonly it is deuterium that’s               the impedance (resistance) of the electrical current
used as a tracer. Using this method subjects void their        is measured using Ohm’s law (resistance = volume
bladders than drink water with the labelled isotope and        / current) which can then be applied in an equation
after it has equilibrated in the body for a duration of time   to quantify water volume, percentage body fat, and
researchers most commonly collect a urine sample.              FFM 21. There are many different types of BIA devices
The urine is then analyzed using a mass spectrometer           available and vary based on specific frequencies, cost
to determine total body water levels. This method is           and complexity, which will impact the validity and
expensive, time consuming and requires sophisticated           reliability of the specific device being used. Nowadays
laboratory expertise 26. For this reason, other measures       you will commonly see BIA technology integrated into
have been developed to more conveniently measure               at-home body weight scales. When used for body
total body water (TBW).                                        composition assessment, research indicates that BIA
                                                               is comparable to DXA when estimating BF%, fat mass
Bioelectrical Impedance Analysis (BIA)                         or fat-free mass (FFM) 27. However, other research
BIA technology uses a small electrical current that            indicates that single assessments using DXA or BIA
is transmitted through your body extremities and               is questionable due to their accuracy on an individual
between voltage detecting electrodes (contacting               level [28]. When compared to deuterium dilution
hands and/or feet). Water conducts electricity and             for measuring TBW, BIA is close in accuracy, but still
tissues like fat mass and bone have very little water          slightly underestimates TBW 29. BIA shows promise
which increases the resistance (impedance) of the              in accurately estimating TBW, however accuracy in
electrical current thereby decreasing the rate of its          measurement can vary based on the population being
How To Read Research: A Biolayne Guide                                                                              28
studied and with little research comparing BIA to             fat mass and/or fat-free mass (FFM). The only direct
deuterium dilution, the validity to accurately estimate       measurement of body composition would involve
TBW remains questionable 30. Nonetheless, evidence            performing an autopsy on a human cadaver to dissect
suggests BIA is acceptable for assessing TBW and              and weigh various tissues and organs, which is obviously
displays acceptable accuracy when assessing body              impossible for free living experiments. Therefore, we
composition if incorporated into a multi-compartment          estimate body composition based on what we know
model 28. Another tool that shares similarities to BIA        about the weight and composition of various tissues in
known as Bioelectrical impedance spectroscopy (BIS),          the body. It’s important to understand that there is no
seems to exhibit greater validity and reliability than BIA    perfect estimate and all techniques and methods have
when assessing TBW 26.                                        error rates associated with them. For this reason, we
                                                              cannot place a high level of importance with a specific
Bioelectrical Impedance                                       percentage of body fat. Rather, we use it as an objective
Spectroscopy (BIS)                                            measure to quantify and track changes to determine
BIS features the same underlying technology as                the effectiveness of specific interventions.
BIA to estimate body composition and water, which
includes an electrical current traveling through the          Skinfold
body between electrodes to measure the impedance              The most common and cost-effective method for
of the electrical current. BIS devices differ from BIA        estimating body composition is the skinfold technique.
devices by utilizing a ‘spectra’ of frequencies, which is     This technique assumes a 2-compartment (2C) model
where the term spectroscopy comes from 30. Although           (more on multi-compartment models later), splitting
there are single and multi-frequency BIA devices on           body weight into fat mass and FFM.
the market and it’s unclear at what frequency a BIA
could be considered BIS; BIS uses Cole modelling              This technique requires firmly grasping the subject’s
to predict body fluids, which has been suggested to
                                                              subcutaneous fat and skin with the thumb and
be superior for assessing body composition using
                                                              forefingers to measure the thickness (in mm.) with a
impedance based methods 30, 32, 33. BIS is also useful in
                                                              caliper. You can accomplish these measurements with
differentiating between intracellular and extracellular
                                                              as few as three sites or as many as seven including
body water. The underlying principles used for BIA
                                                              the triceps, subscapular, suprailiac, abdominal, upper
and BIS are the same for estimating body composition
                                                              thigh, chest, and midaxillary. Measuring seven sites
and either device can acceptably be utilized for body
water estimations, however it appears BIS is more
accepted 26, 28, 30, 33. It’s important to keep in mind the
underlying principles for how these impedance based
devices were developed and they’re primarily for body
water assessment, although they can predict body fat
% (BF%), other body composition methods would be
more acceptable.
Body Composition
There are a number of techniques and methods
available for measuring body composition, specifically
How To Read Research: A Biolayne Guide                                                                               29
give a more accurate estimate of BF% because it              the group average BF% the error in BF% estimation
can account for body fat distribution, some people           could be only 2%. These are arbitrary numbers and don’t
hold more fat in their lower body compared to upper          reflect the true error rates of skinfolds, those will vary
body. The sum of these site measurements are added           depending on the equation, population and criterion
together and plugged into a prediction equation to           method being used for comparison. Nonetheless,
estimate body density, which is then plugged into the        skinfolds are the most cost-effective method and with
Siri equation to estimate body fat percentage (BF%)          a skilled technician and correct equations, they can
34
   . There are a number of body density prediction           provide an accurate estimate of body composition.
equations available and it’s important to use a population
specific equation because the coefficients used in the       A-mode Ultrasound
calculations can produce inaccurate estimations for          A-mode ultrasound uses ultrasonography technology,
individuals with varying body fat levels. When using         which transmits a signal through the skin and tissues
an appropriate population specific equation, skinfold        and the reflection of the signal at tissue boundaries is
fairly accurately predicts BF% (± 3-4%) 35. The great        transmitted back as an “echo”. There is also another
thing about skinfold is not only the low cost, but you       type of ultrasound known as “B-mode” (we’ll cover
can track site-specific changes
to gauge the rate and location of
fat loss. Additionally, this is one
of the few measurements that
actually assess fat thickness,
most other measures use X-ray
beams and imaging techniques
or electrical currents to assess
fat mass. This technique is
only as accurate and reliable
as the technician who is
performing the test. The
technician must have a lot of
experience developing this skill
to precisely identify anatomical
site location and accurately
measure        fat       thickness
consistently. When compared
to computed tomography (CT
scan) skinfold shows a strong
correlation when comparing
measurements performed in the abdominal region               later), but we’re specifically referring to A-mode
36
  . However, studies comparing skinfolds to the gold         ultrasound. Bodymetrix has developed a handheld
standard 4C model, results indicate large individual         portable device that is used similarly to how skinfolds
error rates, but acceptable group average values 37, 39.     are conducted. The device can be used to measure
Meaning, when you test one person the error rate can         as few or as many sites as desired, simply select the
be much higher compared to measuring and averaging           equation and number of sites from a drop-down menu
the BF% of a group of people. For example, you could         in the software. This technique also relies on the skill
compare skinfolds to another method and see an over          of the technician. One of the primary benefits is being
or under estimation in BF% by 6%, but when comparing         less invasive since it does not include “pinching” the
                                         4 Compartment
                                             Model
11.   Lakens, D. (2013). Calculating and reporting effect sizes to facilitate   25. Katsanos, C. S., Chinkes, D. L., Sheffield-Moore, M., Aarsland, A.,
      cumulative science: a practical primer for t-tests and ANOVAs.                Kobayashi, H., & Wolfe, R. R. (2005). Method for the determination of
      Frontiers in psychology, 4, 863.                                              the arteriovenous muscle protein balance during non-steady-state
                                                                                    blood and muscle amino acid concentrations. American journal of
12.   Cohen, J. (1988). Statistical power analysis for the social sciences          physiology. Endocrinology and metabolism, 289(6), E1064–E1070.
      (2nd ed.). Routledge.
                                                                                26. Kerr, A., Slater, G., Byrne, N., & Chaseling, J. (2015). Validation of
13.   King, L. (2018). Preparing better graphs. Journal Of Public Health And        Bioelectrical Impedance Spectroscopy to Measure Total Body Water
      Emergency, 2(1).                                                              in Resistance-Trained Males. International journal of sport nutrition
                                                                                    and exercise metabolism, 25(5), 494–503.
14.   Morton, R. W., Murphy, K. T., McKellar, S. R., Schoenfeld, B. J.,
      Henselmans, M., Helms, E., Aragon, A. A., Devries, M. C., Banfield,       27.   Schoenfeld, B. J., Nickerson, B. S., Wilborn, C. D., Urbina, S. L.,
      L., Krieger, J. W., & Phillips, S. M. (2018). A systematic review,              Hayward, S. B., Krieger, J., Aragon, A. A., & Tinsley, G. M. (2020).
      meta-analysis and meta-regression of the effect of protein                      Comparison of Multifrequency Bioelectrical Impedance vs. Dual-
      supplementation on resistance training-induced gains in muscle                  Energy X-ray Absorptiometry for Assessing Body Composition
      mass and strength in healthy adults. British journal of sports                  Changes After Participation in a 10-Week Resistance Training
      medicine, 52(6), 376–384.                                                       Program. Journal of strength and conditioning research, 34(3),
                                                                                      678–688.
15.   Cumming, G., Fidler, F., & Vaux, D. L. (2007). Error bars in
      experimental biology. The Journal of cell biology, 177(1), 7–11.          28. Graybeal, A. J., Moore, M. L., Cruz, M. R., & Tinsley, G. M. (2020).
                                                                                    Body Composition Assessment in Male and Female Bodybuilders:
16.   Mlinarić, A., Horvat, M., & Šupak Smolčić, V. (2017). Dealing with the        A 4-Compartment Model Comparison of Dual-Energy X-Ray
      positive publication bias: Why you should really publish your negative        Absorptiometry and Impedance-Based Devices. Journal of strength
      results. Biochemia medica, 27(3), 030201.                                     and conditioning research, 34(6), 1676–1689.
31.   Matias, C. N., Santos, D. A., Gonçalves, E. M., Fields, D. A., Sardinha,   44. Friedl, K. E., DeLuca, J. P., Marchitelli, L. J., & Vogel, J. A. (1992).
      L. B., & Silva, A. M. (2013). Is bioelectrical impedance spectroscopy          Reliability of body-fat estimations from a four-compartment model
      accurate in estimating total body water and its compartments in elite          by using density, body water, and bone mineral measurements. The
      athletes?. Annals of human biology, 40(2), 152–156.                            American journal of clinical nutrition, 55(4), 764–770.
32. Cole, K.S. Permeability and impermeability of cell membranes for ions        45. Wilson, J. P., Strauss, B. J., Fan, B., Duewer, F. W., & Shepherd, J.
    in Cold Spring Harbor Symposia on Quantitative Biology. 1940. Cold               A. (2013). Improved 4-compartment body-composition model for a
    Spring Harbor Laboratory Press.                                                  clinically accessible measure of total body protein. The American
                                                                                     journal of clinical nutrition, 97(3), 497–504.
33. Matthie, J. R. (2008). Bioimpedance measurements of human body
    composition: critical analysis and outlook. Expert review of medical         46. Nickerson, B. S., & Tinsley, G. M. (2018). Utilization of BIA-Derived
    devices, 5(2), 239-261.                                                          Bone Mineral Estimates Exerts Minimal Impact on Body Fat Estimates
                                                                                     via Multicompartment Models in Physically Active Adults. Journal of
34. Siri, W. E., Brozek, J., & Henschel, A. (1961). Techniques for measuring         clinical densitometry : the official journal of the International Society
    body composition. Washington, DC: National Academy of Sciences,                  for Clinical Densitometry, 21(4), 541–549.
    223-224.
                                                                                 47.   Williams, J. E., Wells, J. C., Wilson, C. M., Haroun, D., Lucas, A., &
35. Withers, R. T., Craig, N. P., Bourdon, P. C., & Norton, K. I. (1987).              Fewtrell, M. S. (2006). Evaluation of Lunar Prodigy dual-energy X-ray
    Relative body fat and anthropometric prediction of body density                    absorptiometry for assessing body composition in healthy persons
    of male athletes. European journal of applied physiology and                       and patients by comparison with the criterion 4-component model.
    occupational physiology, 56(2), 191–200.                                           The American journal of clinical nutrition, 83(5), 1047–1054.
36. Orphanidou, C., McCargar, L., Birmingham, C. L., Mathieson, J., &            48. Clasey, J. L., Kanaley, J. A., Wideman, L., Heymsfield, S. B., Teates,
    Goldner, E. (1994). Accuracy of subcutaneous fat measurement:                    C. D., Gutgesell, M. E., Thorner, M. O., Hartman, M. L., & Weltman, A.
    comparison of skinfold calipers, ultrasound, and computed                        (1999). Validity of methods of body composition assessment in young
    tomography. Journal of the American Dietetic Association, 94(8),                 and older men and women. Journal of applied physiology (Bethesda,
    855–858.                                                                         Md. : 1985), 86(5), 1728–1738.
37.   van Marken Lichtenbelt, W. D., Hartgens, F., Vollaard, N. B., Ebbing,      49. Kullberg, J., Brandberg, J., Angelhed, J. E., Frimmel, H., Bergelin, E.,
      S., & Kuipers, H. (2004). Body composition changes in bodybuilders:            Strid, L., Ahlström, H., Johansson, L., & Lönn, L. (2009). Whole-body
      a method comparison. Medicine and science in sports and exercise,              adipose tissue analysis: comparison of MRI, CT and dual energy X-ray
      36(3), 490–497.                                                                absorptiometry. The British journal of radiology, 82(974), 123–130.
38. Evans, E. M., Saunders, M. J., Spano, M. A., Arngrimsson, S. A., Lewis,      50. Tothill, P., & Hannan, W. J. (2000). Comparisons between Hologic
    R. D., & Cureton, K. J. (1999). Body-composition changes with diet and           QDR 1000W, QDR 4500A, and Lunar Expert dual-energy X-ray
    exercise in obese women: a comparison of estimates from clinical                 absorptiometry scanners used for measuring total body bone and soft
    methods and a 4-component model. The American journal of clinical                tissue. Annals of the New York Academy of Sciences, 904, 63–71.
    nutrition, 70(1), 5–12.
                                                                                 51.   Haun, C. T., Vann, C. G., Roberts, B. M., Vigotsky, A. D., Schoenfeld,
39. Peterson, M. J., Czerwinski, S. A., & Siervogel, R. M. (2003).                     B. J., & Roberts, M. D. (2019). A Critical Evaluation of the Biological
    Development and validation of skinfold-thickness prediction                        Construct Skeletal Muscle Hypertrophy: Size Matters but So Does the
    equations with a 4-compartment model. The American journal of                      Measurement. Frontiers in physiology, 10, 247.
    clinical nutrition, 77(5), 1186–1191.
                                                                                 52. Vigotsky, A. D., Schoenfeld, B. J., Than, C., & Brown, J. M. (2018).
40. Kuehne, T. E., Yitzchaki, N., Jessee, M. B., Graves, B. S., &                    Methods matter: the relationship between strength and hypertrophy
    Buckner, S. L. (2019). A comparison of acute changes in muscle                   depends on methods of measurement and analysis. PeerJ, 6, e5071.
    thickness between A-mode and B-mode ultrasound. Physiological
    measurement, 40(11), 115004.                                                 53. Haun, C. T., Vann, C. G., Mobley, C. B., Roberson, P. A., Osburn, S. C.,
                                                                                     Holmes, H. M., Mumford, P. M., Romero, M. A., Young, K. C., Moon, J.
41.   Wagner D. R. (2013). Ultrasound as a tool to assess body fat. Journal          R., Gladden, L. B., Arnold, R. D., Israetel, M. A., Kirby, A. N., & Roberts,
      of obesity, 2013, 280713.                                                      M. D. (2018). Effects of Graded Whey Supplementation During
                                                                                     Extreme-Volume Resistance Training. Frontiers in nutrition, 5, 84.
42. Schoenfeld, B. J., Aragon, A. A., Moon, J., Krieger, J. W., & Tiryaki-
    Sonmez, G. (2017). Comparison of amplitude-mode ultrasound versus            54. Ward L. C. (2018). Human body composition: yesterday, today, and
    air displacement plethysmography for assessing body composition                  tomorrow. European journal of clinical nutrition, 72(9), 1201–1207.
    changes following participation in a structured weight-loss