keefereuther commited on
Commit
c44257b
·
verified ·
1 Parent(s): 1804731

Delete terms.csv

Browse files
Files changed (1) hide show
  1. terms.csv +0 -170
terms.csv DELETED
@@ -1,170 +0,0 @@
1
- TERM,CONTEXT
2
- aes() mapping,Maps variables to visual properties like x y color size in ggplot
3
- alpha (significance level),Probability threshold for Type I error commonly set at 0.05
4
- alternative hypothesis,Research hypothesis claiming an effect or difference exists
5
- animal welfare protocols,Ethical guidelines ensuring humane treatment in research with vertebrates
6
- ANOVA (one-way),Tests for mean differences across three or more groups
7
- assumptions of linear regression,Linearity normality of residuals and homoscedasticity requirements
8
- augment(),broom function that adds residuals fitted values and diagnostics to model data
9
- bar plot,Shows counts or means for categorical variables
10
- bcPower(),Function in car package for Box-Cox power transformations
11
- binary data,Categorical variable with two levels like yes/no or presence/absence
12
- bioinformatics and computational biology methods,Sequence alignment phylogenetics protein folding machine learning for genomic data
13
- biological replicate,Independent experimental units providing true replication
14
- blinding,Concealing treatment assignment to reduce bias
15
- blocking,Grouping by known variable like age or location to control its effects
16
- Bonferroni correction,Adjusts alpha by dividing by number of tests to control Type I error
17
- bootstrapping,Resampling with replacement to estimate confidence intervals and standard errors
18
- Box-Cox transformation,Power transformation to normalize data and stabilize variance using optimal lambda
19
- boxplot,Displays median IQR whiskers and outliers for group comparisons
20
- broom package,Tidies model output into data frames for easier manipulation
21
- case-control study,Compares groups with and without outcome to identify risk factors
22
- categorical data,Qualitative groups like species or treatment levels
23
- Central Limit Theorem,Sample means approach normal distribution as n increases regardless of population shape
24
- central tendency,Measures of data center including mean median and mode
25
- chi-squared goodness of fit,Tests if observed frequencies match expected frequencies for one categorical variable
26
- chi-squared test of independence,Tests if two categorical variables are associated or independent
27
- CO2 dataset,Built-in R dataset with plant uptake measurements used for regression examples
28
- coefficient of determination (R²),Proportion of variance in response explained by predictors
29
- Cohen's d,Standardized effect size measure for mean differences
30
- confidence interval (95%),Range likely to contain true parameter value with 95% confidence
31
- confounding variable,Factor that correlates with both treatment and outcome
32
- conservation biology methods,Population viability analysis habitat modeling biodiversity assessment species monitoring
33
- continuous data,Quantitative measurements like weight length or concentration
34
- control group,Baseline comparison receiving no treatment or standard treatment in experiments
35
- Cook's distance,Measures influence of each observation on regression model identifies outliers
36
- cor.test(),R function for testing correlation significance between two variables
37
- correlation coefficient (r),Standardized measure of linear association from -1 to 1
38
- cross-over design,Each participant receives all treatments in different periods with washout between
39
- cross-sectional study,Data collected at single time point across different subjects
40
- data transformation,Mathematical modifications like log or square root to meet assumptions
41
- discrete data,Count data taking only integer values
42
- double-blind,Neither participants nor researchers know treatment assignment
43
- dplyr,R package for data manipulation with verbs like select filter mutate
44
- ecology and evolution methods,Mark-recapture species distribution modeling community ecology population genetics
45
- effect size,Magnitude of difference between groups independent of sample size
46
- ethics in research,Principles ensuring participant welfare and scientific integrity
47
- experimental unit,Smallest independent unit receiving treatment assignment
48
- exploratory data analysis (EDA),Initial data examination to understand patterns before formal testing
49
- facet_grid,Creates grid of plots by two categorical variables in ggplot2
50
- facet_wrap,Creates small multiples by single variable for quick comparisons
51
- factorial design,Tests multiple factors and their interactions simultaneously
52
- false discovery rate,Expected proportion of false positives among rejected hypotheses
53
- field study,Research in natural environment with ecological validity
54
- filter(),dplyr function to subset rows based on conditions
55
- fitted values,Model predictions for each observation in regression
56
- Fligner-Killeen test,Non-parametric test for equal variances across groups
57
- generalized linear model (GLM),Extension of linear models for non-normal response distributions
58
- genomics and molecular methods,CRISPR gene editing RNA-seq ChIP-seq proteomics single-cell analysis
59
- geom_bar,Bar chart layer for categorical data in ggplot2
60
- geom_boxplot,Boxplot layer for group comparisons in ggplot2
61
- geom_histogram,Histogram layer for distribution visualization
62
- geom_point,Scatterplot layer for continuous relationships
63
- geom_smooth,Adds regression line or smoothed curve to plots
64
- ggplot2,R package for creating layered graphics using grammar of graphics
65
- group_by(),dplyr function to perform operations by groups
66
- heteroscedasticity,Unequal variance violating assumptions of parametric tests
67
- histogram,Shows distribution of continuous variable using bins
68
- homoscedasticity,Equal variance assumption for groups or across predictor range
69
- hypothesis testing framework,Structured approach to testing claims using null and alternative hypotheses
70
- IACUC,Institutional Animal Care and Use Committee overseeing vertebrate research ethics
71
- in vitro,Experiments in controlled environment outside living organism
72
- in vivo,Experiments conducted in living organisms
73
- informed consent,Ethical requirement for human subjects to voluntarily agree to participate
74
- Institutional Review Board (IRB),Committee ensuring ethical standards in human subjects research
75
- intercept,Predicted y value when x equals zero in regression equation
76
- interquartile range (IQR),Range between 25th and 75th percentiles robust to outliers
77
- iris dataset,Classic R dataset with 150 flower measurements for classification examples
78
- Kolmogorov-Smirnov test,Tests if sample comes from specified distribution like normal
79
- kurtosis,Measure of distribution tail heaviness relative to normal
80
- lambda (λ),Transformation parameter in Box-Cox determining optimal power
81
- leverage,Measure of how extreme predictor values are potential for influence
82
- linear regression,Models relationship between predictor and continuous response variable
83
- lm(),R function for fitting linear models returns coefficients and diagnostics
84
- log transformation,Common transformation for right-skewed data or multiplicative relationships
85
- longitudinal study,Data collected from same subjects over multiple time points
86
- marine and environmental science methods,Ocean sampling environmental DNA water quality assessment climate modeling
87
- MASS package,R package containing functions for modern applied statistics
88
- mean,Average value sum divided by n central tendency measure used in t-tests ANOVA
89
- median,Middle value when ordered robust central tendency measure for boxplots IQR
90
- microbiology and immunology methods,Flow cytometry ELISA viral quantification microbiome analysis antibiotic resistance testing
91
- mode,Most frequent value in dataset third measure of central tendency
92
- model diagnostics,Checking assumptions through residual plots QQ plots and formal tests
93
- multiple comparisons problem,Increased Type I error risk when conducting multiple tests
94
- multiple regression,Linear model with two or more predictor variables
95
- mutate(),dplyr function to create or modify columns
96
- negative control,Treatment known to have no effect checks for artifacts
97
- neuroscience methods,Electrophysiology fMRI optogenetics behavior tracking connectomics analysis
98
- normality,Bell-shaped Gaussian distribution assumption for parametric tests checked Week 3
99
- null hypothesis,Statement of no effect or no difference to be tested
100
- observational study,No treatment manipulation only observation of existing variation
101
- observer bias,Researcher expectations influence data collection or interpretation
102
- one-sample t-test,Tests if sample mean differs from hypothesized population value
103
- open science,Transparency practices including data sharing preprints reproducible code
104
- ordinary least squares (OLS),Method minimizing sum of squared residuals to fit regression line
105
- outlier,Data point substantially different from other observations
106
- p-value,Probability of obtaining results as extreme as observed if null hypothesis true
107
- paired t-test,Compares matched observations like before-after measurements
108
- Palmer Penguins dataset,Modern alternative to iris with 344 penguin measurements
109
- parametric tests,Statistical tests assuming specific probability distributions
110
- pilot study,Small preliminary study testing feasibility and methods
111
- pipe operator (|> or %>%),Chains functions together for readable workflows in R
112
- plant biology methods,Photosynthesis measurement growth assays metabolomics gene expression tissue culture
113
- plot(),Base R function for creating diagnostic plots from lm objects
114
- positive control,Treatment known to produce effect validates experiment
115
- post-hoc tests,Pairwise comparisons following significant omnibus test like ANOVA
116
- power (1-β),Probability of correctly rejecting false null hypothesis
117
- power analysis,Calculates needed sample size given expected effect alpha and power
118
- powerTransform(),car package function to find optimal Box-Cox lambda value
119
- pre-registration,Publishing study design and analysis plan before data collection
120
- predictor variable,Independent variable used to predict outcome in regression
121
- protected health information (PHI),Confidential patient data requiring special ethical handling
122
- pseudoreplication,Incorrectly treating non-independent observations as replicates
123
- QQ plot,Graphical method comparing data distribution to theoretical normal
124
- quasi-experimental design,Lacks random assignment but seeks causal inference
125
- R programming language,Statistical computing environment widely used in biological research
126
- R squared,Proportion of variance explained by regression model
127
- random sampling,Selection where each member has equal probability of inclusion
128
- randomization,Random assignment to treatments prevents systematic bias
129
- randomized controlled trial (RCT),Gold standard experimental design with random treatment assignment
130
- range,Maximum minus minimum quick variability check sensitive to outliers
131
- regression assumptions,Requirements including linearity normality and constant variance
132
- regression diagnostics,Tools for checking model assumptions using residuals and influence measures
133
- repeated measures design,Same subjects measured under multiple conditions reduces variance
134
- replication,Multiple independent observations per treatment group essential Week 9 concept
135
- research misconduct,Fabrication falsification plagiarism violations of scientific integrity
136
- residual standard error,Estimate of standard deviation of residuals around regression line
137
- residuals,Differences between observed and predicted values in regression
138
- response variable,Dependent variable being predicted in regression analysis
139
- sample size (n),Number of independent observations affects power and uncertainty
140
- sampling distribution,Distribution of sample statistics across repeated sampling
141
- scatterplot,Plots two continuous variables to show relationships
142
- select(),dplyr function to choose specific columns from data frame
143
- Shapiro-Wilk test,Statistical test for normality effective for small to moderate samples
144
- simple linear regression,Model with single predictor and continuous response
145
- skewness,Asymmetry in distribution with longer tail on one side
146
- slope,Rate of change in y per unit change in x regression coefficient
147
- sqrt transformation,Square root transformation for count data or moderate skew
148
- standard deviation,Average spread of data points around the mean
149
- standard error,Standard deviation of sampling distribution measures precision
150
- statistical methods in biomedicine,Clinical trials survival analysis epidemiology biomarkers meta-analysis
151
- statistical significance,Result unlikely due to chance alone typically p < 0.05
152
- stratification,Dividing population into subgroups before sampling
153
- sum of squares,Total squared deviations used in ANOVA and regression calculations
154
- summarize(),dplyr function to calculate summary statistics
155
- summary(),R function displaying model coefficients tests and fit statistics
156
- systems biology methods,Network analysis metabolic modeling multi-omics integration pathway analysis
157
- t-statistic,Test statistic for t-tests ratio of effect to standard error
158
- technical replicate,Multiple measurements of same unit not true replication
159
- three Rs principle,Replacement reduction refinement in animal research ethics
160
- tidy(),broom function converting model output to tidy data frame
161
- tidyverse,Collection of R packages for data science including ggplot2 and dplyr
162
- transformation parameter,Value like lambda determining type and strength of transformation
163
- Tukey HSD,Post-hoc test for pairwise comparisons after significant ANOVA
164
- two-sample t-test (unpaired),Compares means of two independent groups
165
- Type I error,False positive rejecting true null hypothesis
166
- Type II error,False negative failing to reject false null hypothesis
167
- variance,Square of standard deviation measuring data dispersion
168
- violin plot,Combines boxplot with kernel density to show distribution shape
169
- Welch's t-test,Modified t-test for unequal variances between groups
170
- Winsorization,Replacing extreme values with less extreme ones to reduce outlier impact