Spaces:
Sleeping
Sleeping
Upload week4_running.csv
Browse files- week4_running.csv +85 -0
week4_running.csv
ADDED
|
@@ -0,0 +1,85 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
TERM,CONTEXT
|
| 2 |
+
Help with a code bug in R,For this option you should prompt the user to copy and paste the relevant code and the error as it appears in the console. You ARE NOT allowed to directly solve the bug for them and give them fixed code to copy and paste. Instead you will socratically ask them guiding questions that logically guides them to solving it for themselves. Use palmer penguins for your examples. The issue will be in relation to RStudio R and Rmd files.
|
| 3 |
+
Testable Hypothesis,A hypothesis that can be evaluated through empirical evidence and experimentation.
|
| 4 |
+
Falsifiable Hypothesis,A hypothesis that allows for the possibility of being disproven by data.
|
| 5 |
+
Prediction vs. Hypothesis,A prediction is a specific expected outcome of an experiment based on a hypothesis.
|
| 6 |
+
Environmental Factors in Foraging Behavior of white-footed mice,Includes habitat type - availability of refuge - predator presence - moonlight - and temporal effects.
|
| 7 |
+
Scientific Research Question,A specific and clear question that guides an investigation and is based on observations. Include information that is specific to the population experimental design and collected data
|
| 8 |
+
Background Research in Science,The step where existing literature and prior knowledge are reviewed to refine research questions.
|
| 9 |
+
Keefe’s Hypothesis Trick,A strategy for writing clear hypotheses by imagining the axes and predicted outcomes in a graph. For example - X axis is positively correlated with Y axis
|
| 10 |
+
Alternative Hypothesis (Hₐ),A statement suggesting that there is a meaningful relationship or effect between variables.
|
| 11 |
+
Null Hypothesis (H₀),The assumption that there is no effect or difference unless proven otherwise. It is easier to disprove a null than prove an alternative hypothesis
|
| 12 |
+
Parametric statistical tests,
|
| 13 |
+
Three Assumptions of most parametric stat tests, Normality Homoscedasticity (equal variance between groups) and Data Independence.
|
| 14 |
+
skewness, of a distribution; skewed left or right
|
| 15 |
+
kurtosis, of a distribution; platykurtic and leptokurtic
|
| 16 |
+
Common data transformations, square root; natural log; log10; multiplicative inverse; or rank transformation as a last resort
|
| 17 |
+
How to choose the right transformation?, Choose the transformation that best meets normality assumptions while preserving interpretability.
|
| 18 |
+
back transformation, why is this necessary in stat after a transformation
|
| 19 |
+
The Kolmogorov-Smirnov (KS) test,
|
| 20 |
+
The Shapiro-Wilks test,
|
| 21 |
+
The D'Agostino's K^2 test,
|
| 22 |
+
How robust is the assumption of normality?, very unless the sample size is low or the data is highly skewed; Do non-parametric test and parametric tests and see if the results agree.
|
| 23 |
+
Continuous probability distributions common in biological systems,
|
| 24 |
+
Discontinuous probability distributions common in biological systems,
|
| 25 |
+
distributions,Within the context of statistics and populations
|
| 26 |
+
hypothesis testing,Within the context of statistics and populations
|
| 27 |
+
null hypothesis,Within the context of statistics and populations
|
| 28 |
+
The wisdom of crows,Within the context of statistics and populations
|
| 29 |
+
Stabilizing selection,an example of a mechanism that creates normal distributions in nature.
|
| 30 |
+
The Central Limit Theorem,Within the context of statistics and populations
|
| 31 |
+
Standard error,Standard error of the mean tells you how accurate your estimate of the real mean it is likely to be.
|
| 32 |
+
Confidence intervals,
|
| 33 |
+
The difference between standard error and standard deviation,
|
| 34 |
+
Data Visualization in Biology,Questions to ask yourself: How does my data vary? Are variables correlated? Are there outliers?
|
| 35 |
+
The Palmer Penguins dataset,the primary example dataset used in class. Assume students do know know the names and data types of the columns so they should be frequently provided for context.
|
| 36 |
+
bar plot,
|
| 37 |
+
scatterplot,
|
| 38 |
+
line graph,
|
| 39 |
+
histogram,
|
| 40 |
+
box plot,
|
| 41 |
+
violin plot,
|
| 42 |
+
heat map,
|
| 43 |
+
pie charts,this is an awful choice for many reasons.
|
| 44 |
+
O-ring failure on the Challenger and bad data visualization,
|
| 45 |
+
ggplot2 and the grammar of graphics,
|
| 46 |
+
descriptive statistics,
|
| 47 |
+
robust,this is in reference to the robustness of a model or descriptive statistic.
|
| 48 |
+
centrality and variation in statistics,
|
| 49 |
+
interquartile range,
|
| 50 |
+
standard deviation,
|
| 51 |
+
variance,this is in reference to the descriptive statistic.
|
| 52 |
+
sum of squares,
|
| 53 |
+
Anscombe’s quartet,
|
| 54 |
+
range,this is in reference to the descriptive statistic.
|
| 55 |
+
When should you look at your data?,Early and often with data visualization. Science is an iterative process where the initial experimental plans often won't work because of some unknown patterns in your population. Examining your sample visually can help show those to you - like outliers or weird distributions.
|
| 56 |
+
Science,Science is the pursuit and application of knowledge and understanding of the natural and social world following a systematic methodology based on evidence.
|
| 57 |
+
PPDAC - Problem,
|
| 58 |
+
PPDAC - Plan,
|
| 59 |
+
PPDAC - Data,
|
| 60 |
+
PPDAC - Analysis,
|
| 61 |
+
PPDAC - Conclusion,This step cycles back to 'PPDAC - Problem' because science is an iterative process that creates new questions and directions.
|
| 62 |
+
Biological biases,
|
| 63 |
+
Cognitive biases,
|
| 64 |
+
Confirmation bias,
|
| 65 |
+
Availability heuristic,
|
| 66 |
+
Anchoring bias,
|
| 67 |
+
Dunning-Kruger effect,
|
| 68 |
+
Why are statistical and programming knowledge useful for a career in biology?,1. Biological data is messy and complex. 2. Biology uses BIG data. 3. These skills save lives. 4. These skills will help your career.
|
| 69 |
+
Big data in biology,
|
| 70 |
+
Name a disease that is influenced by many different genetic and environmental factors,
|
| 71 |
+
a priori hypotheses,
|
| 72 |
+
Which natural process is most similar to machine learning?," ""evolution by natural selection"""
|
| 73 |
+
How will careers in biology be affected by generative AI?," ""The barriers to learning new skills are falling. The need to learn programming and data analysis is greater now than it was in 2020! Some jobs will be automated. Those with the rarest and most useful combination of skills will be sought after."""
|
| 74 |
+
Data types - continuous/numerical,
|
| 75 |
+
Data types - count/integer,
|
| 76 |
+
Data types - ordinal,
|
| 77 |
+
Data types - categorical,
|
| 78 |
+
Data types - binomial,
|
| 79 |
+
Null data,Data missing in a dataset. It is important to appropriately deal with this data depending on the nature of the statistical analysis and experimental design.
|
| 80 |
+
tidy data,
|
| 81 |
+
R programming - objects,
|
| 82 |
+
R programming - functions,
|
| 83 |
+
R programming - Rmd file format,
|
| 84 |
+
RStudio,
|
| 85 |
+
print() in R,
|