BIO3011 Lecture Notes - Lecture 1: Bar Chart, Scatter Plot, Normal Distribution

53 views3 pages

graywoodchuck63

31 May 2018

School

Monash University

Department

Science - Biology

Course

BIO3011

Professor

Christopher Johnstone

For unlimited access to Class Notes, a Class+ subscription is required.

1. Variables and data

• Variable: characteristic measured on individuals drawn from a population under study

Types of variables:

o Response = dependent (variable we are interested in examining – responds to

manipulation) I.e. usually numerical

o Explanatory = independent (variable we are manipulating or measuring in order to

observe its effect on the response variable)

• Data: measurements of one or more variables made on a collection of individuals

• Population = total number of individuals that are used to summarise/describe a group of

measurements eg. mean, median, sd, se

o Parameters = summary describing the population eg. true mean

o Population parameters are constant

• Sample = much smaller set of individuals from the population (is representative of the

population)

o Statistics/estimate = approximation (estimate) of the truth – is subject to error

-estimates value of the true body size

-uses statistics to determine how good our estimates are

o The larger the sample size = the more certainty

o Estimates are random variables

o Properties of a good sample:

1. Independent selection of individuals eg. number each individual then choose

random numbers

2. Random selection of individuals

3. Sufficiently large

• Random sampling = each member of a population has an equal and independent chance of

being selected

• Bias = systematic discrepancy between estimates and the true population characteristic

o Volunteer bias = volunteers for a study are likely to be different, on average, from the

population

eg. volunteers for medical studies may be sicker than the general population

• Sampling error = difference between the estimate and the average value of the estimate

i.e. the difference between an estimate and the population parameter are being estimated by

chance

• Larger samples on average will have smaller sampling error

find more resources at oneclass.com

Unlock document

This preview shows page 1 of the document.
Unlock all 3 pages and 3 million more documents.

Already have an account? Log in

Document Summary

Variables and data, variable: characteristic measured on individuals drawn from a population under study. Uses statistics to determine how good our estimates are: the larger the sample size = the more certainty, estimates are random variables, properties of a good sample: Larger samples on average will have smaller sampling error: two most common descriptions of data, location (central tendency) Tell us about the average or typical individual. Median = middle measurement in set of ordered data. Mode = most frequent measurement: spread (variation) Tells us how variable the measurements are from individual to individual (how different the individuals are) Gives us perspective: how large are the differences between groups compared to variation with groups. Range = max min (biased small samples tend to give lower estimates of the range than large samples) Standard deviation = positive root of the variance. Samples are not independent: type i error = rejecting a true null hypothesis.

Related Questions

Part II 1.The California Occupational Mortality study data set was employed to assess mortality data for the years 1979-1981. A 2% sample of employed persons from the 1980 census of California was used. It contains the occupation of each person who had died. This study, like any occupation-based study, is restricted to the work life span as far as age of subjects is concerned. This study used ages 16-64 years. Certain persons, because of their source of work, were excluded as subjects: homemakers, retired persons, students, disabled, military personnel, etc. Only main-stream-type employment was used in this study to establish occupations at risk for heavy alcohol drinking. Using the Table below, answer the following:

1.What occupations are most vulnerable to acquiring cirrhosis of the liver from heavy drinking?

2.List several of the occupations identified by the research studies that are more vulnerable to alcohol-induced, cirrhosis related deaths and that have the highest mortality rates.

3. What are possible confounding variables for the research in occupations and heavy drinking that lead to fatal diseases?

4. Based on available research data, develop and construct a web of causation along with the appropriate decision trees for occupational group-related alcohol deaths.

Table. Alcohol Related Deaths in California, 1979-1981 Cirrhosis 5.5% Digestive organ cancers 5.7% Injuries 13.7% Suicide and homicide 5.% All other causes 64.9% 2. Zika and microcephaly Since May 2015,

Brazil has experienced a significant outbreak of Zika virus. In recent years, Brazilian officials reported an increase in the number of babies born with microcephaly.

1. Briefly describe what we know and what we do not know about Zika virus.

2. What has been done and what should be done to prevent Zika virus epidemic in US and world-wide?

Multiple Choice

5. Which of the following statements describes the major advantages of a randomized clinical trial? a. It avoids observer bias b. It lends itself to ethical justification c. It yields results replicable in other patients d. It rules out self-selection of participants to the different treatment groups d. It enrolls representative patients

6. A survey conducted in England revealed that of 224 families in which there had been a known case of poliomyelitis, 56 maintained parakeets as a family pet. In another British survey, 30 out of 99 poliomyelitis patients questioned kept parakeets. The inference that there is some relationship between the presence of a parakeet in a household and the occurrence of poliomyelitis among household members is a. Correct b. Incorrect because of failure to distinguish between incidence and prevalence c. Incorrect because a proportionate ratio is used when a rate is required to support the inference d. Incorrect because a failure to recognize a possible cohort phenomenon e. Incorrect because there is no control or comparison group.

7. An investigator determines the correlation coefficient between triglyceride levels and degree of atherosclerosis in sampled blood vessels to be +1.67. On the basis of this you would conclude that: a. Triglyceride level is a good predictor of atherosclerosis b. Triglyceride level is not a good predictor of atherosclerosis c. High triglyceride levels cause atherosclerosis d. Atherosclerosis cause high triglyceride levels e. The investigator has incorrectly determined the correlation coefficient

8. A screening test of known sensitivity and specificity is applied to two populations. The prevalence of the disease being screened for is 10% in population A and is 1% in population B. Which of the following is true? a. The percent of all negative tests that have false-negative results is lower in population A than in population B. b. Specificity is lower in population A than in population B. c. Reliability is higher in population A than in population B. d. The percent of all positive tests that have false-positive results is lower in population A than in population B. e. Specificity is higher in population A than in population B.

9. Serum cholesterol levels are obtained for four healthy men. The probability that all will fall below the 10th percentile of the distribution of cholesterol for healthy males is: a. 0.4 b. 1 â (0.1)4 c. (0.1)4 d. (0.9)4 e. Cannot be determined from these data

10. In a diabetes screening program, the screening level for a positive blood sugar level in test 1 is set at 160 mg/dl, and in test 2 at 130 mg/dl. The sensitivity is: a. Greater in test 1 b. Greater in test 2 c. Equal in test 1 and 2 d. Dependent on the size of the population being evaluated

BIO3011 Lecture Notes - Lecture 1: Bar Chart, Scatter Plot, Normal Distribution

Document Summary

Get access

Related textbook solutions

Molecular Cell Biology

Biology: Science for Life with Physiology

Biology

Concepts of Biology

Essentials of Biology

Related Documents

BIO3011 Lecture Notes - Lecture 2: Confidence Interval, Summary Statistics, Standard Error

[EEB225H1] - Midterm Exam Guide - Ultimate 15 pages long Study Guide!

STAT 2040 Study Guide - Midterm Guide: Central Tendency, Cumulative Distribution Function, Unimodality

Related Questions