PUBH2007 Lecture Notes - Lecture 3: Prostate Cancer Screening, Random Assignment, Cluster Sampling

32 views9 pages

aquamarinewasp28

1 Jun 2018

School

Department

Public Health

Course

PUBH2007

Professor

Dr Michael Waller

For unlimited access to Class Notes, a Class+ subscription is required.

PUBH2007 LECTURE THREE

This week:

• Reasons for sampling

• Different methods of sampling

• Design of experiments and other types of studies

Last two weeks: exploratory data analysis

Now: statistical inference

Exploratory Data Analysis:

• Purpose: unrestricted exploration of the data, searching for interesting patterns

• Conclusions apply only to the individuals and circumstances for which we have data

• Conclusions: informal, based on what we see in the data

Statistical inference:

• Purpose: to answer specific questions, posed before the data were produced

• Conclusions apply to a larger group of individuals or broader class of circumstances

• Conclusions: formal, backed by a statement of our confidence in them

Populations and samples: direction of inference

Population (happy and sad) -> Sampling -> Sample (all happy)

• Need to express uncertainty about the assumptions we make i.e. confidence intervals

Populations and Samples - example

Population we want information about - entire group of interest e.g. everyone in Australia with asthma

(target population)

Population actually studied - subgroup of target population available for study e.g. people obtaining

asthma medication in pharmacies

Sample selected - people selected for study e.g. people obtaining asthma medication from 20 selected

pharmacies in Brisbane city

Sample on whom data are collected - e.g. people obtaining asthma medication from these 20 selected

pharmacies in Brisbane City on 9th August 2016

Population and Sample - definitions

The distinction between population and sample is basic to statistics. To make sense of any sample result,

you must know what population the sample represents.

find more resources at oneclass.com

Unlock document

This preview shows pages 1-3 of the document.
Unlock all 9 pages and 3 million more documents.

Already have an account? Log in

Population: in a statistical study; the entire group of individuals about which we want information

Sample: the part of the population from which we actually collect information. We use information from

a sample to draw conclusions about the entire population

Sampling design: describes exactly how to choose the sample from the population

Non-random methods of sample selection

Volunteers : e.g. SMS votes for TV show are likely to be biased by people who hold strong views, certain

age groups

Convenience sample: e.g. 'intercept' sample in shopping centre - liable to bias - who is in the shopping

centre at the particular time? Near which shops?

Haphazard sample: selected by some process that does not use random numbers

Non-random methods of sample selection - issues

1. Usually we do’t kow who the actual target population are exactly

2. Hence it is difficult to assess true extent of biases

3. Sampling method may not be reproducible

Randomisation

• Idea due to R. A. (Sir Ronald) Fisher

• Use impersonal CHANCE to assign participants to groups

E.g. tossing coin/drawing name from hat

• We use a table of random numbers to illustrate the idea

• In practice use a computer to generate the random numbers

SIMPLE RANDOM SAMPLE (SRS): of size n consists of n individuals from the population chosen in such a

way that every set of n individuals has an equal chance to be the sample actually selected

• Assign study ID numbers to 12 study participants

• Group 1 and Group 2

• Take pairs of numbers (because we want values between 01 and 12)

• Discard 11, those greater than 12, and any already selected

• Continue generating random numbers in random number table until you have 6 people in Group 1

(based on random number table)

• Assign those that have not been selected for Group 1 to Group 2

Table of random numbers

• Digits , ,  …. 9 ae listed so that the digit i ay positio has a eual hae of eig , ….9

• Digits in different positions are independent, i.e. value in one doesn't influence value in any other

• Any pair of numbers has an equal chance of being 0,  ….99

find more resources at oneclass.com

Unlock document

This preview shows pages 1-3 of the document.
Unlock all 9 pages and 3 million more documents.

Already have an account? Log in

• Ay tiple of ues has a eual hae of eig , ….999

Advantages of random selection

• Can be used for choosing a sample of participants for a survey or for allocating participants to

groups for a comparative study

• Uses chance to select participants or allocate participants to groups

• Prevents bias or systematic favouritism

• Creates groups which are, on average, similar before any intervention is applied, so interpretation of

the study findings is clearer

• Can be done efficiently by computer (much better than similar example above) which can cope with

much more complex designs

• Easy to explain and reproducible

• Minimises bias (provided you can collect valid data form every participant selected in the sample)

Simple random sampling - issues

• You need a SAMPLE FRAME i.e. list of target population - often very difficult to obtain

• Impractical in many situations

Observation vs Experiment

Observational study: we observe individuals and measure variables of interest but do not attempt to

influence the responses. The purpose is to describe some group or situation.

Experiment: deliberately impose some treatment on individuals to measure their responses. The

purpose is to study whether the treatment causes a change in the response.

When the goal is to understand cause and effect, experiments are the only source of fully convincing

data. But results may not be generalisable to all situations.

Case report:

• Unusual observation about a single case with wider implications

• E.g. adverse reaction to treatment, strange signs/symptoms for known condition, new disease

• Can alert other people to look for other cases or causes

• OBSERVATIONAL STUDY

Case series:

• Data from a number of cases (e.g. breast cancer cases among women at ABC studio in Toowong)

• Usually there isn't a clear testable research question

• May be biases, unrepresentative or not generalisable to other settings

• May lead to formulation of testable hypothesis for further studies

• OBSERVATIONAL STUDY

Cross-sectional study or sample survey

find more resources at oneclass.com

Unlock document

This preview shows pages 1-3 of the document.
Unlock all 9 pages and 3 million more documents.

Already have an account? Log in

Document Summary

This week: reasons for sampling, different methods of sampling, design of experiments and other types of studies. Exploratory data analysis: purpose: unrestricted exploration of the data, searching for interesting patterns, conclusions apply only to the individuals and circumstances for which we have data, conclusions: informal, based on what we see in the data. Statistical inference: purpose: to answer specific questions, posed before the data were produced, conclusions apply to a larger group of individuals or broader class of circumstances, conclusions: formal, backed by a statement of our confidence in them. Population (happy and sad) -> sampling -> sample (all happy: need to express uncertainty about the assumptions we make i. e. confidence intervals. Population we want information about - entire group of interest e. g. everyone in australia with asthma (target population) Population actually studied - subgroup of target population available for study e. g. people obtaining asthma medication in pharmacies.

PUBH2007 Lecture Notes - Lecture 3: Prostate Cancer Screening, Random Assignment, Cluster Sampling

Document Summary

Get access

Related Documents

Statistical Sciences 2244A/B Study Guide - Quiz Guide: Statistical Inference, Statistical Parameter, Sampling Error

Biology 2244A/B Lecture Notes - Inference, Combined Oral Contraceptive Pill, Confidence Interval

STAT202 Study Guide - Final Guide: Confidence Interval, Interval Estimation, Statistical Parameter