







Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Community
Ask the community for help and clear up your study doubts
Discover the best universities in your country according to Docsity users
Free resources
Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors
Descriptive Statistics, Representing and Benchmarking, Mean, Median, Mode, Average, Most common value, Bimodal Distributions, Mixtures, Outliers, Failures of Mean are key points of this lecture.
Typology: Slides
1 / 13
This page cannot be seen from the preview
Don't miss anything!
Lecture 3: Representing & Benchmarking: Mean, Median and Mode
Data Summary REVEALS central aspect, and CONCEALS diversity. It is necessary only because we don’t have sufficient mental capacity to look at whole data set all at once. Many different ways of summarizing data. Using one number in place of whole data ALWAYS conceals more than it reveals.
Pool all resources. Divide them equally among all. Everyone’s share is the mean or average. Representative number IF there is near equality in the population. Not representative if there is substantial inequality, or diversity.
Sort data set: Middle value is median. If even number of data, then average of the two middle values. Illustrate by sorting, do both manually and in EXCEL.
Look at heights data: which is the most common measure for height? HIES data set on Family Size. What is the most common family size?
When two populations are mixed in data set, bimodal distrbutions result. GDP Data is bimodal. Plot histogram in EXCEL and show. Explain that there are rich countries and poor countries – two groups. Artificial Heights Data is bimodal. Plot already included. Display.
Mean does not represent or benchmark in bimodal distributions. Same is true of median. Illustrate by examples with GNP data set. Illustrate by example in lecture.
Modes are useful to identify cases of mixtures. Modes work as representatives in bimodal or multimodal distributions. In this case both mean and median fail.