Describing Data Grapically and Numerically

shussainalawi231 15 views 59 slides Sep 25, 2024
Slide 1
Slide 1 of 59
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19
Slide 20
20
Slide 21
21
Slide 22
22
Slide 23
23
Slide 24
24
Slide 25
25
Slide 26
26
Slide 27
27
Slide 28
28
Slide 29
29
Slide 30
30
Slide 31
31
Slide 32
32
Slide 33
33
Slide 34
34
Slide 35
35
Slide 36
36
Slide 37
37
Slide 38
38
Slide 39
39
Slide 40
40
Slide 41
41
Slide 42
42
Slide 43
43
Slide 44
44
Slide 45
45
Slide 46
46
Slide 47
47
Slide 48
48
Slide 49
49
Slide 50
50
Slide 51
51
Slide 52
52
Slide 53
53
Slide 54
54
Slide 55
55
Slide 56
56
Slide 57
57
Slide 58
58
Slide 59
59

About This Presentation

Overview:

This presentation aims to provide a comprehensive understanding of how to effectively describe data using both graphical and numerical methods. We will explore various techniques and tools that can help you visualize and analyze data in a meaningful and impactful way.

Key Topics:

Introd...


Slide Content

Chapter 2 – Book 4 Describing Data Grapically and Numerically STAT 276

هذه المذكرة (النوتات) عبارة عن تجميع للمعلومات من الكتاب وما تمكنت انا وزملائي من تسجيله اثناء المحاضرات. درسنا هذه المادة في اول فصل من السنة 2023-2024 و من الطبيعي ان يتغير محتوى المقرر في الفصول اللاحقة. فيجب أن: لا تعتمد على المذكرة بشكل أساسي وارجع للكتب ممكن ان تُحذف او تُضاف بعض الأشياء في الفصول اللّاحقة، فتأكد أنك تدرس ما هو مطلوب فقط معلومة مهمة: في هذا الفصل الدراسي بالتحديد يوجد هناك كتاب رئيسي للمادة، بالإضافة إلى أربع كتب أخرى. والتي يؤخذ منها بعض المعلومات، وسوف اذكر اسم الكتاب في بداية كل فصل ممكن انني أخطأت في شي ما، وسبحان من لا يخطئ. من هذا المنطلق، من واجبك كطالب، التأكد من المعلومات الموجودة في المذكرة. يوجد في آخر ( سلايد ) اسم حسابي في برنامج ( Discord )، في حال وجود أي خطأ ارجوا التواصل معي هذه المذكرة مستوحاة من (نوتات المهندس )، ألهمني عمله أن ابدأ العمل على هذه النوتات. تمنياتي لكم بالتوفيق والنجا ح

Table of contents 2 .1 2.3 2.2 Getting started with Statistics Frequency Distribution Tables for Qualitative & Quantitative data Classification of various types of data

What is Statistics ? The term statistics is commonly used in two ways. On one hand, we use it in day-to-day communication when we refer to the collection of numbers or facts. Examples On average, the starting salary of engineers is 40% higher than that of technicians In December 2009, a total of 43 states reported rising jobless rates In 2000, the salaries of CEOs from 10 selected companies ranged from $2 million to $5 million.

On the other hand, statistics is a scientific subject that provides the techniques of: collecting , organizing , summarizing , analyzing , and interpreting the results as input To make appropriate decisions. descriptive statistics inferential statistics. In General the subject of statistics can be divided into two parts:

Descriptive Statistics Descriptive statistics uses techniques to Organize , summarize , analyze , and interpret the information contained in a data set. to draw conclusions that do not go beyond the boundaries of the data set

Inferential Statistics uses techniques that allow us to draw conclusions about a large body of data . based on the information obtained by analyzing a small portion of these data.

Population and Sample in a Statistical Study 2.1. 2

In a very general sense, Statistics is the science of collecting and analyzing data The tradition of collecting data is old. European governments began keeping records of births, deaths and marriages four centuries ago . While techniques of analyzing data have been developed only in the twentieth century . The invention of computers made it possible to use these techniques routinely .

The collection and analysis of various kinds of data has become essential in these fields Agriculture Pharmaceuticals Business Medicine Engineering Manufacturing Product distribution Government or nongovernment agencies

A population is a collection of all elements that possess a characteristic of interest Defintion of a Population Could be finite infinite

A portion of a population selected for study Defintion of a Sample

The population we want to study Or the population about which we want to make inferences based on the information contained in a sample . Defintion of a Target Population

The population we took the sample from Or The population from which a sample is being selected Defintion of a Sampled population

Usually, the sampled population and target population coincide. But In certain situations , they may not coincide . causing conclusions to be un applicable to the target population. So, it is important to ensure the sampled population is equivalent to the target population.

Why would they not coincide ?

Financial reasons Time constraint and so forth… a part of the population not being easily accessible the unexpected loss of a part of the population

The problem with collecting all conceivable values of interest on all elements is that populations are usually so large that examining each element is not feasible In a typical field, there is often need to collect numerical information on all elements of interest which is usually referred to as the population .

examining each bulb means that we must wait until each bulb dies. Thus, it is unreasonable to collect data on all the elements of interest. Example… suppose that we are interested in determining the breaking strength of the filament in a light bulb manufactured by a particular company. Filament

For instance, to study family incomes in the United States, a representative sample should consist of families from different income levels . In statistical studies, the conclusions about a population are based on the information drawn from a sample So, a representative sample is crucial for studying a population, as it should possess the characteristics of the population under investigation. such as very poor , poor , middle class , rich , and very rich. A random sample is an effective approach to achieve this goal.

A sample is called a simple random sample if each element of the population has the same chance of being included in the sample. Defintion of a Random Sample

Sample Designs Also called techniques of selecting a random sample .

Random Sampling Simple Systematic Stratified Cluster The concept that each element of the population has the same chance of being included in a sample, forms the basis of all random sampling

Collecting each data point costs time and money , So it’s important to balance between sample size and resources available when taking a sample. Sample t oo small = not much useful information. Sample too large = waste of resources. Thus, it is very important that in any sampling procedure, an appropriate sampling design is selected.

Sampling units… Taking a sample involves dividing the target population into nonoverlapping units , known as sampling units . It's crucial to recognize that these units may not always be the same . they are determined by the chosen sample design.

For example , in sampling voters in a metropolitan area, the sampling units might be individual voters , all voters in a family , all voters living in a block , or all voters in a town

A list of all sampling units Defintion of a Sampling Frame

Simple Random Sampling consists of selecting a number of sampling units in such a way that each sampling unit has the same chance of being selected. Population Sample

Example… Simple random sampling: An engineer wants to sample machine parts manufactured during a shift at a plant. As all parts are manufactured during the same shift , it is safe to assume that all parts are representative. Hence in this case, a simple random sampling design should be appropriate..

Stratified Random Sampling These strata are treated as subpopulations , with similar sampling units but differing from one another The stratified random sampling design is a sampling method that divides a population into nonoverlapping groups called strata. This design offers improved results for the same amount of money spent on simple random sampling. In the manufacturing world, this type of sampling situation arises often Multiple strata are called Stratum

Example… Stratified random sampling: Selected samples from a population of parts manufactured in different plants or shifts can be analyzed using stratified random sampling . W hich offers administrative convenience and is more appropriate than simple random sampling. This method is particularly beneficial for machine parts manufactured in different parts of the country.

Stratified Random Sampling

Systematic Random Sampling This sampling scheme is particularly useful in manufacturing processes . When the sampling is done from a continuously operating assembly line. Under this scheme, a first item is selected randomly, then every m th item manufactured is selected until we have a sample of the desired size.

Systematic Random Sampling

Cluster Random Sampling Cluster random sampling is the fourth and final sampling design. where each unit is a group of smaller units. This is particularly useful in manufacturing environments where it is challenging to prepare a list of each part of a frame. Instead, a simple random sample of boxes with many parts can be prepared. Cluster sampling allows for a large sample of smaller units at minimum cost, as both preparing the frame and taking the sample are more economical.

Population Sample Cluster Random Sampling

A variable is a characteristic of interest that may take different values for different elements Defintion of a Variable In preparing any frame, we must precisely define the characteristic of interest or variable , where a variable may be defined as follows:

Example A doctor is interested in finding the ages , heights , weights , GPA , and gender of all the students in his class. Thus, the variables (characteristics of interest) are ages , heights , weights , GPA and gender حل عدل لا الشطك

CLASSIFICATION OF VARIOUS TYPES OF DATA 2. 2

This data can include: customer satisfaction employee comments employee phone numbers weekly production volume. Data collection is a common practice, often involving nonnumerical and/or numerical data. However, all data collected cannot be treated the same way due to differences in types of data. Statistical data can be divided into two major categories .

Quantitative Data “Numerical” Qualitative Data “Categorical” Nominal Data Ordinal Data Interval Data Ratio Data Minimum information Maximum information

Here is a video that explains this topic شباب ، في هذا الفيديو الدكتورة غير محجّبة (676) Nominal, Ordinal, Interval & Ratio Data: Simple Explanation With Examples - YouTube (676) Scales of Measurement - Nominal, Ordinal, Interval, & Ratio Scale Data - YouTube هذا بديل Check Page 18 of book 4 for more information and examples There are also 6 practice problems in page 19-20

FREQUENCY DISTRIBUTION TABLES FOR QUALITATIVE AND QUANTITATIVE DATA 2. 3

In statistical applications, large amounts of messy data need to be organized and summarized using a frequency distribution table. This table is used to understand the nature of qualitative or quantitative data (categorical or numerical data). This section discusses the construction of a frequency distribution table for categorial or numerical data

2.3.1 Qualitative Data A frequency distribution table for qualitative data consists of two or more categories. Along with the numbers of the data that belong to each category. The number of data belonging to any category is called the frequency of that category.

Example… Consider a random sample of 110 small to midsize companies located in the United States and classify them according to their annual revenues (in millions of dollars). Then construct a frequency distribution table for the data obtained by this classification. We classify the annual revenues into five categories as follows: Under 250 , 250–under 500 , 500–under 750 , 750–under 1000 , 1000 or more. Then we will label each category with the numbers 1 , 2 , 3 , 4 , 5. Solution:

After tallying the data, we find that 28 companies belong in the first category, 26 in the second category, 20 in the third category, 16 in the fourth category, and 20 in the last category. Thus, a frequency distribution table for the data in Table 2.3.1 is as shown in Table 2.3.2

Interestingly we can use technology on data in Table 2.3.1 to produce Table 2.3.2

2.3.2 Quantitative Data To construct a frequency distribution table for a quantitative data set , we follow the steps given below. Find number of classes “m” Find Range “R” Find class width Assign each data point to only one class and make sure to include all data points in the table

Range = R = largest data point − smallest data point Number of classes = m = √n where n is the total number of data points in a data set the number of classes, should always be a whole number (you must round up or down) Class width = R/m The class width should preferably be a whole number that is easy to work with. this number should be obtained only by rounding up (never by rounding down) 1 2 3

The frequency distribution table is prepared by assigning each data point to a class. ensuring that each data point is assigned to one class only and all data points are included in the table. The lowest class must begin with a number ≤ the smallest data point, The highest class must end with a number ≥ the largest data point in the data set.

Prepare a frequency distribution table for these data.

Solution: Range = R = 152 − 110 = 42 Number of classes = m = = 6. 3 ≈ 6 Class width = R/m = 42/6=7   The six classes used to prepare the frequency distribution table are as follows: [110–117), [117–124), [124–131), [131–138), [138–145), [145–152].

Each class is defined by two numbers : the lower limit and the upper limit. The upper limit does not belong to the class except for the last class. ensuring that no two classes have any common point. ensuring each data point belongs to only one class.

بالتوفيق Is there anything wrong in the slides ? Talk to me on Discord (I speak both English and Arabic) my username: _sayed_

This presentation has been made using the following fonts: Lilita One ( https://fonts.google.com/specimen/Lilita+One ) Didact Gothic ( https://fonts.google.com/specimen/Didact+Gothic ) # 2e4460 # f6ffff # 686c6d # 30abb3 # eab246 # e74449 # e2e1df Fonts & colors used