Statistics Made Simple: The Art of Arranging Data

Statistics Made Simple The Art of Arranging Data

We commonly see raw statistics in our everyday lives, such student grades, monthly bills, product sales, or survey data. But when there are too many of these statistics, it might be hard to interpret and analyze them. This is where putting data in tables comes in. We turn random numbers into useful information by putting them in rows and columns in a methodical way.

Read Also : How to Learn Statistical Data Analysis: Key Concepts and Practical Tips

What is Data Arrangement in Tabular Form?

Putting raw data into organized tables is called data arrangement in tabular form. This makes it simpler to view, compare, and analyze the data. A table usually has headers, rows, and columns, and each of them shows a distinct part of the data.

Key Features of Tabular Presentation

1. Systematic Organization â€“ Data is arranged in logical order for clarity.

2. Comparability â€“ Makes it easy to compare values across categories.

3. Compactness â€“ Large data is summarized into smaller, understandable parts.

4. Accuracy â€“ Reduces confusion and chances of misinterpretation.

5. Ease of Analysis â€“ Facilitates statistical operations like averages, frequency, and percentages.

Types of Tabular Presentation

1. Simple Table â€“ Shows data of one variable only.
Example: Marks obtained by students in a test.

2. Double Table â€“ Shows relationship between two variables.
Example: Students’ marks based on gender.

3. Complex Table â€“ Includes multiple variables.
Example: Marks by gender and subject.

4. Frequency Table â€“ Shows how often values occur.
Example: Number of students falling in different marks range

Frequency Distribution in Statistics

A frequency distribution organizes raw data into categories or intervals and shows how often (frequency) each value occurs.

Statistics Made Simple The Art of Arranging Data

1. Class Interval 
Range between two class limits showing group width; helps organize data into equal-sized categories for frequency distribution.

2. Class Limits 
The lowest and highest values that can belong to a class interval; determine class boundaries in grouped data tables.

3. Class Boundaries 
Actual limits between two class intervals, found by adjusting limits to avoid gaps; used in histograms and continuous data.

4. Class Mark (Midpoint) 
The central value of a class interval, obtained by averaging lower and upper class limits; represents data concentration point.

Types of Frequency Distributions

Raw Data Example (Marks of 20 Students in a Test out of 50)

12, 18, 25, 30, 22, 27, 35, 40, 15, 20,

28, 32, 45, 38, 25, 18, 29, 31, 24, 26

Table with Class Intervals, Limits, Boundaries, and Class Marks

Class IntervalClass Limits (L–U)Class BoundariesClass Mark (Midpoint)Frequency
10–1910 – 199.5 – 19.514.56
20–2920 – 2919.5 – 29.524.58
30–3930 – 3929.5 – 39.534.54
40–4940 – 4939.5 – 49.544.52
Total   20

a. Simple Frequency Distribution

b. Shows the count of each value or class.

c. Example:

MarksFrequency (No. of Students)
10–196
20–298
30–394
40–492
Total20
Simple Frequency Distribution with Tally Marks
Marks RangeTallyFrequency
10–19|||| |6
20–29|||| ||||8
30–39||||4
40–49||2
Total 20

Simple Frequency Table â€“ Most students scored 20–29 marks, while few achieved 40–49, showing mid-range performance concentration.

i. Relative Frequency Distribution

ii. Shows proportion/percentage of each frequency compared to the total.

iii. Formula:

(Relative Frequency = Frequency ÷ Total)

Marks RangeFrequencyRelative Frequency
10–1960.30
20–2980.40
30–3940.20
40–4920.10
Total201.00

Relative Frequency Table â€“ Around 40% students scored in 20–29 range, highlighting it as the dominant score group.

i. Cumulative Frequency Distribution

ii. Shows the running total of frequencies up to a certain class.

iii. Two types:

Greater than Cumulative Frequency â†’ Subtracts frequencies starting from total downward.

Cumulative Frequency (Less Than Type)

Marks Range (≤)FrequencyCumulative Frequency (≤)
≤1966
≤29814
≤39418
≤49220

Cumulative Frequency (Less Than) â€“ Nearly 90% students scored below 40, indicating majority stayed under this performance threshold.

Cumulative Frequency (Greater Than Type)

Marks Range (≥)FrequencyCumulative Frequency (≥)
≥102020
≥201414
≥3066
≥4022

 Cumulative Frequency (Greater Than) â€“ Only 10% scored 40 or more, showing high marks were achieved by very few students.

Percentage Frequency Table

Marks RangeFrequencyPercentage (%)
10–19630%
20–29840%
30–39420%
40–49210%
Total20100%

Percentage Frequency Table â€“ The 20–29 class contributes 40% of total, clearly the most common scoring interval.

Real life Example

Example: Students’ Favorite Subjects Survey

Suppose a survey was conducted among 50 students about their favorite subject.

SubjectNo. of Students (Frequency)Relative Frequency (%)Cumulative Frequency
Mathematics1212/50 = 24%12
Science1515/50 = 30%27
English1010/50 = 20%37
History88/50 = 16%45
Arts55/50 = 10%50
Total50100%–
RANDBETWEEN

RANDBETWEEN is an Excel function that generates a random integer between two specified numbers.

When to use: Creating sample data, simulating scenarios, testing formulas, or assigning random values within a defined range.

Steps to Apply RANDBETWEEN for Raw Data

1. Understand the frequency table:
Marks RangeFrequency (No. of Students)
10–196
20–298
30–394
40–492
Total20 
This means:

i. 6 students scored between 10–19

ii. 8 students scored between 20–29

iii. 4 students scored between 30–39

iv. 2 students scored between 40–49

2. Generate random marks with RANDBETWEEN:

i. For 10–19 range → =RANDBETWEEN(10,19) → drag down for 6 students

ii. For 20–29 range → =RANDBETWEEN(20,29) → drag down for 8 studentsFor 30–39 range → =RANDBETWEEN(30,39) → drag down for 4 students

iii. For 40–49 range → =RANDBETWEEN(40,49) → drag down for 2 students

iv. Final Table (Raw Data Generated):

StudentRangeMarks (Generated)
110–1915
210–1912
310–1917
410–1919
510–1911
610–1914
720–2921
820–2924
920–2928
1020–2923
1120–2927
1220–2925
1320–2920
1420–2922
1530–3935
1630–3933
1730–3937
1830–3932
1940–4942
2040–4947

Why is Tabular Arrangement Important?

1 Helps teachers assess student performance trends.

2. Aids businesses in analyzing sales and costs.

3. Enables researchers to summarize survey results.

4. Provides governments with structured census and demographic reports.

Conclusion

Tabular arrangement of data is more than just a presentation method, it is the foundation of statistical analysis. By organizing raw numbers into meaningful tables, we unlock insights that drive decisions in education, business, research, and governance.

For an in-depth understanding, please refer to our book, “Academic Research Fundamentals: Research Writing and Data Analysis”. It is available as an eBook here, or you may purchase the hardcopy here .