Statistics Made Simple: The Art of Arranging Data

We commonly see raw statistics in our everyday lives, such student grades, monthly bills, product sales, or survey data. But when there are too many of these statistics, it might be hard to interpret and analyze them. This is where putting data in tables comes in. We turn random numbers into useful information by putting them in rows and columns in a methodical way.
Read Also : How to Learn Statistical Data Analysis: Key Concepts and Practical Tips
What is Data Arrangement in Tabular Form?
Putting raw data into organized tables is called data arrangement in tabular form. This makes it simpler to view, compare, and analyze the data. A table usually has headers, rows, and columns, and each of them shows a distinct part of the data.
Key Features of Tabular Presentation
1. Systematic Organization – Data is arranged in logical order for clarity.
2. Comparability – Makes it easy to compare values across categories.
3. Compactness – Large data is summarized into smaller, understandable parts.
4. Accuracy – Reduces confusion and chances of misinterpretation.
5. Ease of Analysis – Facilitates statistical operations like averages, frequency, and percentages.
Types of Tabular Presentation
1. Simple Table – Shows data of one variable only.
Example: Marks obtained by students in a test.
2. Double Table – Shows relationship between two variables.
Example: Students’ marks based on gender.
3. Complex Table – Includes multiple variables.
Example: Marks by gender and subject.
4. Frequency Table – Shows how often values occur.
Example: Number of students falling in different marks range
Frequency Distribution in Statistics
A frequency distribution organizes raw data into categories or intervals and shows how often (frequency) each value occurs.

1. Class Interval
Range between two class limits showing group width; helps organize data into equal-sized categories for frequency distribution.
2. Class Limits
The lowest and highest values that can belong to a class interval; determine class boundaries in grouped data tables.
3. Class Boundaries
Actual limits between two class intervals, found by adjusting limits to avoid gaps; used in histograms and continuous data.
4. Class Mark (Midpoint)
The central value of a class interval, obtained by averaging lower and upper class limits; represents data concentration point.
Types of Frequency Distributions
Raw Data Example (Marks of 20 Students in a Test out of 50)
12, 18, 25, 30, 22, 27, 35, 40, 15, 20,
28, 32, 45, 38, 25, 18, 29, 31, 24, 26
Table with Class Intervals, Limits, Boundaries, and Class Marks
| Class Interval | Class Limits (L–U) | Class Boundaries | Class Mark (Midpoint) | Frequency |
| 10–19 | 10 – 19 | 9.5 – 19.5 | 14.5 | 6 |
| 20–29 | 20 – 29 | 19.5 – 29.5 | 24.5 | 8 |
| 30–39 | 30 – 39 | 29.5 – 39.5 | 34.5 | 4 |
| 40–49 | 40 – 49 | 39.5 – 49.5 | 44.5 | 2 |
| Total | 20 |
a. Simple Frequency Distribution
b. Shows the count of each value or class.
c. Example:
| Marks | Frequency (No. of Students) |
| 10–19 | 6 |
| 20–29 | 8 |
| 30–39 | 4 |
| 40–49 | 2 |
| Total | 20 |
Simple Frequency Distribution with Tally Marks
| Marks Range | Tally | Frequency |
| 10–19 | |||| | | 6 |
| 20–29 | |||| |||| | 8 |
| 30–39 | |||| | 4 |
| 40–49 | || | 2 |
| Total | 20 |
Simple Frequency Table – Most students scored 20–29 marks, while few achieved 40–49, showing mid-range performance concentration.
i. Relative Frequency Distribution
ii. Shows proportion/percentage of each frequency compared to the total.
iii. Formula:
(Relative Frequency = Frequency ÷ Total)
| Marks Range | Frequency | Relative Frequency |
| 10–19 | 6 | 0.30 |
| 20–29 | 8 | 0.40 |
| 30–39 | 4 | 0.20 |
| 40–49 | 2 | 0.10 |
| Total | 20 | 1.00 |
Relative Frequency Table – Around 40% students scored in 20–29 range, highlighting it as the dominant score group.
i. Cumulative Frequency Distribution
ii. Shows the running total of frequencies up to a certain class.
iii. Two types:
Greater than Cumulative Frequency → Subtracts frequencies starting from total downward.
Cumulative Frequency (Less Than Type)
| Marks Range (≤) | Frequency | Cumulative Frequency (≤) |
| ≤19 | 6 | 6 |
| ≤29 | 8 | 14 |
| ≤39 | 4 | 18 |
| ≤49 | 2 | 20 |
Cumulative Frequency (Less Than) – Nearly 90% students scored below 40, indicating majority stayed under this performance threshold.
Cumulative Frequency (Greater Than Type)
| Marks Range (≥) | Frequency | Cumulative Frequency (≥) |
| ≥10 | 20 | 20 |
| ≥20 | 14 | 14 |
| ≥30 | 6 | 6 |
| ≥40 | 2 | 2 |
Cumulative Frequency (Greater Than) – Only 10% scored 40 or more, showing high marks were achieved by very few students.
Percentage Frequency Table
| Marks Range | Frequency | Percentage (%) |
| 10–19 | 6 | 30% |
| 20–29 | 8 | 40% |
| 30–39 | 4 | 20% |
| 40–49 | 2 | 10% |
| Total | 20 | 100% |
Percentage Frequency Table – The 20–29 class contributes 40% of total, clearly the most common scoring interval.
Real life Example
Example: Students’ Favorite Subjects Survey
Suppose a survey was conducted among 50 students about their favorite subject.
| Subject | No. of Students (Frequency) | Relative Frequency (%) | Cumulative Frequency |
| Mathematics | 12 | 12/50 = 24% | 12 |
| Science | 15 | 15/50 = 30% | 27 |
| English | 10 | 10/50 = 20% | 37 |
| History | 8 | 8/50 = 16% | 45 |
| Arts | 5 | 5/50 = 10% | 50 |
| Total | 50 | 100% | – |
RANDBETWEEN
RANDBETWEEN is an Excel function that generates a random integer between two specified numbers.
When to use: Creating sample data, simulating scenarios, testing formulas, or assigning random values within a defined range.
Steps to Apply RANDBETWEEN for Raw Data
1. Understand the frequency table:
| Marks Range | Frequency (No. of Students) |
| 10–19 | 6 |
| 20–29 | 8 |
| 30–39 | 4 |
| 40–49 | 2 |
| Total | 20 |
This means:
i. 6 students scored between 10–19
ii. 8 students scored between 20–29
iii. 4 students scored between 30–39
iv. 2 students scored between 40–49
2. Generate random marks with RANDBETWEEN:
i. For 10–19 range → =RANDBETWEEN(10,19) → drag down for 6 students
ii. For 20–29 range → =RANDBETWEEN(20,29) → drag down for 8 studentsFor 30–39 range → =RANDBETWEEN(30,39) → drag down for 4 students
iii. For 40–49 range → =RANDBETWEEN(40,49) → drag down for 2 students
iv. Final Table (Raw Data Generated):
| Student | Range | Marks (Generated) |
| 1 | 10–19 | 15 |
| 2 | 10–19 | 12 |
| 3 | 10–19 | 17 |
| 4 | 10–19 | 19 |
| 5 | 10–19 | 11 |
| 6 | 10–19 | 14 |
| 7 | 20–29 | 21 |
| 8 | 20–29 | 24 |
| 9 | 20–29 | 28 |
| 10 | 20–29 | 23 |
| 11 | 20–29 | 27 |
| 12 | 20–29 | 25 |
| 13 | 20–29 | 20 |
| 14 | 20–29 | 22 |
| 15 | 30–39 | 35 |
| 16 | 30–39 | 33 |
| 17 | 30–39 | 37 |
| 18 | 30–39 | 32 |
| 19 | 40–49 | 42 |
| 20 | 40–49 | 47 |
Why is Tabular Arrangement Important?
1 Helps teachers assess student performance trends.
2. Aids businesses in analyzing sales and costs.
3. Enables researchers to summarize survey results.
4. Provides governments with structured census and demographic reports.
Conclusion
Tabular arrangement of data is more than just a presentation method, it is the foundation of statistical analysis. By organizing raw numbers into meaningful tables, we unlock insights that drive decisions in education, business, research, and governance.
For an in-depth understanding, please refer to our book, “Academic Research Fundamentals: Research Writing and Data Analysis”. It is available as an eBook here, or you may purchase the hardcopy here .