π SESSION 1 β BASICS OF DATA LITERACY
(Data Meaning, Importance, Types of Data, Data Points, Data Pyramid, Impact of Data Literacy)
1. What is Data?
Data refers to raw facts and figures related to an object, person, or event.
Example: 45 kg, Class 9, 78 marks, Red colour, Kolkata etc.
UNIT 2A
Though the word “data” is technically plural of “datum”, in modern usage, data is treated as singular in data science.
2. What is Data Literacy?
According to the PDF:
Data literacy is the ability to understand, interpret and communicate with data.
UNIT 2A
Being data literate means you can:
- Ask correct questions about data
- Read and understand patterns
- Use data to make correct decisions
β Examples from student life:
- Checking if news is fake or real
- Understanding marks pattern
- Making better purchases
- Comparing phone prices
- Tracking fitness data from smartwatch
3. Why is Data Literacy Important?
The world runs on data today β schools, supermarkets, hotels, companies, politics β all rely on data.
Your PDF says:
“The systems of the world are driven by data. So, it is imperative to be an efficient data literate.”
UNIT 2A
Benefits of being Data Literate:
- Smarter consumer β no cheating during online shopping
- Informed citizen β understanding facts behind news
- Career readiness β most jobs require data skills
- Better personal choices β health, finance, studies
4. Understanding Information From Data
Your PDF explains:
βLogically related data becomes information.β
UNIT 2A
Example:
- Data: 45, 50, 55 β Marks
- Information: Student improved over 3 tests
5. Types of Data
The PDF divides data into 2 broad types:
I. Quantitative Data
Data represented by numbers.
Examples:
- Height
- Weight
- Age
- Test scores
- Temperature
UNIT 2A
II. Qualitative Data
Descriptive, non-numeric data.
Examples:
- Names
- Product description
- Gender
- Patient history
- Yes/No type answers
UNIT 2A
6. DATA POINTS
Data is not only numbers β it includes images, audio, video, documents.
Examples:
- X-ray image
- CCTV video
- Audio interview
- Photo of a product
UNIT 2A
These provide βadditional detailsβ beyond numbers.
7. THE DATA PYRAMID
(Explained across pages 2β3 of PDF)
UNIT 2A
The pyramid consists of:
1οΈβ£ DATA
Raw facts; unprocessed values.
2οΈβ£ INFORMATION
When values are logically related.
Example: Correlation between attendance & marks.
3οΈβ£ KNOWLEDGE
Meaningful insights.
Example: Student has high marks even with low attendance β indicates special interest.
4οΈβ£ WISDOM
Understanding βWhyβ.
Example: Student loves programming β explains unusual pattern.
8. IMPACT OF DATA LITERACY
Detailed on pages 3β5.
UNIT 2A
A. Impact on Individuals
β 1. Better Decision-Making
- Understanding interest rates
- Budgeting
- Interpreting fitness app data
β 2. Career Growth
Many job sectors need data skills:
- Marketing
- Finance
- Healthcare
- Business
β 3. Critical Thinking
- Detecting fake news
- Understanding trends
- Problem solving using patterns
β 4. Civic Understanding
- Understanding climate data
- Crime data analysis
- Analysing school performance data
B. Impact on Society
β Economic Growth
Companies use data for innovation (example: Amazon).
β Better Healthcare
Personalized medicine using patient data.
β Better Government Policies
Using data to identify inequalities.
β Environmental Sustainability
Analysing water usage, pollution, energy consumption.
9. CHALLENGES OF DATA LITERACY
(From page 5)
UNIT 2A
- Accessibility (digital divide)
- Privacy and ethics
- Need for continuous learning
- Misinformation/disinformation (as defined by UNICEF)
10. How to Become Data Literate?
(From page 6)
UNIT 2A
To become data literate, you must learn:
- Identifying correct data
- Knowing data types
- Cleaning data
- Analysing using tools
- Basics of statistics
- Visualising using graphs
- Understanding story behind data
- Ethics & privacy
- Critical thinking
11. Working Example: Healthy Habits Survey
Pages 6β8 include a complete example.
UNIT 2A
Steps explained:
- Designing survey
- Collecting data
- Cleaning data
- Analysing averages
- Creating graphs
- Presenting findings
This teaches the entire data cycle in practical form.
12. DATA PRIVACY & DATA SECURITY
(Pages 8β12)
UNIT 2A
β Data Privacy
Protecting personal information.
β Data Security
Protecting data from theft, hacking, unauthorized access.
Real-world examples from PDF:
- Air India Data Breach
- BigBasket Data Breach
- JusPay leak
- SolarWinds hack
- T-Mobile data breach
(All explained on pages 12β13)
13. Threats in AI & Privacy Violations
The PDF gives three major issues:
- Collecting too much personal data
- Predicting sensitive information
- Sharing without consent
UNIT 2A
14. Best Practices for Cybersecurity
(Pages 10β12)
UNIT 2A
- Strong passwords
- Two-factor authentication
- Updating software
- Avoiding phishing
- Checking HTTPS
- Using VPN on public Wi-Fi
- Limiting app permissions
- Backups
- Antivirus
15. DOs & DONβTs Table
(Pages 15β16)
UNIT 2A
DOs:
- Strong passwords
- 2FA
- Updated software
- Secure websites
- Lock your screen
- Connect only with trusted people
DONβTs:
- Do not share personal info
- Do not download unverified apps
- Do not open unknown email links
- Do not copy pirated software
π **UNIT 2A β SESSION 2
ACQUIRING, PROCESSING & INTERPRETING DATA (EXTREMELY DETAILED)**
π 1. Introduction to Data in the Real World
Modern life is driven by data.
Your PDF states:
βIn our daily activities we use and produce data.β
UNIT 2A
Examples:
- Clicking photographs (image data)
- Fitness tracker steps (numerical data)
- Online shopping history
- Exam marks & attendance
- Bank transactions
π 2. Types of Data (Very Important)
Your PDF categorizes data into two major types:
A. QUALITATIVE DATA
This is descriptive (non-numeric) data.
ξ£ Examples from PDF:
- Names
- Gender
- Address
- Patient history
UNIT 2A
β Explained
You cannot add, subtract, or calculate anything directly with qualitative data.
It answers βWhat kind?β, βWhich type?β
B. QUANTITATIVE DATA
Data expressed as numbers, which can be counted or measured.
ξ£ Examples:
- Body temperature
- Age
- House number
UNIT 2A
β Explained
You can use mathematical formulas, averages, and graphs.
π 3. Sub-Types of Quantitative Data
Your PDF goes deeper into two sub-categories:
1οΈβ£ Discrete Data
- Countable
- Whole numbers only
UNIT 2A
β Examples:
- Number of students in a class
- Number of cars in a parking lot
β Explanation
You can count discrete data, but you cannot measure it on a scale.
2οΈβ£ Continuous Data
- Measurable
- Can take any value including decimals
UNIT 2A
β Examples:
- Height (165.5 cm)
- Weight (45.3 kg)
- Time
π 4. Data in AI β How Data is Viewed by Machines
Your PDF explains that humans view data differently from AI.
UNIT 2A
Machines require data to be:
- Properly structured
- Encoded
- Labelled
Examples from PDF:
- AI sees an image as numerical grids (pixels).
- AI sees text as numerical embeddings.
π 5. Data Acquisition β Collecting Data
One of the most important skills in AI.
Your PDF lists multiple ways of acquiring data
UNIT 2A
A. PRIMARY DATA
You collect it directly by yourself.
β Methods:
- Surveys
- Questionnaires
- Interviews
- Experiments
- Observations
- Sensors (CCTV, fitness band, temperature sensors)
UNIT 2A
β Example
Conducting a βHealthy Habits Surveyβ in school.
B. SECONDARY DATA
Data collected by someone else, but used by you.
β Sources
- Websites
- Databases
- Journals
- Apps
- Social media
- Government reports
UNIT 2A
π 6. Modern Data Collection Techniques
1οΈβ£ Web Scraping
Extracting data from websites using tools like:
- Python BeautifulSoup
- Browser extensions
- API requests
Mentioned clearly in PDF.
UNIT 2A
β Example:
Scraping prices of mobiles from Flipkart.
2οΈβ£ Data Augmentation
Used mainly in AI for improving datasets.
Your PDF says:
βData augmentation is creating new data points from existing ones.β
UNIT 2A
β Examples
For images:
- Rotation
- Flipping
- Cropping
- Color adjustment
- Adding noise
For text:
- Synonym replacement
- Paraphrasing
β Why needed?
To increase training data and avoid overfitting.
3οΈβ£ Data Generation
Creating data using simulations.
Examples from PDF:
- Weather simulation
- Synthetic images
- Virtual environments
UNIT 2A
4οΈβ£ Sensors / IoT
Smart devices generate continuous data streams.
Examples:
- Smart electricity meters
- Fitness watch
- CCTV
- GPS
UNIT 2A
π 7. Understanding Quality Data (VERY IMPORTANT)
Your PDF clearly defines the qualities of βgood dataβ:
UNIT 2A
β 1. Relevance
Data must directly relate to your problem.
Example: To predict school marks, hair colour is irrelevant.
β 2. Accuracy
Data must be correct and error-free.
β 3. Completeness
No missing values.
β 4. Consistency
Data should not contradict itself.
Example:
Age: 15 years
Date of birth: 2023 β inconsistent.
β 5. Timeliness
Up-to-date data.
Old data creates wrong predictions.
β 6. Volume
Large amount of data helps AI learn better.
π 8. DATA PROCESSING β CLEANING THE DATA
Your PDF gives detailed explanation of why raw data must be cleaned.
UNIT 2A
A. Missing Values
Data may be incomplete.
β Example
Height column:
160, β , 158, 162
β Methods to fix:
- Replace with average
- Replace with mode
- Remove row
B. Outliers (Very Important)
Your PDF gives precise definition:
βAn outlier is an abnormal or unusual observation.β
UNIT 2A
β Example
Marks: 85, 88, 90, 12 β 12 is an outlier.
C. Inconsistencies
Conflicting or illogical values.
Example:
- Age 4 but Class 9 student
- Date of birth mismatch
D. Duplicates
Same entry repeated twice.
E. Wrong Data Format
Date formats differ: 12/03/2024 vs 03-12-24.
F. Unstructured Data
Images, audio, videos β need preprocessing.
PDF explains all these clearly.
UNIT 2A
π 9. Tools Used for Data Processing
Your PDF lists many:
UNIT 2A
- MS Excel
- Google Sheets
- Python
- Tableau
- SQL
- AI-based tools
π 10. DATA REPRESENTATION β TABLES & CHARTS
Your PDF uses multiple graphs (bar, line, pie) to teach data visualization.
UNIT 2A
1οΈβ£ Tables
Used to organize raw data.
- Rows β Records
- Columns β Attributes
2οΈβ£ Charts
β Bar Chart
Used for comparison.
β Line Chart
Used for trends over time.
β Pie Chart
Used for percentage distribution.
β Histogram
Used for frequency distribution.
β Scatter Plot
Used to find correlations.
π 11. INTERPRETING DATA β MAKING SENSE OF PATTERNS
Your PDF emphasizes that simply collecting data is NOT enough.
You must interpret it.
UNIT 2A
β Interpretation means:
- Understanding trends
- Finding reasons
- Making decisions
β Example from PDF β Diet & Workout Observation
UNIT 2A
Observation:
- Diet quality: High
- Workout quality: Average
- Steps per day: Above average
Interpretation:
Healthy habits improve well-being.
π 12. Real-Life Case Study (PDF Example)
Pages 16β25 provide a very detailed case study.
UNIT 2A
Topics included:
- Weekly health survey
- Data tabular sheet
- Missing values
- Outlier detection
- Graph plotting
- Creating insights
This shows the complete data pipeline in real life.
π **UNIT 2A β SESSION 3: USING DATA TO SOLVE PROBLEMS
(FULL, EXTREMELY DETAILED STUDY MATERIAL)**
This session focuses on how to interpret, analyse, and use data to make decisions in real-world situations.
β 1. What Is Data Interpretation?
According to your PDF, data interpretation means:
βRelating the data logically, identifying patterns, testing ideas, and deriving insights.β
UNIT 2A
In simple words:
Data interpretation = Understanding what the data is trying to tell us.
This helps us:
- Make decisions
- Solve problems
- Understand trends
- Predict outcomes
β 2. Why Do We Need Data Interpretation?
Your PDF clearly states that interpretation helps in:
- Critical thinking
- Problem solving
- Making informed decisions
UNIT 2A
Examples:
β Teachers analyse student performance β find weak areas
β Doctors analyse reports β decide treatments
β Businesses analyse sales β choose discounts
β Government analyses crime data β improve safety
β 3. Quantitative Data Interpretation
Your PDF explains 9 major uses of quantitative data (very important).
Pages 48β50.
UNIT 2A
Letβs explain each in simple words:
1οΈβ£ Comparison
Used to compare values.
Example: Comparing marks of two classes.
2οΈβ£ Trend Analysis
Studying how values increase or decrease over time.
Example: Temperature over a week.
3οΈβ£ Correlation
Finding if two values move together.
Example:
More study hours β higher marks (positive correlation)
4οΈβ£ Measuring Performance
Used to check improvement.
Example:
A shop checking daily sales.
5οΈβ£ Decision-Making
Making decisions based on numbers instead of guesswork.
Example:
Which product to restock based on highest sales.
6οΈβ£ Resource Allocation
Distributing resources properly.
Example:
School decides to add new teachers based on student strength.
7οΈβ£ Predicting Future
Using past data to predict future patterns.
Example:
Weather forecasting using temperature & humidity.
8οΈβ£ Understanding Distribution
Finding how values are spread.
Example:
Most students scoring 70β80 shows moderate performance.
9οΈβ£ Assessing Impact
Checking if something had an effect.
Example:
Did the new teaching method improve marks?
β 4. Correlation vs Causation
Your PDF highlights this as an important reasoning skill.
(Implicit in analysis sections)
β Correlation
Two things happen together.
Example:
Ice-cream sales and temperature both increase.
β Causation
One thing causes the other.
Example:
More study β better marks.
Important:
Correlation does NOT always mean causation.
β 5. Tools Used for Data Interpretation
PDF mentions various tools including:
- Excel
- Google Sheets
- Tableau (detailed section p.56β59)
UNIT 2A
Tableau is used for:
- Interactive dashboards
- Bar, line, scatter, pie charts
- Analysing data visually
β 6. Understanding Qualitative Data Interpretation
(Explained in detail on pages 42β45)
UNIT 2A
Qualitative data means descriptive text or words (not numbers).
Steps of interpretation:
1οΈβ£ Data Collection
Interviews, focus groups, observations
UNIT 2A
2οΈβ£ Data Organisation
Convert responses into text documents.
3οΈβ£ Coding & Categorising
Identify themes like:
- βtechnical issuesβ
- βfood qualityβ
- βservice problemsβ
4οΈβ£ Analysis
Find patterns.
5οΈβ£ Interpretation
Understand bigger meaning.
β 7. Examples from the PDF (Very Important)
Your PDF provides multiple real-world qualitative interpretation examples.
Example 1 β Student Experiences in Remote Learning
(Page 45)
UNIT 2A
Issues found:
- Technical problems
- Lack of interaction
- Benefit: Flexible timings
Interpretation β Improve tech support and add interactive sessions.
Example 2 β Customer Preferences for Products
(Page 45)
UNIT 2A
Findings:
- Design appealing
- Price too high
Interpretation β Adjust price and simplify features.
Example 3 β Employee Satisfaction
(Page 45)
UNIT 2A
Themes:
- Work-life balance
- Job satisfaction
Interpretation β Improve employee support.
β 8. Combining Quantitative & Qualitative Data
Your PDF emphasises:
βBoth types help understand complete picture.β
UNIT 2A
Example:
- Quantitative β 80% students dislike cafeteria food
- Qualitative β Reasons include oily food, taste, hygiene
Combining both β Improves decision-making.
β 9. Steps to Solve Any Problem Using Data
(Complete workflow based on your PDF)
STEP 1: Identify the Problem
Example: Students unhappy with school cafeteria.
STEP 2: Collect Data (Quantitative + Qualitative)
From:
- Surveys
- Interviews
- Observations
UNIT 2A
STEP 3: Clean Data
Remove:
- Missing values
- Duplicates
- Outliers
UNIT 2A
STEP 4: Organise Data into Tables
Attributes (columns):
- Meal name
- Rating
- Grade
UNIT 2A
STEP 5: Analyse Data
Use graphs:
- Bar chart
- Pie chart
- Line graph
UNIT 2A
STEP 6: Interpret Results
Example:
Most students prefer Rajma Chawal β Add more servings.
STEP 7: Present Findings
Using Tableau or PPT.
UNIT 2A
β 10. Common Student Errors in Data Interpretation
(Explained through exercises in PDF)
UNIT 2A
Mistakes include:
- Wrong graph selection
- Ignoring outliers
- Confusing correlation with causation
- Misreading percentages
β 11. Case Study β School Cafeteria Survey
(Pages 55β56)
UNIT 2A
Students must:
- Collect 10 survey responses
- Clean the data
- Find average food ratings
- Identify popular/least popular items
- Suggest improvements
- Create graphs
β 12. Why Data Interpretation Matters in AI
Data interpretation helps AI to:
- Detect patterns
- Make predictions
- Learn from trends
- Improve accuracy
Without correct interpretation β AI gives wrong results.