Skip to content

MIT-Emerging-Talent/afghan-girls-online-learning-analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

49 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

Afghan Girls Online Education

What Helps Afghan Girls Succeed in Online College?

A data-driven study on online learning effectiveness for Afghan women in higher education

License: MIT Python Status


๐Ÿ“– About

This research project investigates the factors that enable Afghan girls to succeed in online university programs. Through an anonymous survey of 322 students, we examined infrastructure challenges, teaching quality, safety concerns, and support systems to provide evidence-based recommendations for improving online higher education access.

Sample Overview

Figure 1: Survey demographics and key outcome metrics

Key Findings at a Glance

Metric Value
๐Ÿ“Š Total Responses 322 students
๐Ÿ“ Location 87.3% inside Afghanistan
โญ Overall Satisfaction 3.51 / 5.0
๐ŸŽฏ Persistence Rate 64.6% plan to continue

๐ŸŽฏ Research Questions

  1. Which conditions most strongly influence satisfaction and persistence in online learning?
  2. How do infrastructure constraints limit the effectiveness of pedagogical practices?
  3. Do safety and family support buffer against connectivity challenges?

๐ŸŒ Context: Why This Research Matters

The Afghanistan Education Crisis

Afghanistan is the only country in the world banning girls from secondary and women from higher education (UNESCO, 2024). Approximately 2.2 million girls remain out of school, making online learning a critical lifeline for those seeking education.

Indicator Current Status
๐Ÿšซ Education Ban Only country with complete ban on girls' secondary & women's higher education
๐Ÿ‘ง Girls Affected ~2.2 million out of school (UNESCO, Aug 2025)
๐ŸŒ Internet Access ~13.2M users, only 30.5% penetration (DataReportal, Jan 2025)
โšก Infrastructure Fragile services, frequent outages, high costs

Recent Disruptions (2025)

  • September 29-30: Nationwide internet and telecom blackout

These conditions make understanding what helps students persist despite challenges critically important.


๐Ÿ“š Theoretical Framework

Our research draws on four established frameworks to understand online learning success:

Framework Core Idea What We Measured
Community of Inquiry (CoI) Learning needs teaching, social, and cognitive presence Assignment clarity, instructor feedback, language accessibility
Self-Determination Theory (SDT) Motivation requires autonomy, competence, and relatedness Study flexibility, feeling safe, family support
Tinto's Integration Model Persistence needs academic and social integration Materials access, platform usability, community connection
TAM/UTAUT Technology adoption depends on usefulness and ease of use Platform usability, video loading, internet reliability

Theory โ†’ Survey โ†’ Insights

โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”     โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”     โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚   Theoretical   โ”‚     โ”‚  Survey Design  โ”‚     โ”‚   Actionable    โ”‚
โ”‚   Frameworks    โ”‚ โ”€โ”€โ–บ โ”‚   (25 items)    โ”‚ โ”€โ”€โ–บ โ”‚   Insights      โ”‚
โ”‚   CoI, SDT,     โ”‚     โ”‚  Infrastructure โ”‚     โ”‚  for improving  โ”‚
โ”‚   Tinto, TAM    โ”‚     โ”‚  Pedagogy, Safetyโ”‚    โ”‚  online learningโ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜     โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜     โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜

๐Ÿ“ View Full Research Background โ†’


๐Ÿ“Š Results & Analysis

๐Ÿ”Œ Infrastructure Challenges

Students face significant connectivity and power issues, especially inside Afghanistan.

Infrastructure Challenges

Figure 2: Internet and electricity reliability gaps by location

  • Internet Reliability: 2.61/5.0 (below neutral)
  • Electricity Reliability: 2.61/5.0 (below neutral)
  • Implication: Offline materials and flexible deadlines are critical

๐Ÿ“š Pedagogical Factors

Teaching quality is the strongest driver of student satisfaction.

Pedagogical Factors

Figure 3: Teaching quality ratings across different dimensions

Strength Gap to Address
โœ… Language understandability โš ๏ธ Instructor feedback
โœ… Platform usability โš ๏ธ Assignment clarity
โš ๏ธ Video quality

๐Ÿ›ก๏ธ Safety & Support Systems

Nearly half of students face fear or lack adequate family support.

Support and Safety

Figure 4: Support and safety levels across key dimensions

  • Can Study Without Fear: 42.8%
  • Have Family Support: 52.8%
  • Feel Safe Online: Higher digital security perception

๐Ÿ“ˆ Persistence Predictors

Logistic regression reveals the strongest factors influencing student continuation.

Persistence Predictors

Figure 5: Top predictors of student persistence (standardized coefficients)

Top Positive Predictors:

  1. ๐Ÿ† Instructor feedback quality
  2. ๐Ÿ‘จโ€๐Ÿ‘ฉโ€๐Ÿ‘ง Family support
  3. ๐Ÿ“ Assignment clarity
  4. ๐Ÿ  Ability to study without fear
  5. ๐Ÿ“– Materials access

๐Ÿ’ก Evidence-Based Recommendations

Recommendations

Figure 6: Five key recommendation areas based on our findings

Satisfaction vs Persistence

Figure 7: Strong correlation between satisfaction and intention to continue

๐Ÿ“ View Complete Analysis Details โ†’


๐ŸŽฏ Key Recommendations

Based on our analysis, we propose five evidence-based strategies for improving online learning outcomes:

Priority Recommendation Expected Impact
๐Ÿฅ‡ 1 Enhance Instructor Feedback โ€” Increase frequency, clarity, and responsiveness Strongest predictor of both satisfaction and persistence
๐Ÿฅˆ 2 Support Families โ€” Develop family engagement programs and resources Buffers against infrastructure challenges
๐Ÿฅ‰ 3 Improve Assignment Clarity โ€” Clear instructions, examples, and scaffolding Reduces confusion and dropout
4๏ธโƒฃ Provide Offline Materials โ€” Downloadable PDFs, audio, low-bandwidth options Critical for 70%+ facing connectivity issues
5๏ธโƒฃ Build Safe Environments โ€” Privacy protections, peer support networks Nearly half face fear or restrictions

๐Ÿ”ฌ Methodology

Study Design

Aspect Details
Design Cross-sectional survey study
Sample 322 Afghan women enrolled in online university programs
Collection Period October 2025
Survey Language Farsi (with English translation for analysis)
Survey Items 25 questions covering demographics, infrastructure, pedagogy, safety, and outcomes

Analysis Methods

  • Descriptive Statistics โ€” Sample characteristics and variable distributions
  • Correlation Analysis โ€” Relationships between factors and outcomes
  • Logistic Regression โ€” Predictors of student persistence
  • K-Means Clustering โ€” Student segmentation for targeted interventions
  • Thematic Coding โ€” Qualitative analysis of open-ended responses

Data Processing Pipeline

Raw Survey โ†’ PII Masking โ†’ Farsiโ†’English Translation โ†’ Data Cleaning โ†’ Analysis

๐Ÿ“ View Data Dictionary โ†’


โš ๏ธ Limitations & Future Work

Limitations

Limitation Implication
Cross-sectional design Cannot establish causality; associations only
Self-reported intentions Actual persistence behavior may differ
Convenience sampling May not represent all Afghan women in online education
Single institution Generalizability to other contexts requires validation
Connectivity bias Students with worst connectivity may be underrepresented

Future Research Directions

  • ๐Ÿ“Š Longitudinal tracking โ€” Follow students over multiple terms to measure actual persistence
  • ๐ŸŒ Multi-institution study โ€” Compare across different online programs
  • ๐Ÿ” Qualitative deep-dives โ€” In-depth interviews on safety and family dynamics
  • ๐Ÿงช Intervention testing โ€” Pilot and evaluate specific improvements
  • ๐Ÿ“ฑ Technology solutions โ€” Develop and test offline-first learning tools

๐Ÿ“‚ Project Structure

โ”œโ”€โ”€ Data/
โ”‚   โ”œโ”€โ”€ Raw_data/          # Original survey responses (excluded from repo)
โ”‚   โ””โ”€โ”€ Processed_data/    # Cleaned, anonymized datasets
โ”œโ”€โ”€ notebooks/             # Data cleaning and preparation scripts
โ”œโ”€โ”€ Research_BG/           # Literature review and theoretical framework
โ”œโ”€โ”€ results/               # Analysis notebooks and visualizations
โ”‚   โ”œโ”€โ”€ 01_exploratory_data_analysis.ipynb
โ”‚   โ”œโ”€โ”€ 02_infrastructure_analysis.ipynb
โ”‚   โ”œโ”€โ”€ 03_pedagogical_effectiveness.ipynb
โ”‚   โ”œโ”€โ”€ 04_safety_support_analysis.ipynb
โ”‚   โ”œโ”€โ”€ 05_persistence_modeling.ipynb
โ”‚   โ””โ”€โ”€ 08_final_visualizations.ipynb
โ””โ”€โ”€ collabration/          # Team documentation and retrospectives

๐Ÿš€ Getting Started

Prerequisites

  • Python 3.8+
  • Jupyter Lab/Notebook

Installation

  1. Clone the repository:
git clone https://github.com/MIT-Emerging-Talent/elo2-afghan-girls-online-learning.git
cd elo2-afghan-girls-online-learning
  1. Create a virtual environment and install dependencies:
python -m venv .venv
source .venv/bin/activate  # On Windows: .venv\Scripts\Activate.ps1
pip install -r requirements.txt
  1. Launch Jupyter Lab:
jupyter lab
  1. Start with the notebooks in the results/ folder to explore the analysis.

๐Ÿ”’ Ethics & Privacy

This research prioritizes participant safety and confidentiality:

  • โœ… All responses are anonymous (no names, emails, or identifiable information)
  • โœ… PII masking applied to open-ended responses
  • โœ… Data stored securely with restricted access
  • โœ… Farsi-to-English translation for accessibility
  • โœ… Compliance with ethical research standards for vulnerable populations

๐Ÿค Contributing

This is an academic research project completed as part of the MIT Emerging Talent program. While the analysis is complete, we welcome:

  • Issues or questions about the methodology
  • Suggestions for additional analyses
  • Discussions about similar research contexts

๐Ÿ‘ฅ Team

Madiha Malikzada
Madiha Malikzada

๐Ÿ’ป ๐Ÿ”—
Research ยท Theory Mapping ยท Website
Mahdia Ahmadi
Mahdia Ahmadi

๐Ÿ’ป ๐Ÿ”—
Repo Setup ยท Analysis ยท Documentation
Safiya Hashimi
Safiya Hashimi

๐Ÿ’ป ๐Ÿ”—
Data Cleaning ยท Documentation ยท Presentation

Contributions

Madiha Malikzada โ€” Led data collection, background research, and the mapping of CoI, SDT, Tinto, and TAM/UTAUT frameworks to survey design. Masked identifiable information in open-text responses, translated Farsi/Dari inputs, contributed to analysis and documentation, and built the project website.

Mahdia Ahmadi โ€” Led the technical implementation of the project. Initialized and structured the repository, developed the complete analysis pipeline including exploratory data analysis, infrastructure analysis, pedagogical effectiveness assessment, safety & support analysis, persistence modeling (logistic regression), student segmentation (k-means clustering), and thematic coding of qualitative responses. Created publication-ready visualizations, organized research outputs, and contributed extensively to documentation.

Safiya Hashimi โ€” Cleaned and prepared the dataset for analysis, contributed to documentation, and helped present results from the analysis.

Program: MIT Emerging Talent โ€” ELO2 Cohort
Timeline: October โ€“ December 2025


๐ŸŒ Project Artifacts

๐ŸŒ Website ๐Ÿ“Š Presentation
Website Presentation
Explore our interactive research findings View our final presentation slides

๐Ÿ™ Acknowledgments

We extend our gratitude to:

  • The 322 Afghan women who participated in this survey, sharing their experiences to help improve online education
  • MIT Emerging Talent Program for providing the framework and support for this research
  • Our mentors and reviewers for guidance throughout the project
  • The online university that facilitated survey distribution

๐Ÿ“„ License

This project is licensed under the MIT License - see the LICENSE file for details.


๐Ÿ“š Citation

If you use this research or data in your work, please cite:

@misc{malik2025afghan,
  author = {Malik, Madiha and Ahmadi, Mahdia and Hashimi, Safiya},
  title = {What Helps Afghan Girls Succeed in Online College?},
  year = {2025},
  publisher = {GitHub},
  journal = {GitHub Repository},
  howpublished = {\url{https://github.com/MIT-Emerging-Talent/elo2-afghan-girls-online-learning}},
  note = {MIT Emerging Talent Program}
}

Or in plain text:

Malik, M., Ahmadi, M., & Hashimi, S. (2025). What Helps Afghan Girls Succeed in Online College? MIT Emerging Talent Program. https://github.com/MIT-Emerging-Talent/elo2-afghan-girls-online-learning


๐Ÿ“ง Questions or Collaboration? Open an issue or reach out to the team on LinkedIn.


Built with โค๏ธ for Afghan girls pursuing education against all odds

ยฉ 2025 MIT Emerging Talent | ELO2 Cohort

About

ELO2 project: Anonymous survey of Afghan online university students to understand what makes online learning effective for girls and how programs can improve.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

โšก