Last Updated: September 14, 2025
Author: Ze-Fn (Zelvy Fauzan, M.Pd.)
Repository: [https://github.com/Ze-Fn/Data-Science-Journey]
Welcome to my personal journey in mastering Data Science! This syllabus outlines a structured, self-paced curriculum inspired by top Bachelor’s-level programs (e.g., from universities like Northeastern, Stanford, and UC Berkeley). I’ll document every step—notes, code, projects, and reflections—right here in this repo. Each course has a dedicated folder with Jupyter notebooks, datasets, and progress trackers. Feel free to fork, star, or contribute ideas!
This program is divided into three levels: Fundamental (math & programming basics), Intermediate (data handling & analysis), and Advanced (applied ML & big data). Expect 6-12 months to complete, assuming 10-15 hours/week. Track your progress with weekly commits and badges (e.g., via Shields.io).
Why this syllabus? It’s designed for real-world applicability, blending theory with hands-on projects. By the end, you’ll have a portfolio showcasing end-to-end data science workflows.
By completing this syllabus, you’ll be able to:
Use this Notion template or Trello for personal tracking, and mirror updates in GitHub issues.
Focus: Math and programming essentials. Estimated time: 8 weeks.
Description: Covers limits, derivatives, and integrals with applications to data rates (e.g., optimization in models).
Duration: 4 weeks.
Folder: /fundamentals/calculus-i/
Relevant Materials:
Practice Sessions:
Portfolios/Projects:
Description: Vectors, matrices, eigenvalues—key for dimensionality reduction and neural nets.
Duration: 4 weeks.
Folder: /fundamentals/linear-algebra/
Relevant Materials:
Practice Sessions:
Portfolios/Projects:
Description: Probability distributions, hypothesis testing—foundation for inference in data.
Duration: 4 weeks.
Folder: /fundamentals/prob-stats/
Relevant Materials:
Practice Sessions:
Portfolios/Projects:
Description: Basics of Python syntax, libraries like NumPy/Pandas for data tasks.
Duration: 4 weeks.
Folder: /fundamentals/python-intro/
Relevant Materials:
Practice Sessions:
Portfolios/Projects:
Focus: Handling real data. Estimated time: 12 weeks.
Description: Efficient data handling with trees, graphs—optimizes DS code.
Duration: 4 weeks.
Folder: /intermediate/dsa/
Relevant Materials:
Practice Sessions:
Portfolios/Projects:
Description: SQL/NoSQL for querying and storing data.
Duration: 4 weeks.
Folder: /intermediate/databases/
Relevant Materials:
Practice Sessions:
Portfolios/Projects:
Description: Telling stories with charts using Matplotlib/Seaborn/Tableau.
Duration: 4 weeks.
Folder: /intermediate/viz/
Relevant Materials:
Practice Sessions:
Portfolios/Projects:
Description: Extract patterns via clustering, association rules.
Duration: 4 weeks.
Folder: /intermediate/data-mining/
Relevant Materials:
Practice Sessions:
Portfolios/Projects:
Focus: Production-ready DS. Estimated time: 12 weeks.
Description: Supervised/unsupervised models, evaluation metrics.
Duration: 6 weeks.
Folder: /advanced/ml/
Relevant Materials:
Practice Sessions:
Portfolios/Projects:
Description: Spark/Hadoop for large-scale processing.
Duration: 4 weeks.
Folder: /advanced/big-data/
Relevant Materials:
Practice Sessions:
Portfolios/Projects:
Description: Integrate everything in a full pipeline.
Duration: 2 weeks.
Folder: /advanced/capstone/
Relevant Materials:
Practice Sessions:
Portfolios/Projects:
/ethics/
.Star this repo if it helps! Open issues for feedback. Follow my progress via commits. Questions? E-mail.
License: MIT—feel free to adapt!