Discipline Overview
Data
Science
Transforming raw information into decisions that matter. Discover the methods, tools, and mindset behind modern data-driven intelligence.
Explore the Field4.6M
Data science jobs projected by 2026
36%
Faster growth than any other field
2.5EB
Data created every single day
$128k
Median annual salary (US)
Workflow
How data science works
Every insight begins with a question and ends with action. This is the loop that drives it all.
01
Define the Problem
Frame the business or research question precisely. Vague questions yield unusable answers.
02
Collect Data
Source structured and unstructured data from databases, APIs, sensors, or surveys.
03
Clean & Transform
Handle missing values, outliers, and inconsistencies. Up to 80% of the work lives here.
04
Model & Analyse
Apply statistical or machine learning models to find patterns and test hypotheses.
05
Communicate
Translate findings into clear narratives and visuals that drive confident decisions.
Core Skills
What a data scientist masters
The discipline sits at the intersection of mathematics, technology, and domain knowledge.
Statistical Analysis
Probability, hypothesis testing, regression, and Bayesian methods form the backbone of rigorous insight.
Machine Learning
Supervised, unsupervised, and reinforcement learning algorithms that let machines improve from experience.
Data Engineering
Building pipelines, warehouses, and lakehouses that move and store data reliably at scale.
Data Visualisation
Turning numbers into charts, dashboards, and stories that non-technical audiences can act on.
SQL & Databases
Querying, joining, and aggregating structured data remains the most universally demanded skill.
Domain Expertise
Knowing the industry context—finance, health, retail—separates good models from impactful ones.
"Without data you're just another person with an opinion." — W. Edwards Deming
See the Toolkit
Ecosystem
Languages, libraries & platforms
The modern data scientist moves across a rich ecosystem of tools. Hover to highlight.
Most-used languages among data scientists (2024 survey)
86%
Python
50%
SQL
37%
R
25%
Scala
20%
Julia
15%
Java
Python
R
SQL
Jupyter
Pandas
NumPy
Scikit-learn
TensorFlow
PyTorch
Spark
Tableau
Power BI
dbt
Airflow
BigQuery
Snowflake
AWS SageMaker
Databricks
Git
Docker