Machine Learning Study Notes

General Science July 28, 2025 4 min read

Definition

Machine Learning (ML) is a subset of artificial intelligence (AI) focused on algorithms that enable computers to learn patterns from data and make decisions or predictions without explicit programming.

Core Concepts

Data: Raw information used for training models.
Model: Mathematical representation that learns from data.
Training: Process where the model adjusts parameters based on input data.
Testing: Evaluating model performance on unseen data.
Features: Individual measurable properties or characteristics of data.
Labels: Target output the model tries to predict.

Types of Machine Learning

Type	Description	Example
Supervised Learning	Learns from labeled data	Email spam detection
Unsupervised Learning	Finds patterns in unlabeled data	Customer segmentation
Reinforcement Learning	Learns by interacting with environment, receiving feedback	Game playing AI
Semi-supervised Learning	Mixes labeled and unlabeled data	Image classification
Self-supervised Learning	Uses data’s own structure for supervision	Language model training

Diagram: ML Workflow

ML Workflow

Key Algorithms

Linear Regression: Predicts continuous values.
Logistic Regression: Classifies binary outcomes.
Decision Trees: Splits data into branches for classification/regression.
Random Forests: Ensemble of decision trees for improved accuracy.
Support Vector Machines: Finds boundaries between classes.
Neural Networks: Mimics brain structure for complex tasks.

Surprising Facts

The human brain has more connections than there are stars in the Milky Way.
- The brain has ~100 trillion synapses, vastly outnumbering the Milky Way’s ~100 billion stars.
ML models can outperform humans in medical diagnosis.
- A 2020 study in Nature showed deep learning models surpass radiologists in breast cancer detection (McKinney et al., 2020).
Machine learning is used to discover new materials.
- ML algorithms have accelerated the discovery of novel compounds for batteries and semiconductors.

Interdisciplinary Connections

Neuroscience: ML architectures like neural networks are inspired by brain connectivity.
Statistics: ML relies on statistical methods for inference and prediction.
Physics: ML models simulate complex systems, e.g., predicting quantum states.
Biology: ML analyzes genomic data for disease prediction.
Ethics: Responsible AI development requires input from philosophy and law.
Education: Adaptive learning platforms use ML to personalize student experiences.

Debunking a Myth

Myth: Machine learning models are “black boxes” and cannot be understood.

Reality:

Advances in Explainable AI (XAI) provide tools to interpret model decisions.
Techniques like feature importance, LIME, and SHAP visualize how models make predictions.
Regulatory frameworks now require model transparency in sectors like finance and healthcare.

Connections to Technology

Automation: ML powers robotics, self-driving cars, and smart assistants.
Healthcare: ML enables personalized medicine, predictive diagnostics, and drug discovery.
Finance: Fraud detection, algorithmic trading, and risk assessment rely on ML.
Cybersecurity: ML detects anomalies and prevents attacks in real time.
Natural Language Processing (NLP): ML models understand and generate human language (e.g., chatbots, translation).

Recent Research

Reference:
McKinney, S. M., Sieniek, M., Godbole, V., Godwin, J., Antropova, N., Ashrafian, H., … & Suleyman, M. (2020). International evaluation of an AI system for breast cancer screening. Nature, 577(7788), 89-94.
Read Article

Diagram: Neural Network Structure

Challenges

Bias and Fairness: ML models may inherit biases from training data.
Data Privacy: Handling sensitive information securely is critical.
Scalability: Training large models requires significant computational resources.
Interpretability: Making models understandable to users and regulators.

Revision Checklist

Understand ML types and workflows.
Know key algorithms and their applications.
Recognize interdisciplinary links.
Be aware of ML’s impact on technology and society.
Stay updated with recent research and ethical considerations.