Machine Learning Study Notes
Definition
Machine Learning (ML) is a subset of artificial intelligence (AI) focused on algorithms that enable computers to learn patterns from data and make decisions or predictions without explicit programming.
Core Concepts
- Data: Raw information used for training models.
- Model: Mathematical representation that learns from data.
- Training: Process where the model adjusts parameters based on input data.
- Testing: Evaluating model performance on unseen data.
- Features: Individual measurable properties or characteristics of data.
- Labels: Target output the model tries to predict.
Types of Machine Learning
Type | Description | Example |
---|---|---|
Supervised Learning | Learns from labeled data | Email spam detection |
Unsupervised Learning | Finds patterns in unlabeled data | Customer segmentation |
Reinforcement Learning | Learns by interacting with environment, receiving feedback | Game playing AI |
Semi-supervised Learning | Mixes labeled and unlabeled data | Image classification |
Self-supervised Learning | Uses data’s own structure for supervision | Language model training |
Diagram: ML Workflow
Key Algorithms
- Linear Regression: Predicts continuous values.
- Logistic Regression: Classifies binary outcomes.
- Decision Trees: Splits data into branches for classification/regression.
- Random Forests: Ensemble of decision trees for improved accuracy.
- Support Vector Machines: Finds boundaries between classes.
- Neural Networks: Mimics brain structure for complex tasks.
Surprising Facts
-
The human brain has more connections than there are stars in the Milky Way.
- The brain has ~100 trillion synapses, vastly outnumbering the Milky Way’s ~100 billion stars.
-
ML models can outperform humans in medical diagnosis.
- A 2020 study in Nature showed deep learning models surpass radiologists in breast cancer detection (McKinney et al., 2020).
-
Machine learning is used to discover new materials.
- ML algorithms have accelerated the discovery of novel compounds for batteries and semiconductors.
Interdisciplinary Connections
- Neuroscience: ML architectures like neural networks are inspired by brain connectivity.
- Statistics: ML relies on statistical methods for inference and prediction.
- Physics: ML models simulate complex systems, e.g., predicting quantum states.
- Biology: ML analyzes genomic data for disease prediction.
- Ethics: Responsible AI development requires input from philosophy and law.
- Education: Adaptive learning platforms use ML to personalize student experiences.
Debunking a Myth
Myth: Machine learning models are “black boxes” and cannot be understood.
Reality:
- Advances in Explainable AI (XAI) provide tools to interpret model decisions.
- Techniques like feature importance, LIME, and SHAP visualize how models make predictions.
- Regulatory frameworks now require model transparency in sectors like finance and healthcare.
Connections to Technology
- Automation: ML powers robotics, self-driving cars, and smart assistants.
- Healthcare: ML enables personalized medicine, predictive diagnostics, and drug discovery.
- Finance: Fraud detection, algorithmic trading, and risk assessment rely on ML.
- Cybersecurity: ML detects anomalies and prevents attacks in real time.
- Natural Language Processing (NLP): ML models understand and generate human language (e.g., chatbots, translation).
Recent Research
- Reference:
McKinney, S. M., Sieniek, M., Godbole, V., Godwin, J., Antropova, N., Ashrafian, H., … & Suleyman, M. (2020). International evaluation of an AI system for breast cancer screening. Nature, 577(7788), 89-94.
Read Article
Diagram: Neural Network Structure
Challenges
- Bias and Fairness: ML models may inherit biases from training data.
- Data Privacy: Handling sensitive information securely is critical.
- Scalability: Training large models requires significant computational resources.
- Interpretability: Making models understandable to users and regulators.
Revision Checklist
- Understand ML types and workflows.
- Know key algorithms and their applications.
- Recognize interdisciplinary links.
- Be aware of ML’s impact on technology and society.
- Stay updated with recent research and ethical considerations.
Further Reading
- Explainable AI: Interpreting Machine Learning Models
- AI for Materials Discovery
- Responsible AI Guidelines