- Data-based vs. First-Principle: Top-down ML vs. bottom-up physics.
- Traceability: White (explainable), Grey (hybrid), and Black-Box (opaque) models.
- Overfitting: Excess complexity fails to generalize.
- Types: Nominal, Ordinal, Cardinal, Binary.
- Scales: Nominal, Ordinal, Interval, Ratio.
- Uncertainty: Units and measurement error are essential.
- PSPP Paradigm: Processing-Structure-Property-Performance dependency.
- Noise: Physical noise, aliasing, and instrument bias.
- Scarcity: High-quality data is expensive and rare.
- AI 4 Materials goals.
- Hype vs. Reality.
- Convergence of high-throughput, simulation, and ML.
- Prediction vs. Explanation.
- Physics-based vs. Data-driven.
- Moving toward White-Box ML.
- PSPP graph.
- Multi-modal data types.
- Small Data challenges.
- Physical Priors.
- Categorizing data.
- Metadata and Units.
- Error propagation.
- Adapting industrial standards.
- Deployment.
- Correlation != Causality.
Summary for ML-PC Week 1:
- Transitions from physics-based to data-driven modeling.
- Highlights multi-modal, scarce materials data.
- Covers PSPP relationships, data scales, and uncertainty.
- Adapts CRISP-DM for scientific workflows.