New technique makes AI models leaner and faster while they’re still learning

Source Domain: news.mit.edu

New Compression Method During Training: Researchers have developed CompreSSM, a technique to compress state-space models during training, avoiding the trade-off between model size and performance improvement.
Advantages Over Traditional Methods: Unlike conventional pruning or knowledge distillation methods, CompreSSM enables informed compression mid-training, saving computational resources and time without sacrificing accuracy.
Efficiency and Performance: On benchmarks like CIFAR-10 and Mamba, CompreSSM-trained models match the performance of larger models while training up to 1.5 times faster and achieving up to 4x training speedups.
Theoretical Grounding: The method leverages control theory, specifically Hankel singular values, to rank model components by their importance early in training, ensuring only significant parts are retained.
Safe Compression: CompreSSM includes a safety net feature allowing practitioners to revert to saved checkpoints if a compression step adversely affects performance.
Effectiveness on Certain Models: The method works particularly well on multi-input, multi-output models and linear time-invariant systems, with potential extensions to other state-space model architectures, including transformers.
Future Directions: Researchers aim to extend CompreSSM to more complex architectures and dynamical systems, aiming for wider applicability in modern AI systems.
Conference and Support: The work was accepted for presentation at the International Conference on Learning Representations 2026, supported by several institutions including the Max Planck Institute and U.S. Office of Naval Research.

New technique makes AI models leaner and faster while they’re still learning | MIT News

STAT Health Tech: FDA rejects proposal to deregulate AI devices

MIT Technology Review’s EmTech AI 2026: Leading in the Era of AI Integration

J.S. Held Strengthens Cross-Disciplinary Response to Artificial Intelligence (AI) Disputes

The Most Important Cybersecurity Trends in 2026 So Far

STAT Health Tech: FDA rejects proposal to deregulate AI devices

MIT Technology Review’s EmTech AI 2026: Leading in the Era of AI Integration

AI is supercharging the cybersecurity fight

J.S. Held Strengthens Cross-Disciplinary Response to Artificial Intelligence (AI) Disputes

More Stories

You may have missed