publications
publications by categories in reversed chronological order
2025
-
- Sign-SGD is the Golden Gate between Multi-Node to Single-Node Learning: Significant Boost via Parameter-Free Optimization2025
-
- When Extragradient Meets PAGE: Bridging Two Giants to Boost Variational InequalitiesIn The 41st Conference on Uncertainty in Artificial Intelligence, 2025
- Closing the Curvature Gap: Full Transformer Hessians and Their Implications for Scaling Laws2025