RT @MikaStars39: Key takeaway from the M3 blog: M3 independently reproduced an ICLR 2025 Outstanding Paper Award winner, "Learning Dynamics…

@MiniMax_AI · Jun 1, 2026 · 2 sources

RT @MikaStars39: CUDA kernel optimization in M3 Blog: - FP8 GEMM: most compute-heavy and hardest-to-optimize part of inference; ~1–2 weeks…