RT @MikaStars39: Key takeaway from the M3 blog: M3 independently reproduce an ICLR 2025 Outstanding Paper Award winner, "Learning Dynamics…
@MiniMax_AI·1 de jun. de 2026·2 fontes
Ler artigoTodas as fontes
RT @MikaStars39: Key takeaway from the M3 blog: M3 independently reproduce an ICLR 2025 Outstanding Paper Award winner, "Learning Dynamics…
@MiniMax_AI
1 de jun. de 2026
RT @MikaStars39: CUDA kernel optimization in M3 Blog: - FP8 GEMM: most compute-heavy and hardest-to-optimize part of inference; ~1–2 weeks…
@MiniMax_AI
1 de jun. de 2026