Sam Heutmaker

Founder & CEO at Inference

Sam Heutmaker is currently the founder and CEO of Inference.

Current position

Founder & CEO

News mentions

If this happens, the biggest AI trend of the next decade won’t be larger models. It’ll be open models and local/private inference. Then again not sure most people care about privacy to make a conscious choice about which AI products they use. Cloud beat local. Social media beat privacy concerns. M

@lmrankhan, June 2026

Local inference not looking so crazy anymore now that companies are revolting over token spend

@fejau_inc, June 2026

Local inference not looking so crazy anymore now that companies are revolting over token spend

@citrini, June 2026

Allora x @trepa_io Partnering with Trepa, a new prediction primitive on @solana that rewards closeness to the outcome, not direction. 1-minute rounds. What this means: Participants can now use Allora's inference data to sharpen their calls. More to come.

@coinfund, June 2026

When intelligence is cheap, verification becomes the scarce resource. @FourPillarsFP breaks down how OpenGradient is building the trust layer for AI inference, verifiable calls, on-chain attestation, private inference, and payments in a single execution. Read the full research 👇

@yueya_eth, June 2026

NVIDIA announces Nemotron 3 Ultra, an open model built for long-running AI agents that need to plan, reason, and use tools across complex workflows. Up to 5x faster inference and 30% lower cost for agentic tasks.

@martybent, June 2026

Did everyone just MISS this? With x402 PAYMENTS now in the ecosystem…. $GITLAWB has became the ONLY platform where AI agents: Store code, push repositories, earn bounties, AND pay for inference. Coinbase is pushing for x402 to become the standard for agent payments. And $GITLAWB is now plugged

@studious_crypto, June 2026

Introducing NVIDIA Nemotron 3 Ultra. A frontier smart open model built for long-running agents that need to plan, reason, use tools and keep working across complex coding, research and enterprise workflows. Up to 5x faster inference and up to 30% lower cost for agentic tasks. Learn more: https://

@clementdelangue, June 2026

Nemotron 3 Ultra (550B-A55B) is here - our strongest open-weight model and full training recipe to date. Heavy emphasis on real-world inference efficiency for long-context agentic workloads. Everything is open 🤗: base, post-trained, reward checkpoints, NVFP4 quantized versions, training data, an

@clementdelangue, June 2026

RT @theo: In order to hit the limit of your $40 Copilot plan, you have to do at least $60 of inference. The previous limit structure was e…

@steipete, June 2026