Flash-MoE: Running a 397B Parameter Model on a Mac with 48GB RAM

News Source: GitHub.com

News Summary

Pure C/Metal inference engine that runs Qwen3.5-397B-A17B (a 397 billion parameter Mixture-of-Experts model) on a MacBook Pro with 48GB RAM at 4.4+ tokens/second.
The model has 60 transformer layers: 45 GatedDeltaNet (linear attention) layers and 15 standard full-attention layers.
Each layer has 512 experts, of which K=4 are activated per token (plus one shared expert).
Hidden dimension is 4096.
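
To make the routing numbers above concrete, here is a minimal C sketch of top-K expert selection for one token in one layer. It is an illustration under stated assumptions, not the project's actual code: the function names `select_top_k` and `routed_expert_loads_per_token` are hypothetical, the gate scores are assumed to be already computed, and the shared expert (which runs for every token) is left out of the selection.

```c
#include <assert.h>
#include <stddef.h>

/* Figures taken from the summary above. */
enum { NUM_LAYERS = 60, NUM_EXPERTS = 512, TOP_K = 4 };

/*
 * Hypothetical helper: write the indices of the k largest entries of
 * scores[0..n) into out_idx, in descending score order.
 * A simple O(n*k) scan is adequate for n = 512 and k = 4.
 */
void select_top_k(const float *scores, size_t n, int *out_idx, size_t k)
{
    assert(k <= n);
    for (size_t filled = 0; filled < k; ++filled) {
        int best = -1;
        for (size_t i = 0; i < n; ++i) {
            /* Skip experts that were already picked in an earlier pass. */
            int taken = 0;
            for (size_t j = 0; j < filled; ++j) {
                if (out_idx[j] == (int)i) {
                    taken = 1;
                    break;
                }
            }
            if (taken)
                continue;
            if (best < 0 || scores[i] > scores[best])
                best = (int)i;
        }
        out_idx[filled] = best;
    }
}

/*
 * Routed experts touched per token across the whole model:
 * TOP_K per layer times NUM_LAYERS (shared experts not counted).
 */
int routed_expert_loads_per_token(void)
{
    return NUM_LAYERS * TOP_K;
}
```

With K=4 of 512 experts per layer, each token uses under 1% of a layer's routed experts, or 240 routed experts across all 60 layers. That sparsity is what makes it plausible for a 397B-parameter model to run on a machine with far less memory than the full weights.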

Read the paper: full technical details, 90+ experiments, and the story of how an AI and a human built this in 24 hours.
