Managers

inari@piefed.zip · 14 days ago

Managers

Kaligalis@lemmy.world · 14 days ago

It might not be as impossible as it sounds. Some of the “open” models are rumored to be able to code. The real problem is that you likely need something with 128 GiB VRAM to run them with a reasonably large context window.

IratePirate@feddit.org · 14 days ago

An Nvidia B200 (192 Gigs of RAM) sells somewhere between 30-50k a pop. That’s feasible for a company.

Kazumara@discuss.tchncs.de · 13 days ago

And then you can serve one inference at a time. Hopefully your devs are well distributed over timezones :-)

Diurnambule@jlai.lu · 13 days ago

Wonderfull idea, may be they can connect to the same PC, and we can call it main frame or something. xD

baronofclubs@lemmy.world · 13 days ago

I don’t see why it wouldn’t be feasible to rent someone else’s computer to use for something like this, seeing how it could amortize costs over time.

mindbleach@sh.itjust.works · 14 days ago

Qwen’s 27B model from April outperforms its 397B model from February.

Local and small were always going to win.

Diurnambule@jlai.lu · 13 days ago

Qwen 3.6 ? It is unstable though. It go awry more often than the 3.5 of the same size.