Has anyone tried in organization to use self hosted llm models for agentic programming?

Im curious if it makes any sense. My organization spends fortune on tokens from us companies. I want to recommend something…

  • PeeOnYou [he/him]@lemmygrad.ml
    link
    fedilink
    arrow-up
    0
    ·
    3 days ago

    its shared sure, but the bandwidth is crap compared to a dedicated nvidia card. the performance will suffer, even though it allows you to run larger quants