• errer@lemmy.world
    cake
    link
    fedilink
    English
    arrow-up
    10
    ·
    15 hours ago

    I mean the poster above you is wrong, they use math tools internally now when you ask math questions. Very obvious in Gemini. Yes the raw LLM trying to autocomplete the answer to a math problem is gonna be wrong but that’s not the way they are used to solve problems like that anymore.

    • sbv@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      7
      ·
      14 hours ago

      The LLM has to choose to use the calculating tools. Gemini tried to do this one solo:

      4 + 2 + 2 + 2 + 1+ 2 + 0 = 15

      Tbf, it did four of these calculations, and 75% were correct.