• Kuvwert@lemm.ee
    21 hours ago

    https://ibb.co/wVNsn5H

    https://ibb.co/HpK5G5Pp

    https://ibb.co/sp1wGMFb

    https://ibb.co/4wyKhkRH

    https://ibb.co/WpBTZPRm

    https://ibb.co/0yP73j6G

    Note that my tests were run via Groq, using the R1 70B distilled Llama variant (the 2nd smartest version afaik)

    Edit 1:

    Incidentally… I asked a coworker to answer the same question. This is a summary of the conversation:

    Me: “Hey Billy, can you answer a question? Answer the following question in under 3 seconds”

    Billy: “sure”

    Me: “How many As are in abracadabra 3.2.1”

    Billy: “4” (answered in less than 3 seconds)

    Me: “nope”

    I’m gonna poll the office and see how many people get it right with the same opportunity the ai had.

    Edit 2: The second coworker said “6” in about 5 seconds

    Edit 3: Third coworker said 4, in 3 seconds

    Edit 4: I asked two more people and one of them got it right… but I’m 60% sure she overheard me asking the previous person. If she didn’t, we’re at 1/5.

    I’m probably done with this game for the day.

    I’m pretty flabbergasted by the results of my very unscientific experiment, but now I can say (with a mountain of anecdotal juice) that with letter counting, R1 70B is wildly faster and more accurate than humans.
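
For anyone who wants to check the count themselves, here is a minimal sketch in Python. It assumes the question means the letter "a" in the word "abracadabra" only, with the "3.2.1" treated as a spoken countdown and not part of the text; `count_letter` is just a helper name made up for this example.

```python
def count_letter(word: str, letter: str) -> int:
    """Count case-insensitive occurrences of a single letter in a word."""
    return word.lower().count(letter.lower())


# Assumption: only the word itself is counted, not the "3.2.1" countdown.
answer = count_letter("abracadabra", "a")
print(answer)  # 5
```

Under that assumption the count is 5, so the answers of 4 and 6 above both miss it.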