• Kuvwert@lemm.ee
    link
    fedilink
    English
    arrow-up
    4
    ·
    2 days ago

    Non thinking prediction models can’t count the r’s in strawberry due to the nature of tokenization.

    However openai o1 and deep seek r1 can both reliably do it correctly