• SpicyToaster420@sopuli.xyz
    link
    fedilink
    arrow-up
    4
    ·
    3 days ago

    Awesome use of LLMs. I wonder they didn’t use FP8 quantization though, especially since their target hardware was an L40s.