Making LLM Training Faster with Unsloth and NVIDIA

(unsloth.ai)

48 points | by segmenta 4 hours ago ago

8 comments

  • stared 2 hours ago

    While I do admire Unsloth (especially their https://huggingface.co/unsloth/Qwen3.6-35B-A3B-GGUF binarizations), the linked blog post looks like written by AI from notes (unless a human author acquired this taste from interactions with chatbots).

    • danielhanchen 26 minutes ago

      Oh thanks :) We're also going to add MTP support soon for Qwen3.6!

      90% of it is fully human done - the maths, algos, code snippets, screenshots & benchmarks are done / conducted by us and NVIDIA :)

      We did use AI to fix spelling errors, and spice up the intro a bit + made some nice plots using Chat (ours would look horrible lol)

    • adityamwagh 30 minutes ago

      What’s with the all the hate for AI assisted writing on HackerNews? It’s a tool and people use tools all the time. It saves TIME and helps in improving coherence of one’s articles.

      • stingraycharles 8 minutes ago

        It destroys the previously implicit contract that the writer actually spent a decent amount of thought and time into the writing, and that the ideas expressed are theirs and original.

        I don’t mind good usage of LLM assisted writing, but if the author can’t even be bothered identifying the most obvious AI tells, I use it as a proxy that the author probably but very little effort into the article.

        It’s also often a horribly verbose style, where the same ideas could be presented with 20% of the prose.

        It’s also ruining the entire experience on web communities (although here on HN the moderation team seems to get a hold of keeping them at bay at this point, much appreciated).

        All in all, it’s objectively a net negative for the readers, and serves only the author.

        I prefer original, less coherent articles that are genuine and where I know the ideas expressed are really the author’s and not the LLM’s inference.

        Last but not least, I don’t think the grandparent you’re replying to was particularly hateful in the grand scheme of things.

      • saberience 26 minutes ago

        Because AI writing is lazy and moreover, I don’t want to know the AIs opinion on something, I can get that myself, if I want to read someone’s article I want to hear that persons words and that persons opinions.

        If someone has no opinions or unique insight then why would I listen to them or read their content.

        Again, if I want the AIs view on something I can open up Claude and ask them myself, why bother reading generated articles that took 10 seconds for someone else to prompt?

        • danielhanchen 24 minutes ago

          The intro was spiced up (I guess we won't do it from now on :)) but the rest 90% maths, benchmarks, code, explanations etc are fully done by us!

  • electroglyph an hour ago

    nice writeup! looking forward to doing some more training as soon as i get some more data sorted. it'll be a custom arch, but i'll probably shoehorn it into unsloth for a speed boost.