Making LLM Training Faster with Unsloth and NVIDIA

(unsloth.ai)

48 points | by segmenta 4 hours ago ago

8 comments

stared 2 hours ago
While I do admire Unsloth (especially their https://huggingface.co/unsloth/Qwen3.6-35B-A3B-GGUF binarizations), the linked blog post looks like written by AI from notes (unless a human author acquired this taste from interactions with chatbots).
[-]
- danielhanchen 26 minutes ago
  Oh thanks :) We're also going to add MTP support soon for Qwen3.6!
  90% of it is fully human done - the maths, algos, code snippets, screenshots & benchmarks are done / conducted by us and NVIDIA :)
  We did use AI to fix spelling errors, and spice up the intro a bit + made some nice plots using Chat (ours would look horrible lol)
- adityamwagh 30 minutes ago
  What’s with the all the hate for AI assisted writing on HackerNews? It’s a tool and people use tools all the time. It saves TIME and helps in improving coherence of one’s articles.
  [-]
  - stingraycharles 8 minutes ago
    It destroys the previously implicit contract that the writer actually spent a decent amount of thought and time into the writing, and that the ideas expressed are theirs and original.
    I don’t mind good usage of LLM assisted writing, but if the author can’t even be bothered identifying the most obvious AI tells, I use it as a proxy that the author probably but very little effort into the article.
    It’s also often a horribly verbose style, where the same ideas could be presented with 20% of the prose.
    It’s also ruining the entire experience on web communities (although here on HN the moderation team seems to get a hold of keeping them at bay at this point, much appreciated).
    All in all, it’s objectively a net negative for the readers, and serves only the author.
    I prefer original, less coherent articles that are genuine and where I know the ideas expressed are really the author’s and not the LLM’s inference.
    Last but not least, I don’t think the grandparent you’re replying to was particularly hateful in the grand scheme of things.
  - saberience 26 minutes ago
    Because AI writing is lazy and moreover, I don’t want to know the AIs opinion on something, I can get that myself, if I want to read someone’s article I want to hear that persons words and that persons opinions.
    If someone has no opinions or unique insight then why would I listen to them or read their content.
    Again, if I want the AIs view on something I can open up Claude and ask them myself, why bother reading generated articles that took 10 seconds for someone else to prompt?
    [-]
    - danielhanchen 24 minutes ago
      The intro was spiced up (I guess we won't do it from now on :)) but the rest 90% maths, benchmarks, code, explanations etc are fully done by us!
electroglyph an hour ago
nice writeup! looking forward to doing some more training as soon as i get some more data sorted. it'll be a custom arch, but i'll probably shoehorn it into unsloth for a speed boost.
[-]
- danielhanchen 25 minutes ago
  Thank you!