![]() |
|
![]() |
| It’s dishonest because they not only point towards a specific language model, but the beta version of a specific model. WTH? |
![]() |
| perhaps copyright needs to be updated. And in any case, my personal belief is that training on data that is publicly released, and as well as purchased media, is fair use. |
![]() |
| The people who propose that authors lose money by chatGPT's usage of their works in the training, is the same idea that piracy costs music labels money. |
![]() |
| Inference time compute is still very under utilized in actual AI deployments. Lots of folks are working on foundation models, which require reasoning about broad problem domains. Not enough people are using the same techniques for task-specific performance improvements. You can easily distill the reasoning from larger models like R1 for your task. Often better, you can mix in custom thinking instructions for specific sub-problems so a fine tuned model learns a mix of task specific reasoning and custom logic. It’s not hard and easily beats prompt iteration. When you find bugs, you can fix it.
I made a GitHub project for distilling thinking models (and customs COT inference time fine tuning): https://docs.getkiln.ai/docs/guide-train-a-reasoning-model |
![]() |
| Thanks for the input. It sounds like the task is about as daunting as it seems, then, but doable. Are there any resources (such as papers) you’ve found especially helpful? |
![]() |
| I just want to make music with AI and it is very difficult. The meta model on hugging gives an error when used through the website and no one will ever fix it. |
![]() |
| It depends on how much you want it to do for you. I've used ChatGPT to come up with song briefs which I then turn into music myself. |
![]() |
| You can describe or upload the first N seconds, then extend from that by using another description, then extend from N further seconds etc. But Suno music within a genre has a pretty limited range. |
![]() |
| Wait so Qwen trained QWQ 32B from Qwen 32B and then they distill QWQ back into Qwen 32B? What's the point?
This is massive marketing scam here. Borderline academic dishonesty. |
How long does this take on a free tier T4? This is really neat, I’d assumed this type of “playing with the guts” work was more difficult to access as a normie programmer. Looks like something I’d like to try!