ads

Monday, May 5, 2025

Show HN: DistilKitPlus, a distillation framework between any LLMs https://ift.tt/L7iyA9Y

Show HN: DistilKitPlus, a distillation framework between any LLMs Over the past few months, I have built a distillation toolkit that supports cross-tokenizer distillation (e.g., distilling from LLaMA to Qwen vocab, or others). This approach has worked well on reasoning datasets like AIME, and we’ve validated on models like Phi and Qwen. We’ve also integrated Modal for quick deployment (with $30/month credits to try it out). Would love any feedback! GitHub: https://ift.tt/C1zdBKG Docs: https://ift.tt/2Xrw1Gb https://ift.tt/C1zdBKG May 5, 2025 at 11:12PM

No comments:

Post a Comment