Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up

All HF Hub posts

danielhanchenย 
posted an update about 24 hours ago
SeanLee97ย 
posted an update 1 day ago
view post
Post
6761
Our lab recently released a paper where we introduce ShadowPEFT, a new Parameter-Efficient Fine-Tuning (PEFT) paradigm tailored for edge computing scenarios.

Unlike traditional approaches such as LoRA and its variants, which inject trainable parameters directly into the weights of Transformer, requiring tight coupling with the backbone.

ShadowPEFT instead enhances the frozen large base model by adding a lightweight, centralized, pretrainable, and detachable Shadow network.
This shadow network operates in parallel with the base model, delivering learned corrections to each decoder layer. Because the shadow module is architecturally decoupled from the backbone, it can be independently trained, stored, and deployed, benefiting edge computing scenarios and edge-cloud collaboration computing.

- HF Paper: ShadowPEFT: Shadow Network for Parameter-Efficient Fine-Tuning (2604.19254)
- GitHub: https://github.com/ShadowLLM/shadow-peft
- HF Collection: https://huggingface.co/collections/shadow-llm/shadow-peft-models
  • 7 replies
ยท
Ujjwal-Tyagiย 
posted an update 2 days ago
view post
Post
3779
We are hiring at Shirova AI. We need AI researchers and engineers to work in our research lab. Shirova AI is a research lab in India, so we can help our researchers move to nearby workspaces or let them work from home without ever coming to the lab. We're building our founding team, so the pay will be good. You can learn, so don't hesitate to mail us at: careers@shirova.com
imnotkittyย 
posted an update about 5 hours ago
view post
Post
424
tencent/Hy3-preview is out: an open-weights MoE reasoning model.

โœ… 295B total / 21B active / 256K context
โœ… Fused fast-and-slow thinking in a single model
โœ… First model trained on Hunyuan's rebuilt pretraining + RL infra (Feb โ†’ Apr)

Benchmarks:
๐Ÿ‘‰ SWE-Bench Verified, Terminal-Bench 2.0, BrowseComp, WideSearch โ€” competitive results, particularly strong on agentic tool use
๐Ÿ‘‰ Top score on Tsinghua's 2026 Spring math PhD qualifying exam
๐Ÿ‘‰ Strong context-learning and instruction-following on Tencent's CL-bench / CL-bench-Life

More details can be found in my article: https://huggingface.co/blog/imnotkitty/hy3-preview
  • 1 reply
ยท
dealermatt72ย 
posted an update 3 days ago
view post
Post
6601
Hey Hugging Face community ๐Ÿ‘‹

My name is M. I'm a solo founder and self-taught developer based in Houston, TX. I build AI-powered apps โ€” I have an iOS app called DeFilter currently in App Store review, a security scanning platform called Sentinel, and a job marketplace called HireHuman.fyi for connecting humans with companies that prefer non-AI workers.

I'm also a poker dealer by night, which means I think a lot about reading situations in real time โ€” and that's exactly what sparked this idea.

I'm not the most technical person in the room. But I have a vision, I have drive, and I believe the best projects get built when people with different skills come together around a shared idea.

That's why I'm posting here. I want to build this with the community.

โ€” M (@dealermatt )

  • 3 replies
ยท
anakin87ย 
posted an update 3 days ago
view post
Post
10304
How LLM training with RL Environments works?

It all starts with ๐—ฅ๐—ฒ๐—ถ๐—ป๐—ณ๐—ผ๐—ฟ๐—ฐ๐—ฒ๐—บ๐—ฒ๐—ป๐˜ ๐—Ÿ๐—ฒ๐—ฎ๐—ฟ๐—ป๐—ถ๐—ป๐—ด ๐˜„๐—ถ๐˜๐—ต ๐—ฉ๐—ฒ๐—ฟ๐—ถ๐—ณ๐—ถ๐—ฎ๐—ฏ๐—น๐—ฒ ๐—ฅ๐—ฒ๐˜„๐—ฎ๐—ฟ๐—ฑ๐˜€
- question asked
- model generates reasoning + answer
- answer checked against ground truth
- reward drives RL training


In this setup, the environment is simple: fixed questions and answers, rollout logic, reward(s)

Consider a more complex tic-tac-toe env โŒโญ•
It adds:
- dynamic game generation/handling
- tunable opponent skill
- multi-turn interactions

(envs can also include tools)

---

What happens at training?

We use ๐—š๐—ฟ๐—ผ๐˜‚๐—ฝ ๐—ฅ๐—ฒ๐—น๐—ฎ๐˜๐—ถ๐˜ƒ๐—ฒ ๐—ฃ๐—ผ๐—น๐—ถ๐—ฐ๐˜† ๐—ข๐—ฝ๐˜๐—ถ๐—บ๐—ถ๐˜‡๐—ฎ๐˜๐—ถ๐—ผ๐—ป with a tic-tac-toe env

No critic model needed, the group is the baseline
Simpler than PPO

1๏ธโƒฃ Rollout generation: from the same board, model plays N games via sampling
2๏ธโƒฃ Each game scored with deterministic rewards (win, format, ...)
3๏ธโƒฃ Mean score computed across the group
4๏ธโƒฃ Each rollout's advantage = its score minus the group mean
5๏ธโƒฃ Model updated to favor trajectories above baseline

๐Ÿ” Repeat


For a deep dive, check out
๐ŸŒฑ https://github.com/anakin87/llm-rl-environments-lil-course
a free hands-on course on RL environments for LLMs
  • 2 replies
ยท
Benedictatย 
posted an update about 5 hours ago
view post
Post
164
Built a WeChat Mini Program in 20 minutes flat with Hy3 Preview + WorkBuddyโ€ฆ

and I didnโ€™t type a single line of code. Not even a semicolon.

This Coding Agent is on steroids. Its comprehension in long back-and-forths is night and day better, and that 256K context window swallows the entire project structure whole.

Tell it what you want, and it actually gets the full picture no confused blank stares from the AI.

And weโ€™re not messing around with dinky little code snippets here. It spits out a fully functional project

app.json, every pageโ€™s wxml/wxss/js/json, even Mock data pre-packed. Import it into WeChat Dev Tools and it runs on the first try

Only one tiny visual nitpick, zero logic bugs. Point out the flaw, and it fixes it instantly no new bugs, no passive-aggressive code breaks, no headaches

The entire vibe Tell it your idea โ†’ Get a complete working project โ†’ Mention a tiny flaw โ†’ AI polishes it.

No coding, no endless edits, no soul-crushing debugging that makes you want to throw your laptop. Absolute game-changer
wangbuer999ย 
posted an update about 5 hours ago
view post
Post
167
Testing AI controlling AI with Hy3 Preview I barely lifted a finger the whole time.

One-click deployment of Hermes on WorkBuddy took some time with a few rounds of adjustments, and I finally got it up and running smoothly

Only minor issue was setting up Supermemory it was a bit slow on the uptake. I had to go over simple steps several times, guiding it patiently like teaching a kid.

The experience of AI orchestrating AI is absolutely incredible. started running Agents with Hunyuan right after its release, and it actually works perfectly.

295B parameters, 21B active parameters, with direct access to TokenHub now great cost-performance ratio too

Honestly, I used to get stuck on all kinds of environment configurations when deploying Agents locally. Using Hy3 to take command made the whole process way more streamlined.
kelsendย 
posted an update about 5 hours ago
view post
Post
191
The rebuilt Hunyuan HY3 Preview is here!

I tested it on all the tricky scenarios where most LLMs usually face-plantโ€”and guess what? It didnโ€™t flop.

295B total params, 21B active params, 256K context window. Built on MoE architecture, it delivers trillion-parameter-level performance with a much smaller footprint. Long-context capabilities get a massive upgrade.

Agent abilities stand out this time: tool calling, workflow orchestration, and autonomous planning are far more stable in real business scenarios. AI PPT generation in Tencent Docs is also significantly smoother and more reliable.

Real-world tests on WorkBuddy show first-token latency down 54%, success rate over 99.99%, and an Agent workflow that ran continuously for 495 steps.

Its Coding Agent achieved top-tier results on both SWE-Bench Verified and Terminal-Bench 2.0

Now open-sourced on GitHub, HuggingFace, and ModelScope. Available on TokenHub at just 1.2 RMB per million tokens.
MonsterMMORPGย 
posted an update about 12 hours ago
view post
Post
59
The ULTIMATE Guide to AI Voice Cloning: RVC WebUI (Zero to Hero)

Links

Tutorial link : https://youtu.be/ZRrzvD4wNys

App link : https://www.patreon.com/posts/rvc-web-ui-app-installer-zip-file-149104996

Info

Ultimate AI Voice Changer Tutorial: SECourses Premium RVC Web UI (Windows, RunPod & Massed Compute). This video is only for educational and responsible usage purposes. With V3: Multiple voice merge to generate custom voice feature implemented.

Welcome to the complete tutorial for the SECourses Premium RVC Web UI! In this video, I will show you how to easily transform your speaking voice or song vocals using our highly optimized AI voice conversion application.
Whether you want to sound like a famous celebrity (like Donald Trump or Tupac), replace vocals in AI-generated music, or change your voice live in real-time, this tool has everything you need. The installer automatically downloads 30+ pre-trained demo voices, and you can easily add hundreds more from Hugging Face!

I will walk you through the entire 1-click installation process for local Windows PCs, as well as step-by-step cloud deployment guides for RunPod, SimplePod, and our highly recommended cloud provider, Massed Compute.

โœจ Key Features Covered in this Video:
โ€ข 1-Click Install with automatic model & dependency downloading

โ€ข 30+ Pre-trained demo voices included automatically

โ€ข Auto-separation of vocals and music for song processing

โ€ข Custom Preset System to save your favorite inference settings

โ€ข Batch Inference to process multiple audio files instantly

โ€ข Real-Time Live Voice Changing desktop application included

โ€ข Full Custom Voice Training support (VRAM usage below 4GB!)