Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
47
29
192
Dr. Chad PhD
Doctor-Chad-PhD
Follow
Hamzarauf246's profile picture
JBS369ai's profile picture
21world's profile picture
7 followers
·
14 following
AI & ML interests
😎
Recent Activity
upvoted
an
article
10 minutes ago
Projected Abliteration
reacted
to
grimjim
's
post
with 🔥
11 minutes ago
I've uploaded abliteration code with support for sparsification of the refusal vector. It's poorly documented, but the code should be straightforward. https://github.com/jim-plus/llm-abliteration The code is built atop a fork that enabled abliteration to be performed on models loaded in 4-bit or 8-bit bitsandbytes quantization. TransformerLens is not required, just plain Transformers. For those previously unaware, this opens up abliteration experimentation to more people with local VRAM limitations. Since performing abliteration on a quant involves precision and perplexity loss, it stands to reason that a small amount of magnitude sparsification could filter out some noise and possibly even reduce the damage inflicted on latent space via ablation of the refusal vector. There's a small but real acceleration of ablation of the refusal vector by reducing outer product operations from O(d²×n) to O(d×n), and then by pushing said computation layerwise to GPU. The code is hardcoded for CUDA acceleration currently. Normalization of the refusal vector was deferred in order to allow sparsification. In principle other behavior vector interventions could also be added and explored.
reacted
to
grimjim
's
post
with ❤️
11 minutes ago
I've uploaded abliteration code with support for sparsification of the refusal vector. It's poorly documented, but the code should be straightforward. https://github.com/jim-plus/llm-abliteration The code is built atop a fork that enabled abliteration to be performed on models loaded in 4-bit or 8-bit bitsandbytes quantization. TransformerLens is not required, just plain Transformers. For those previously unaware, this opens up abliteration experimentation to more people with local VRAM limitations. Since performing abliteration on a quant involves precision and perplexity loss, it stands to reason that a small amount of magnitude sparsification could filter out some noise and possibly even reduce the damage inflicted on latent space via ablation of the refusal vector. There's a small but real acceleration of ablation of the refusal vector by reducing outer product operations from O(d²×n) to O(d×n), and then by pushing said computation layerwise to GPU. The code is hardcoded for CUDA acceleration currently. Normalization of the refusal vector was deferred in order to allow sparsification. In principle other behavior vector interventions could also be added and explored.
View all activity
Organizations
None yet
Doctor-Chad-PhD
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
zai-org/GLM-4.6
21 days ago
guys we also need some AIR
➕
❤️
49
23
#1 opened 28 days ago by
jacek2024
New activity in
ServiceNow-AI/Apriel-1.5-15b-Thinker
22 days ago
Too censored (Update: Working SillyTavern Jailbreak for Apriel-1.5-15b-Thinker by AutisticPancake)
👍
1
7
#7 opened 27 days ago by
Doctor-Chad-PhD
New activity in
ServiceNow-AI/Apriel-1.5-15b-Thinker
27 days ago
Doesn't stop thinking.
👍
1
9
#3 opened 27 days ago by
phil111
New activity in
Qwen/Qwen3-ASR-Demo
about 2 months ago
Request failed (Status: 400): <400> InternalError.Algo.InvalidParameter: The audio is too long
#1 opened about 2 months ago by
Doctor-Chad-PhD
New activity in
tencent/Hunyuan-MT-7B
about 2 months ago
GGUF quantization error
👀
4
6
#1 opened about 2 months ago by
Doctor-Chad-PhD
New activity in
deepseek-ai/DeepSeek-V3.1
2 months ago
recommended temp?
1
#19 opened 2 months ago by
createthis
New activity in
huihui-ai/Huihui-gpt-oss-120b-BF16-abliterated
2 months ago
Abliteration request
🤯
1
#4 opened 2 months ago by
Doctor-Chad-PhD
New activity in
Intel/Qwen3-30B-A3B-Instruct-2507-gguf-q2ks-mixed-AutoRound
3 months ago
Scored 0.71 on MMLU Pro
🔥
5
1
#2 opened 3 months ago by
xbruce22
New activity in
openai/gpt-oss-120b
3 months ago
Disgusting, maximally censored model!
👍
35
16
#56 opened 3 months ago by
Lord-Kvento
New activity in
bartowski/tencent_Hunyuan-4B-Instruct-GGUF
3 months ago
HF detecting viruses?
2
#1 opened 3 months ago by
JankyMudFart
New activity in
arcee-ai/AFM-4.5B-GGUF
3 months ago
Question about pretraining data
#1 opened 3 months ago by
Doctor-Chad-PhD
New activity in
Qwen/Qwen3-30B-A3B-Instruct-2507
3 months ago
An Improvement, But Q3 30b Still Has Very Little General Knowledge
👍
❤️
3
11
#2 opened 3 months ago by
phil111
New activity in
zai-org/GLM-4.1V-9B-Thinking
3 months ago
Please make MOE models
👍
3
12
#6 opened 4 months ago by
Narutoouz
New activity in
ByteDance-Seed/Seed-X-Instruct-7B
3 months ago
LM Studio with GGUF
7
#1 opened 3 months ago by
qixing
New activity in
ikawrakow/Qwen3-30B-A3B
3 months ago
GitHub account and ik_llama.cpp are down?!
😔
👍
4
60
#2 opened 3 months ago by
ubergarm
New activity in
moonshotai/Kimi-K2-Instruct
4 months ago
Kimi-K2-Mini
👍
22
15
#1 opened 4 months ago by
PSM24
New activity in
moelanoby/phi-3-M3-coder
4 months ago
🚩 Report: Spam
😔
➕
3
3
#3 opened 4 months ago by
kodecreer
New activity in
deepseek-ai/DeepSeek-R1-0528-Qwen3-8B
4 months ago
generation_config.json is missing
👀
👍
6
2
#5 opened 5 months ago by
Doctor-Chad-PhD
New activity in
osmosis-ai/Osmosis-Apply-1.7B
4 months ago
Error generating gguf quantization
3
#1 opened 4 months ago by
Doctor-Chad-PhD
New activity in
huggingface/InferenceSupport
4 months ago
tencent/Hunyuan-A13B-Instruct
🤗
🚀
40
4
#3004 opened 4 months ago by
celsowm
Load more