Feynman Innovations
ajibawa-2023
176 followers · 23 following
AjinkyaBawase
AI & ML interests
LLM, RL, DL, ML, AGI. Developing LLMs (preferably fully fine-tuned) for various use cases.
Recent Activity
Reacted to DmitryRyumin's post, 6 days ago:
New Research Alert - ICCV 2025 (Oral)!
Title: Variance-based Pruning for Accelerating and Compressing Trained Networks
Description: The one-shot pruning method efficiently compresses networks, reducing computation and memory usage while retaining almost full performance and requiring minimal fine-tuning.
Authors: Uranik Berisha, Jens Mehnert, and Alexandru Paul Condurache
Conference: ICCV, 19 – 23 Oct, 2025 | Honolulu, Hawai'i, USA
Paper: https://huggingface.co/papers/2507.12988
ICCV-2023-25-Papers: https://github.com/DmitryRyumin/ICCV-2023-25-Papers
Added to the Efficient Learning Section: https://github.com/DmitryRyumin/ICCV-2023-25-Papers/blob/main/sections/2025/main/efficient-learning.md
More Papers: more cutting-edge research presented at other conferences in the https://huggingface.co/spaces/DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin
Keywords: #VarianceBasedPruning #NetworkCompression #ModelAcceleration #EfficientDeepLearning #VisionTransformers #AI #ICCV2025 #ResearchHighlight
Reacted to onekq's post, 6 days ago:
Context rot is such a catchy phrase, but the problem was identified 2+ years ago under the name attention decay (https://huggingface.co/papers/2307.03172). I spotted the same problem in coding tasks and documented it in my book (https://www.amazon.com/dp/9999331130). Why did this problem become hot again? Because many of us thought long-context models had solved it, which is not true. We were misled by benchmarks. Most long-context benchmarks are built around the QA scenario, i.e. "finding a needle in a haystack". But in agentic scenarios, the model needs to find EVERYTHING in the haystack, and it cannot spare enough attention for that challenge.
Reacted to di-zhang-fdu's post, 6 days ago:
The training dataset of ChemVLM is now open-sourced; take a look! https://huggingface.co/datasets/di-zhang-fdu/chemvlm-sft-datasets Paper: https://huggingface.co/papers/2408.07246
ajibawa-2023's datasets (21)
ajibawa-2023/Persona-100k • Viewer • Updated Jul 13 • 100k • 46 • 5
ajibawa-2023/Reasoning-Maths-College • Viewer • Updated Apr 24 • 965 • 45 • 2
ajibawa-2023/Audio-Children-Stories-Collection-Large • Viewer • Updated Apr 1 • 2.1k • 83 • 8
ajibawa-2023/Audio-Children-Stories-Collection • Viewer • Updated Mar 27 • 600 • 302 • 6
ajibawa-2023/Software-Architecture • Preview • Updated Oct 28, 2024 • 45 • 27
ajibawa-2023/Software-Architectural-Frameworks • Viewer • Updated Oct 4, 2024 • 1.26k • 34 • 9
ajibawa-2023/Maths-College • Viewer • Updated May 8, 2024 • 970k • 203 • 50
ajibawa-2023/Maths-Grade-School • Viewer • Updated May 8, 2024 • 980k • 118 • 27
ajibawa-2023/Education-College-Students • Viewer • Updated Apr 10, 2024 • 254k • 166 • 5
ajibawa-2023/Education-High-School-Students • Viewer • Updated Apr 10, 2024 • 255k • 28 • 9
ajibawa-2023/Education-Young-Children • Viewer • Updated Apr 10, 2024 • 256k • 167 • 13
ajibawa-2023/Education-Researchers • Viewer • Updated Apr 10, 2024 • 255k • 15 • 8
ajibawa-2023/Children-Stories-Collection • Viewer • Updated Mar 16, 2024 • 897k • 465 • 53
ajibawa-2023/General-Stories-Collection • Viewer • Updated Mar 16, 2024 • 1.07M • 346 • 35
ajibawa-2023/OpenHermes-2.5-Code-290k • Updated Feb 19, 2024 • 19 • 7
ajibawa-2023/Code-290k-ShareGPT • Viewer • Updated Jan 16, 2024 • 289k • 85 • 29
ajibawa-2023/Julia-Proof-Pile-2 • Viewer • Updated Dec 26, 2023 • 293k • 13 • 4
ajibawa-2023/Code-74k-ShareGPT • Viewer • Updated Dec 8, 2023 • 73.9k • 32 • 18
ajibawa-2023/SlimOrca-ShareGPT • Viewer • Updated Nov 14, 2023 • 518k • 18 • 7
ajibawa-2023/Python-Code-23k-ShareGPT • Viewer • Updated Nov 11, 2023 • 22.6k • 76 • 41
ajibawa-2023/Mathjson • Viewer • Updated Nov 11, 2023 • 17.1k • 4 • 3