4 21 8

Paul Teiletche

paultltc

AI & ML interests

None yet

Recent Activity

liked a Space 5 days ago

HuggingFaceTB/smol-training-playbook

upvoted an article 14 days ago

Sentence Transformers is joining Hugging Face!

updated a Space 19 days ago

ModernVBERT/README

View all activity

Organizations

upvoted an article 14 days ago

Article

Sentence Transformers is joining Hugging Face!

14 days ago

• 72

upvoted an article 23 days ago

Article

Introducing RTEB: A New Standard for Retrieval Evaluation

Oct 1

• 120

upvoted a paper about 1 month ago

ModernVBERT: Towards Smaller Visual Document Retrievers

Paper • 2510.01149 • Published Oct 1 • 30

upvoted 4 articles 4 months ago

Article

Finally, a Replacement for BERT: Introducing ModernBERT

Dec 19, 2024

• 705

Article

Seq vs Seq: the Ettin Suite of Paired Encoders and Decoders

Jul 16

• 74

Article

Introducing ColQwen-Omni: Retrieve in every modality

and 4 others •

Jul 17

• 75

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8

• 712

upvoted a paper 4 months ago

Should We Still Pretrain Encoders with Masked Language Modeling?

Paper • 2507.00994 • Published Jul 1 • 78

upvoted a collection 4 months ago

ERNIE 4.5

Collection

collection of ERNIE 4.5 models. "-Paddle" models use PaddlePaddle weights, while "-PT" models use Transformer-style PyTorch weights. • 26 items • Updated Sep 24 • 174

upvoted an article 7 months ago

Article

Efficient LLM Pretraining: Packed Sequences and Masked Attention

•

Oct 7, 2024

• 58

upvoted a paper 7 months ago

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Paper • 2503.11576 • Published Mar 14 • 117

upvoted 5 articles 8 months ago

Article

DeepSearch Using Visual RAG in Agentic Frameworks 🔎

and 1 other •

Mar 21

• 37

Article

ViDoRe Benchmark V2: Raising the Bar for Visual Retrieval

and 2 others •

Mar 18

• 12

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Jan 23

• 186

Article

SmolVLM - small yet mighty Vision Language Model

Nov 26, 2024

• 381

Article

Introducing smolagents: simple agents that write actions in code.

Dec 31, 2024

• 1.14k

upvoted a paper 12 months ago

RegMix: Data Mixture as Regression for Language Model Pre-training

Paper • 2407.01492 • Published Jul 1, 2024 • 40

upvoted a collection about 1 year ago

Parallel Sentences Datasets

Collection

These datasets all have "english" and "non_english" columns for numerous datasets. They can be used to make embedding models multilingual. • 14 items • Updated Feb 25 • 19

upvoted an article about 1 year ago

Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

•

Jul 5, 2024

• 297

upvoted a collection over 1 year ago

Meta Llama 3

Collection

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 864

Paul Teiletche

AI & ML interests

Recent Activity

Organizations

paultltc's activity

Sentence Transformers is joining Hugging Face!

Introducing RTEB: A New Standard for Retrieval Evaluation

Finally, a Replacement for BERT: Introducing ModernBERT

Seq vs Seq: the Ettin Suite of Paired Encoders and Decoders

Introducing ColQwen-Omni: Retrieve in every modality

SmolLM3: smol, multilingual, long-context reasoner

Efficient LLM Pretraining: Packed Sequences and Masked Attention

DeepSearch Using Visual RAG in Agentic Frameworks 🔎

ViDoRe Benchmark V2: Raising the Bar for Visual Retrieval

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

SmolVLM - small yet mighty Vision Language Model

Introducing smolagents: simple agents that write actions in code.

ColPali: Efficient Document Retrieval with Vision Language Models 👀