Robert Csordas

robertcsordas
https://robertcsordas.github.io/
  • robert_csordas
  • robertcsordas
  • robertcsordas
  • robertcsordas.bsky.social

AI & ML interests

Systematic generalization, algorithmic reasoning

Organizations

None yet

authored 5 papers 8 months ago

Randomized Positional Encodings Boost Length Generalization of Transformers

Paper • 2305.16843 • Published May 26, 2023 • 2

Mindstorms in Natural Language-Based Societies of Mind

Paper • 2305.17066 • Published May 26, 2023 • 3

MoEUT: Mixture-of-Experts Universal Transformers

Paper • 2405.16039 • Published May 25, 2024 • 2

MrT5: Dynamic Token Merging for Efficient Byte-level Language Models

Paper • 2410.20771 • Published Oct 28, 2024 • 3

A Modern Self-Referential Weight Matrix That Learns to Modify Itself

Paper • 2202.05780 • Published Feb 11, 2022

authored a paper almost 2 years ago

SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention

Paper • 2312.07987 • Published Dec 13, 2023 • 41

authored a paper about 2 years ago

Approximating Two-Layer Feedforward Networks for Efficient Transformers

Paper • 2310.10837 • Published Oct 16, 2023 • 11