Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
gbyuvd
/
Mo2BERTa-proto
like
0
Fill-Mask
English
qwen3
mixture-of-recursions
adaptive-computation
bert
encoder
mlm
research
proof-of-concept
arxiv:
2507.10524
arxiv:
2305.07759
License:
mit
Model card
Files
Files and versions
xet
Community
Copy to bucket
new
main
Mo2BERTa-proto
60 MB
Ctrl+K
Ctrl+K
1 contributor
History:
34 commits
gbyuvd
Update README.md
3b8836c
verified
2 months ago
checkpoints
Upload 8 files
2 months ago
.gitattributes
1.58 kB
Upload TinyStories-valid.txt
2 months ago
MoRBERT.ipynb
1.2 MB
Upload 8 files
2 months ago
README.md
26.1 kB
Update README.md
2 months ago
TinyStories-valid.txt
Safe
19.4 MB
xet
Upload TinyStories-valid.txt
2 months ago
config.json
896 Bytes
Dummy for Download Tracking
2 months ago
training_logs_isodepth.json
72.5 kB
Upload 8 files
2 months ago
training_logs_isoparam.json
82.7 kB
Upload 8 files
2 months ago
training_logs_mor_bert.json
240 kB
Upload 8 files
2 months ago