-
tencent/KaLM-Embedding-Gemma3-12B-2511
Sentence Similarity • 12B • Updated • 5.79k • 55 -
nvidia/llama-embed-nemotron-8b
Feature Extraction • 8B • Updated • 399k • 116 -
Qwen/Qwen3-Embedding-8B
Feature Extraction • 8B • Updated • 1.29M • • 525 -
Qwen/Qwen3-Embedding-4B
Feature Extraction • 4B • Updated • 446k • 202
Aleksei Dorkin PRO
adorkin
AI & ML interests
Computational Linguistics
Recent Activity
liked
a dataset
about 11 hours ago
microsoft/PatientSafetyBench
liked
a model
about 11 hours ago
nvidia/magpie_tts_multilingual_357m
liked
a dataset
about 11 hours ago
facebook/map-anything
Organizations
Multilingual Text Embedding Models
-
tencent/KaLM-Embedding-Gemma3-12B-2511
Sentence Similarity • 12B • Updated • 5.79k • 55 -
nvidia/llama-embed-nemotron-8b
Feature Extraction • 8B • Updated • 399k • 116 -
Qwen/Qwen3-Embedding-8B
Feature Extraction • 8B • Updated • 1.29M • • 525 -
Qwen/Qwen3-Embedding-4B
Feature Extraction • 4B • Updated • 446k • 202
Code RL Datasets
spaces
6
Sleeping
1
NLI Zero Shot Classification
🔍
Zero-shot classification based on natural language inference
Sleeping
2
GliLem
🤓
Lemmatization disambiguation for Estonian with GliNER
Running
SigLIP2 + Clothes
🤔
Text-to-image clothing search using SigLIP2
Sleeping
1
M-CLIP + Clothes
🦀
Text-to-image clothing search using multilingual CLIP
Sleeping
1
Tweet Emoji Predictor
🧐
Predict an emoji for your tweet (...your X?)
Sleeping
Sõnajaht Demo
🐠
Keeltevaheline pöördsõnastik
datasets
15
adorkin/tulu-3-sft-mixture
Viewer
•
Updated
•
939k
•
1
adorkin/extended_tweet_emojis
Viewer
•
Updated
•
52.7k
•
66
•
3
adorkin/cosmopedia-v2-translate-append-instructions-et
Viewer
•
Updated
•
6.85k
•
18
adorkin/flan-v2-converted-en
Viewer
•
Updated
•
58.2k
•
10
adorkin/mala-bilingual-et-en-scores
Viewer
•
Updated
•
50.9M
•
49
adorkin/dclm-sample-13k-en-et-translation
Viewer
•
Updated
•
13.7k
•
10
adorkin/nllb-et-en-scores
Viewer
•
Updated
•
22M
•
20
adorkin/Magpie-Llama-3.1-Pro-300K-Filtered-18K-sample-et
Viewer
•
Updated
•
36.6k
•
21
•
1
adorkin/general-instruction-augmented-corpora
Viewer
•
Updated
•
20M
•
276
•
1
adorkin/dbpedia-entity-est
Viewer
•
Updated
•
4.69M
•
26