Ilias Ilias
aarabil
AI & ML interests
None yet
Recent Activity
new activity
17 days ago
MoritzLaurer/bge-m3-zeroshot-v2.0:[CLS] token representation or Pooled tokens?
reacted to MoritzLaurer's post with ❤️
28 days ago
#phdone - I defended my PhD yesterday! A key lesson: it is amazing how open science and open source can empower beginners with limited resources:
I first learned about instruction-based classifiers like BERT-NLI 3-4 years ago, through the @HuggingFace ZeroShotClassificationPipeline. Digging deeper into this, it was surprisingly easy to find new datasets, newer base models, and reusable fine-tuning scripts on the HF Hub to create my own zeroshot models - although I didn't know much about fine-tuning at the time.
Thanks to the community effect of the Hub, my models were downloaded hundreds of thousands of times after a few months. Seeing my research being useful for people motivated me to improve and upload newer models. Leaving my contact details in the model cards led to academic cooperation and consulting contracts (and eventually my job at HF).
That's the power of open science & open source: learning, sharing, improving, collaborating.
I mean every word in my thesis acknowledgments (screenshot). I'm very grateful to my supervisors @vanatteveldt @CasAndreu @KasperWelbers for their guidance; to @profAndreaRenda and @CEPS_thinktank for enabling me to work part-time during the first year; to @huggingface for creating awesome tools and an awesome platform; and to many others who are not active on social media.
Links to the full thesis and the collection of my most recent models are below.
PS: If someone happens to speak Latin, let me know if my diploma contains some hidden Illuminati code or something :D
Organizations
None yet
aarabil's activity
[CLS] token representation or Pooled tokens?
#8 opened 17 days ago
by
aarabil
How did you train m3-retromae?
3
#66 opened 6 months ago
by
hotchpotch
Languages
4
#6 opened 8 months ago
by
sarangs
Output does not make sense
2
#1 opened 7 months ago
by
aarabil
Add benchmark to MTEB
6
#7 opened 11 months ago
by
sam-gab
Multilabel binary classification
2
#3 opened 9 months ago
by
aarabil
Binary classification
2
#2 opened 9 months ago
by
aarabil