Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Emet Research
EmetTheGolum
Follow
0 followers
·
11 following
https://emetresearch.ai/
Freeman_Lewin
freemanlewin
AI & ML interests
Data and Data Acquisition - A partner, not a vendor. Maintainer of Proprietary Datasets. NLP, Multimodal data processing and sharing
Recent Activity
reacted
to
fdaudens
's
post
with ❤️
19 days ago
Reminder: Don’t. Use. ChatGPT. As. A. Calculator. Seriously. 🤖 Loved listening to @sasha on Hard Fork—it really made me think. A few takeaways that hit home: - Individual culpability only gets you so far. The real priority: demanding accountability and transparency from companies. - Evaluate if generative AI is the right tool for certain tasks (like search) before using it. Curious about the full conversation? https://www.nytimes.com/2025/01/17/podcasts/hardfork-tiktok-rednote-environment.html. Give it a listen—it’s worth it! 🌍
reacted
to
ezgikorkmaz
's
post
with 🚀
19 days ago
If you are interested in reinforcement learning, a recent paper I wrote introduces foundational analysis on deep reinforcement learning decision making and representations learnt by it. Link: https://bsky.app/profile/ezgikorkmaz.bsky.social/post/3lfpgsrn6sc2m
reacted
to
cfahlgren1
's
post
with ❤️
2 months ago
You can clean and format datasets entirely in the browser with a few lines of SQL. In this post, I replicate the process @mlabonne used to clean the new https://huggingface.co/datasets/microsoft/orca-agentinstruct-1M-v1 dataset. The cleaning process consists of: - Joining the separate splits together / add split column - Converting string messages into list of structs - Removing empty system prompts https://huggingface.co/blog/cfahlgren1/the-beginners-guide-to-cleaning-a-dataset Here's his new cleaned dataset: https://huggingface.co/datasets/mlabonne/orca-agentinstruct-1M-v1-cleaned
View all activity
Organizations
models
None public yet
datasets
1
EmetTheGolum/Test
Updated
Nov 22, 2024
•
36