joy larkin's picture
1 7

joy larkin

joylarkin

AI & ML interests

Global AI, Multilingual AI, European AI, AGI, ASI, AI Data, Datasets, Data workflows, LLMs, Fine-tuning, Evals, etc. ••• AI Marketer, Evangelist, Technologist ••• Head of Marketing/GTM @ Airtrain AI.

Recent Activity

Organizations

Social Post Explorers's profile picture Airtrain AI's profile picture

joylarkin's activity

replied to m-ric's post about 2 months ago
view reply

Ok, as someone who communicates to technical audiences, this is a great tool. The only other stats tool for PyPI I'm aware of doesn't measure cumulative package performance like this does.

For everyone else, share this with the people who do comms (internal or external) for your team: marketers, product marketers, devrel, community, etc.

reacted to merve's post with 🔥 2 months ago
view post
Post
1974
Amazing past days at open ML, it's raining coding models, let's have a recap 🌧️ Find all models and datasets here merve/nov-15-releases-67372d0ebdc354756a52ecd0

Models
💻 Coding: Qwen team released two Qwen2.5-Coder checkpoints of 32B and 7B. Infly released OpenCoder: 1.5B and 8B coding models with instruction SFT'd versions and their datasets! 💗

🖼️ Image/Video Gen: Alibaba vision lab released In-context LoRA -- 10 LoRA models on different themes based on Flux. Also Mochi the sota video generation model with A2.0 license now comes natively supported in diffusers 👏

🖼️ VLMs/Multimodal: NexaAIDev released Omnivision 968M a new vision language model aligned with DPO for reducing hallucinations, also comes with GGUF ckpts 👏 Microsoft released LLM2CLIP, a new CLIP-like model with longer context window allowing complex text inputs and better search

🎮 AGI?: Etched released Oasis 500M, a diffusion based open world model that takes keyboard input and outputs gameplay 🤯

Datasets
Common Corpus: A text dataset with 2T tokens with permissive license for EN/FR on various sources: code, science, finance, culture 📖
posted an update 4 months ago
view post
Post
2630
💬 Chat as a way to query SQL! The Airtrain AI team is happy to share a new Hugging Face Space that lets you interact with Hugging Face Hub datasets using a natural language chatbot. 🤗

Start Exploring 👉 airtrain-ai/hf-dataset-chat-to-sql

This Space is forked from davidberenstein1957/text-to-sql-hub-datasets by  @davidberenstein1957 and features chat capability with improved table naming. The tool works with Hugging Face’s recently released in-browser DuckDB-based SQL query engine for datasets.