joy larkin's picture
1 7

joy larkin

joylarkin

AI & ML interests

Global AI, Multilingual AI, European AI, AGI, ASI, AI Data, Datasets, Data workflows, LLMs, Fine-tuning, Evaluations, etc. โ€ขโ€ขโ€ข Head of Marketing/GTM @ Airtrain AI.

Recent Activity

Organizations

Social Post Explorers's profile picture Airtrain AI's profile picture

joylarkin's activity

replied to m-ric's post 21 days ago
view reply

Ok, as someone who communicates to technical audiences, this is a great tool. The only other stats tool for PyPI I'm aware of doesn't measure cumulative package performance like this does.

For everyone else, share this with the people who do comms (internal or external) for your team: marketers, product marketers, devrel, community, etc.

reacted to merve's post with ๐Ÿ”ฅ about 1 month ago
view post
Post
1967
Amazing past days at open ML, it's raining coding models, let's have a recap ๐ŸŒง๏ธ Find all models and datasets here merve/nov-15-releases-67372d0ebdc354756a52ecd0

Models
๐Ÿ’ป Coding: Qwen team released two Qwen2.5-Coder checkpoints of 32B and 7B. Infly released OpenCoder: 1.5B and 8B coding models with instruction SFT'd versions and their datasets! ๐Ÿ’—

๐Ÿ–ผ๏ธ Image/Video Gen: Alibaba vision lab released In-context LoRA -- 10 LoRA models on different themes based on Flux. Also Mochi the sota video generation model with A2.0 license now comes natively supported in diffusers ๐Ÿ‘

๐Ÿ–ผ๏ธ VLMs/Multimodal: NexaAIDev released Omnivision 968M a new vision language model aligned with DPO for reducing hallucinations, also comes with GGUF ckpts ๐Ÿ‘ Microsoft released LLM2CLIP, a new CLIP-like model with longer context window allowing complex text inputs and better search

๐ŸŽฎ AGI?: Etched released Oasis 500M, a diffusion based open world model that takes keyboard input and outputs gameplay ๐Ÿคฏ

Datasets
Common Corpus: A text dataset with 2T tokens with permissive license for EN/FR on various sources: code, science, finance, culture ๐Ÿ“–
posted an update 3 months ago
view post
Post
2628
๐Ÿ’ฌ Chat as a way to query SQL! The Airtrain AI team is happy to share a new Hugging Face Space that lets you interact with Hugging Face Hub datasets using a natural language chatbot. ๐Ÿค—

Start Exploring ๐Ÿ‘‰ airtrain-ai/hf-dataset-chat-to-sql

This Space is forked from davidberenstein1957/text-to-sql-hub-datasetsย byย  @davidberenstein1957 and features chat capability with improved table naming. The tool works with Hugging Faceโ€™s recently released in-browser DuckDB-based SQL query engine for datasets.