Alexey M

bzikst

AI & ML interests

None yet

Recent Activity

View all activity

Organizations

None yet

bzikst's activity

reacted to merve's post with πŸ‘ 4 days ago
view post
Post
4446
Your weekly recap of open AI is here, and it's packed with models! merve/feb-14-releases-67af876b404cc27c6d837767

πŸ‘€ Multimodal
> OpenGVLab released InternVideo 2.5 Chat models, new video LMs with long context
> AIDC released Ovis2 model family along with Ovis dataset, new vision LMs in different sizes (1B, 2B, 4B, 8B, 16B, 34B), with video and OCR support
> ColQwenStella-2b is a multilingual visual retrieval model that is sota in it's size
> Hoags-2B-Exp is a new multilingual vision LM with contextual reasoning, long context video understanding

πŸ’¬ LLMs
A lot of math models!
> Open-R1 team released OpenR1-Math-220k large scale math reasoning dataset, along with Qwen2.5-220K-Math fine-tuned on the dataset, OpenR1-Qwen-7B
> Nomic AI released new Nomic Embed multilingual retrieval model, a MoE with 500 params with 305M active params, outperforming other models
> DeepScaleR-1.5B-Preview is a new DeepSeek-R1-Distill fine-tune using distributed RL on math
> LIMO is a new fine-tune of Qwen2.5-32B-Instruct on Math

πŸ—£οΈ Audio
> Zonos-v0.1 is a new family of speech recognition models, which contains the model itself and embeddings

πŸ–ΌοΈ Vision and Image Generation
> We have ported DepthPro of Apple to transformers for your convenience!
> illustrious-xl-v1.0 is a new illustration generation model
Β·
replied to merve's post 4 days ago
view reply

As I can see, Zonos is text-to-speech model, not ASR

New activity in bzikst/faster-whisper-large-v3-ru-podlodka 5 months ago

Podlodka int8 version

6
#1 opened 5 months ago by
saintman
New activity in bzikst/faster-whisper-large-v3-russian 5 months ago

int-8 version

3
#1 opened 7 months ago by
saintman
New activity in audeering/wav2vec2-large-robust-24-ft-age-gender about 1 year ago

Small fix in example usage

1
#3 opened about 1 year ago by
bzikst