Collections

Discover the best community collections!

Collections trending this week

Text_trajectory_videogen

DragNUWA: Fine-grained Control in Video Generation by Integrating Text, Image, and Trajectory

Paper • 2308.08089 • Published Aug 16, 2023 • 22
CoTracker: It is Better to Track Together

Paper • 2307.07635 • Published Jul 14, 2023 • 18

Nougat: Neural Optical Understanding for Academic Documents

Paper • 2308.13418 • Published Aug 25, 2023 • 37
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

Paper • 2307.02499 • Published Jul 4, 2023 • 15
Text Rendering Strategies for Pixel Language Models

Paper • 2311.00522 • Published Nov 1, 2023 • 12

Medical-Related

Models of all types of tasks that relate to medical matters.

DunnBC22/yolos-small-Axial_MRIs

Object Detection • Updated Aug 19, 2023 • 65 • 2
DunnBC22/bert-base-cased-finetuned-ner-BC2GM-IOB

Token Classification • Updated Aug 2, 2023 • 37 • 1
DunnBC22/efficientnet-b5-Brain_Tumors_Image_Classification

Image Classification • Updated Jul 25, 2023 • 31
DunnBC22/vit-base-patch16-224-in21k_lung_and_colon_cancer

Image Classification • Updated Jul 25, 2023 • 2.48k • 4

Text Summarization

DunnBC22/pegasus-multi_news-NewsSummarization_BBC

Summarization • Updated May 12, 2023 • 683 • 2
DunnBC22/flan-t5-base-text_summarization_data_6_epochs

Summarization • Updated Jul 23, 2023 • 58 • 2
DunnBC22/flan-t5-base-text_summarization_data

Summarization • Updated May 13, 2023 • 52 • 2
DunnBC22/led-base-16384-text_summarization_data

Summarization • Updated Jul 24, 2023 • 38 • 1

Running

61

Grobid

🌍

Extract bibliographic data from PDFs

LLaSM: Large Language and Speech Model

Paper • 2308.15930 • Published Aug 30, 2023 • 33
SpeechX: Neural Codec Language Model as a Versatile Speech Transformer

Paper • 2308.06873 • Published Aug 14, 2023 • 26
AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining

Paper • 2308.05734 • Published Aug 10, 2023 • 37
JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models

Paper • 2308.04729 • Published Aug 9, 2023 • 32

Large Language Models as Optimizers

Paper • 2309.03409 • Published Sep 7, 2023 • 76
Deci/DeciCoder-1b

Text Generation • Updated Feb 15, 2024 • 3.51k • 246

VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation

Paper • 2309.00398 • Published Sep 1, 2023 • 22
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning

Paper • 2307.04725 • Published Jul 10, 2023 • 64
LEDITS: Real Image Editing with DDPM Inversion and Semantic Guidance

Paper • 2307.00522 • Published Jul 2, 2023 • 32
VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning

Paper • 2309.15091 • Published Sep 26, 2023 • 33

Llm_long_context

YaRN: Efficient Context Window Extension of Large Language Models

Paper • 2309.00071 • Published Aug 31, 2023 • 68
