Efficient Large Language Model Collection: Shortened LLMs from Depth Pruning; https://github.com/Nota-NetsPresso/shortened-llm • 15 items
Assessing the Answerability of Queries in Retrieval-Augmented Code Generation Paper • 2411.05547 • Published Nov 8, 2024
Shortened LLaMA: A Simple Depth Pruning for Large Language Models Paper • 2402.02834 • Published Feb 5, 2024
LD-Pruner: Efficient Pruning of Latent Diffusion Models using Task-Agnostic Insights Paper • 2404.11936 • Published Apr 18, 2024
Automatic Neural Network Pruning that Efficiently Preserves the Model Accuracy Paper • 2111.09635 • Published Nov 18, 2021
A Unified Compression Framework for Efficient Speech-Driven Talking-Face Generation Paper • 2304.00471 • Published Apr 2, 2023
Deep Model Compression Also Helps Models Capture Ambiguity Paper • 2306.07061 • Published Jun 12, 2023
On Architectural Compression of Text-to-Image Diffusion Models Paper • 2305.15798 • Published May 25, 2023
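The collection's central technique, depth pruning, removes entire Transformer decoder blocks rather than thinning individual weight matrices. As a rough illustration only (not the Shortened LLaMA implementation; the checkpoint name and layer indices below are placeholder assumptions), dropping layers from a LLaMA-style model in Hugging Face transformers can look like this:

```python
# Hypothetical sketch of depth pruning: delete whole decoder layers from a
# LLaMA-style model. The checkpoint and the set of layers to drop are
# illustrative assumptions; the papers above select layers by importance
# scores and then fine-tune (e.g., with LoRA) to recover accuracy.
import torch
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",  # placeholder checkpoint
    torch_dtype=torch.float16,
)

layers_to_drop = {21, 22, 23, 27}  # placeholder indices, for illustration only
kept = torch.nn.ModuleList(
    layer for i, layer in enumerate(model.model.layers) if i not in layers_to_drop
)
for new_idx, layer in enumerate(kept):
    # Re-index attention layers so KV-cache bookkeeping stays consistent
    # (attribute present in recent transformers versions).
    layer.self_attn.layer_idx = new_idx
model.model.layers = kept
model.config.num_hidden_layers = len(kept)

# Shortened model, ready for recovery fine-tuning.
model.save_pretrained("llama-2-7b-depth-pruned")
```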