Databricks

company

Verified

https://www.databricks.com

databricks

databricks

AI & ML interests

None defined yet.

abhaygupta

authored a paper 2 months ago

$μ$nit Scaling: Simple and Scalable FP8 LLM Training

Paper • 2502.05967 • Published Feb 9

mylesbaker

in databricks/databricks-dolly-15k 6 months ago

Your employees were clearly bored

#18 opened 6 months ago by

sam-mosaic

authored 2 papers 9 months ago

LIMIT: Less Is More for Instruction Tuning Across Evaluation Paradigms

Paper • 2311.13133 • Published Nov 22, 2023

MosaicBERT: A Bidirectional Encoder Optimized for Fast Pretraining

Paper • 2312.17482 • Published Dec 29, 2023 • 1

abhaygupta

authored a paper about 1 year ago

DAiSEE: Towards User Engagement Recognition in the Wild

Paper • 1609.01885 • Published Sep 7, 2016

Ziyi-Yang

authored 4 papers about 1 year ago

Unifying Vision, Text, and Layout for Universal Document Processing

Paper • 2212.02623 • Published Dec 5, 2022 • 11

i-Code Studio: A Configurable and Composable Framework for Integrative AI

Paper • 2305.13738 • Published May 23, 2023 • 1

CoDi-2: In-Context, Interleaved, and Interactive Any-to-Any Generation

Paper • 2311.18775 • Published Nov 30, 2023 • 6

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22, 2024 • 257

ankner

authored a paper about 1 year ago

Vid3D: Synthesis of Dynamic 3D Scenes using 2D Video Diffusion

Paper • 2406.11196 • Published Jun 17, 2024 • 8

Ziyi-Yang

authored a paper about 1 year ago

Self-Exploring Language Models: Active Preference Elicitation for Online Alignment

Paper • 2405.19332 • Published May 29, 2024 • 23

sam-mosaic

authored a paper over 1 year ago

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15, 2024 • 89

abhaygupta

authored 4 papers over 1 year ago

RevBiFPN: The Fully Reversible Bidirectional Feature Pyramid Network

Paper • 2206.14098 • Published Jun 28, 2022

SPDF: Sparse Pre-training and Dense Fine-tuning for Large Language Models

Paper • 2303.10464 • Published Mar 18, 2023 • 1

Sparse Iso-FLOP Transformations for Maximizing Training Efficiency

Paper • 2303.11525 • Published Mar 21, 2023 • 1

Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment

Paper • 2405.03594 • Published May 6, 2024 • 7

ankner

authored 3 papers over 1 year ago

Dynamic Masking Rate Schedules for MLM Pretraining

Paper • 2305.15096 • Published May 24, 2023

Striped Attention: Faster Ring Attention for Causal Transformers

Paper • 2311.09431 • Published Nov 15, 2023 • 4

3D Neural Field Generation using Triplane Diffusion

Paper • 2211.16677 • Published Nov 30, 2022

abhi-db

updated a model over 1 year ago

databricks/dbrx-instruct

Text Generation • 132B • Updated Apr 19, 2024 • 11.4k • 1.12k