view article Article Training and Finetuning Embedding Models with Sentence Transformers v3 By tomaarsen • May 28, 2024 • 242
Beyond Context Limits: Subconscious Threads for Long-Horizon Reasoning Paper • 2507.16784 • Published 27 days ago • 116
Promptomatix: An Automatic Prompt Optimization Framework for Large Language Models Paper • 2507.14241 • Published Jul 17 • 17
DriftMoE: A Mixture of Experts Approach to Handle Concept Drifts Paper • 2507.18464 • Published 25 days ago • 11
The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm Paper • 2507.18553 • Published 25 days ago • 39
Diversity-Enhanced Reasoning for Subjective Questions Paper • 2507.20187 • Published 22 days ago • 23
SmallThinker: A Family of Efficient Large Language Models Natively Trained for Local Deployment Paper • 2507.20984 • Published 21 days ago • 54
A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence Paper • 2507.21046 • Published 21 days ago • 79
Persona Vectors: Monitoring and Controlling Character Traits in Language Models Paper • 2507.21509 • Published 20 days ago • 29
Beyond Fixed: Variable-Length Denoising for Diffusion Large Language Models Paper • 2508.00819 • Published 17 days ago • 62
Llama-3.1-FoundationAI-SecurityLLM-8B-Instruct Technical Report Paper • 2508.01059 • Published 16 days ago • 31
AttnTrace: Attention-based Context Traceback for Long-Context LLMs Paper • 2508.03793 • Published 13 days ago • 2
LiveMCPBench: Can Agents Navigate an Ocean of MCP Tools? Paper • 2508.01780 • Published 15 days ago • 13
CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward Paper • 2508.03686 • Published 13 days ago • 32
Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference Paper • 2508.02193 • Published 14 days ago • 123
StepFun-Formalizer: Unlocking the Autoformalization Potential of LLMs through Knowledge-Reasoning Fusion Paper • 2508.04440 • Published 12 days ago • 8