alkinun

AtAndDev

AI & ML interests

LLMs, Alignment, Merging, Unsloth, DPO, SFT, ORPO, SPIN...

Recent Activity

liked a model 1 day ago

deepseek-ai/DeepSeek-V3-0324

liked a model 1 day ago

DevQuasar/analytical_reasoning_r16a32_unsloth-Llama-3.2-3B-Instruct-bnb-4bit

reacted to openfree's post with ❤️ 1 day ago

🚀 DeepSeek V3-0324 + Real-time Research Power! 🌐

Hello there! Today I'm excited to introduce an amazing tool based on the latest DeepSeek V3-0324 model. This isn't just another AI chatbot; it's a true "research assistant" capable of real-time information retrieval and analysis!

https://huggingface.co/spaces/openfree/Deepseek-v3-0324-Research

🧠 Key Strengths of DeepSeek V3-0324

DeepSeek V3-0324, provided by Fireworks AI, comes with these powerful advantages:

🎯 Superior Reasoning: Excellent ability to solve complex problems step-by-step
📚 Extensive Knowledge: Deep understanding across various topics from comprehensive training
🧩 Context Awareness: Maintains long conversation contexts for consistent responses
🌍 Multilingual Support: Processes various languages effectively

🔎 Added Real-time "Deep Research" Capability!

The most exciting feature of this project is the implementation of real-time search functionality similar to ChatGPT's Browse with Bing or Perplexity AI!

🌟 How does it work?

📋 Query Analysis: Analyzes questions to automatically extract optimal search keywords
🌐 Web Search: Utilizes advanced search technology to retrieve the latest information
🧪 Result Analysis: Intelligently analyzes search results and evaluates relevance
💡 Comprehensive Response: Combines freshly retrieved information with AI's existing knowledge

Key Benefits:

⏱️ Up-to-date Information: Always provides the latest data through real-time web searches
📊 Enhanced Reliability: Improves trustworthiness by citing information sources
🔄 Overcoming Knowledge Limitations: Handles questions beyond the AI's training cutoff
🛠️ Research Efficiency: Processes everything from information retrieval to analysis in one go

🖥️ How to Use

It's simple! Just enable the "Deep Research" checkbox and ask your question. The AI will automatically search for and analyze relevant information to provide rich, informed answers.

AtAndDev's activity

upvoted a paper 6 days ago

DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding

Paper • 2503.12797 • Published 10 days ago • 29

upvoted a collection 6 days ago

Gemma 2 Release

15 items • Updated 14 days ago • 217

upvoted a paper 7 days ago

Rewards Are Enough for Fast Photo-Realistic Text-to-image Generation

Paper • 2503.13070 • Published 9 days ago • 9

upvoted a collection 10 days ago

Gemma 3

All versions of Google's new multimodal models in 1B, 4B, 12B, and 27B sizes, in GGUF, dynamic 4-bit, and 16-bit formats. • 29 items • Updated about 9 hours ago • 43

upvoted a paper 11 days ago

Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment

Paper • 2502.04328 • Published Feb 6 • 30

upvoted 2 articles 11 days ago

LeRobot goes to driving school: World’s largest open-source self-driving dataset

Article • 16 days ago • 68

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Article • 15 days ago • 345

upvoted a paper 12 days ago

Silent Branding Attack: Trigger-free Data Poisoning Attack on Text-to-Image Diffusion Models

Paper • 2503.09669 • Published 14 days ago • 34

upvoted a collection 14 days ago

Gemma 3 Release

9 items • Updated 13 days ago • 294

upvoted 2 collections 2 months ago

DeepSeek-R1

8 items • Updated Jan 21 • 593

Qwen2.5-Math

Math-specific model series based on Qwen2.5 • 11 items • Updated Jan 14 • 80

upvoted a paper 2 months ago

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Paper • 2501.11425 • Published Jan 20 • 101

upvoted a collection 2 months ago

Qwen2.5

Qwen2.5 language models, both pretrained and instruction-tuned, in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated 28 days ago • 572

upvoted an article 2 months ago

Welcome Gemma 2 - Google's new open LLM

Article • Jun 27, 2024 • 129

upvoted a paper 2 months ago

OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking

Paper • 2501.09751 • Published Jan 16 • 48