Self-Training on Image Comprehension

https://stic-lvlm.github.io/

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

lupantech authored a paper 21 days ago

LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention

lupantech authored a paper 21 days ago

Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models

lupantech authored a paper 21 days ago

MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts

View all activity

STIC-LVLM's activity

lupantech

authored 19 papers 21 days ago

LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention

Paper • 2303.16199 • Published Mar 28, 2023 • 4

Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models

Paper • 2304.09842 • Published Apr 19, 2023 • 1

MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts

Paper • 2310.02255 • Published Oct 3, 2023 • 2

SciBench: Evaluating College-Level Scientific Problem-Solving Abilities of Large Language Models

Paper • 2307.10635 • Published Jul 20, 2023 • 9

Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering

Paper • 2209.09513 • Published Sep 20, 2022 • 1

Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning

Paper • 2209.14610 • Published Sep 29, 2022

Enhancing Large Vision Language Models with Self-Training on Image Comprehension

Paper • 2405.19716 • Published May 30, 2024

A Survey of Deep Learning for Mathematical Reasoning

Paper • 2212.10535 • Published Dec 20, 2022

UniGeo: Unifying Geometry Logical Reasoning via Reformulating Mathematical Expression

Paper • 2212.02746 • Published Dec 6, 2022

Inter-GPS: Interpretable Geometry Problem Solving with Formal Language and Symbolic Reasoning

Paper • 2105.04165 • Published May 10, 2021

VDebugger: Harnessing Execution Feedback for Debugging Visual Programs

Paper • 2406.13444 • Published Jun 19, 2024

MRAG-Bench: Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models

Paper • 2410.08182 • Published Oct 10, 2024

Lila: A Unified Benchmark for Mathematical Reasoning

Paper • 2210.17517 • Published Oct 31, 2022

VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning

Paper • 2412.02172 • Published Dec 3, 2024

OctoTools: An Agentic Framework with Extensible Tools for Complex Reasoning

Paper • 2502.11271 • Published 23 days ago • 16

ydeng9

authored a paper 30 days ago

DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails

Paper • 2502.05163 • Published Feb 7 • 22

AI & ML interests

Recent Activity

Team members 2

STIC-LVLM's activity