🤖 AI Summary
Health informatics research faces significant challenges in multimodal data fusion, rapid domain knowledge evolution, and high demands for cross-disciplinary integration; current large language model (LLM)-based agents exhibit notable limitations in medical visual understanding and domain-specific quality assurance. This paper introduces the first fully autonomous research agent system tailored for health informatics, built upon a multi-agent architecture integrating literature parsing, data modeling, code generation, LaTeX authoring, vision-language understanding, and standardized validation modules. We innovatively incorporate a vision-language feedback mechanism and a domain-specific quality control protocol to enable semantic interpretation of medical images and ensure end-to-end traceability. The system autonomously executes the entire research pipeline—from systematic literature review and statistical analysis to manuscript drafting—producing academically compliant, reproducible, and submission-ready LaTeX papers. This advances research efficiency, methodological transparency, and clinical applicability.
📝 Abstract
Health informatics research is characterized by diverse data modalities, rapid knowledge expansion, and the need to integrate insights across biomedical science, data analytics, and clinical practice. These characteristics make it particularly well-suited for agent-based approaches that can automate knowledge exploration, manage complex workflows, and generate clinically meaningful outputs. Recent progress in large language model (LLM)-based agents has demonstrated promising capabilities in literature synthesis, data analysis, and even end-to-end research execution. However, existing systems remain limited for health informatics because they lack mechanisms to interpret medical visualizations and often overlook domain-specific quality requirements. To address these gaps, we introduce OpenLens AI, a fully automated framework tailored to health informatics. OpenLens AI integrates specialized agents for literature review, data analysis, code generation, and manuscript preparation, enhanced by vision-language feedback for medical visualization and quality control for reproducibility. The framework automates the entire research pipeline, producing publication-ready LaTeX manuscripts with transparent and traceable workflows, thereby offering a domain-adapted solution for advancing health informatics research.