Scholar

Ruoxi Sun

Google Scholar ID: ut1-7LAAAAAJ

Columbia University, Google

machine learning, statistics, computational biology

Citations & Impact

Citations (all-time): 4,566

Publications

20 items

Resume (English only)

Academic Achievements

Published multiple papers in top conferences such as ICLR and NeurIPS. Specific publications include:
- Learn-by-interact: Synthesize Large-scale Agent Data with Trajectories by Interacting with Environments
- Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows
- Astute RAG: Overcoming Imperfect Retrieval Augmentation and Knowledge Conflicts for Large Language Models
- From Few to Many: Enhancing Many-Shot In-Context Learning with Optimized Example Selection and Expansion
- CHASE-SQL: Multi-Path Reasoning and Preference Optimized Candidate Selection in Text-to-SQL
- SQL-GEN: Bridging the Dialect Gap for Text-to-SQL Via Synthetic Data And Model Merging
- Learning to Clarify: Multi-turn Conversations with Action-Based Contrastive Self-Training
- BRIGHT: a realistic Benchmark for ReasonInG-Heavy reTrieval
- Teach Better or Show Smarter? On Instructions and Exemplars in Automatic Prompt Optimization
- Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
- Chain of Agents: Large Language Models Collaborating on Long-Context Tasks
- SQL-PaLM: Improved Large Language Model Adaptation for Text-to-SQL
- Capabilities of Gemini Models in Medicine

Research Experience

Senior research scientist at Google Cloud AI Research. Projects include evaluating text-to-SQL workflows and synthesizing large-scale agent data.

Background

Research interests include large language models for code generation, agents, and factuality. Currently a senior research scientist at Google Cloud AI Research.
