Scholar

Benjamin Coleman

Google Scholar ID: fInuVkEAAAAJ

Google DeepMind

Machine LearningData Structures and Algorithms

Homepage↗Google Scholar↗

Citations & Impact

All-time

Citations

1,551

H-index

i10-index

Publications

Co-authors

list available

Contact

GitHubOpen ↗

Publications

8 items

PACEvolve++: Improving Test-time Learning for Evolutionary Search Agents

2026

Cited

AutoCSF: Provably Space-Efficient Indexing of Skewed Key-Value Workloads via Filter-Augmented Compressed Static Functions

2026

Cited

AgenticTagger: Structured Item Representation for Recommendation with LLM Agents

2026

Cited

PACEvolve: Enabling Long-Horizon Progress-Aware Consistent Evolution

2026

Cited

Breaking the Curse of Dimensionality: On the Stability of Modern Vector Retrieval

2025

Cited

Evo-Memory: Benchmarking LLM Agent Test-time Learning with Self-Evolving Memory

2025

Cited

Gemma 3 Technical Report

2025

Cited

Down with the Hierarchy: The 'H' in HNSW Stands for "Hubs"

arXiv.org · 2024

Cited

Resume (English only)

Academic Achievements

One-Pass Diversified Sampling with Application to Terabyte-Scale Genomic Sequence Streams (ICML22)
Practical Near Neighbor Search via Group Testing (NeurIPS21, Spotlight Talk - Top 3%)
A One-Pass Distributed and Private Sketch for Kernel Sums with Applications to Machine Learning at Scale (CCS21)
Sub-linear RACE Sketches for Approximate Kernel Density Estimation on Streaming Data (WWW20)
Sub-linear Memory Sketches for Near Neighbor Search on Streaming Data (ICML20)
Revisiting Consistent Hashing with Bounded Loads (AAAI21)
Fast processing and querying of 170TB of genomics data via a repeated and merged bloom filter (RAMBO) (SIGMOD21)

Research Experience

Current work focuses on efficient approximate algorithms for low-level building blocks of machine learning, such as kernel sums and near-neighbor search, as well as fast training and inference.

Background

Interested in randomized algorithms for scalable machine learning. By replacing expensive exact algorithms with lightweight approximate methods, the resources needed to run a program can be substantially reduced. Particularly interested in simple methods with theoretical guarantees that also work well in a web-scale production environment.

Co-authors

4 total

Anshumali Shrivastava

Rice University, ThirdAI Corp.

Co-author 2

Co-author 3

Richard Baraniuk

C. Sidney Burrus Professor, Electrical & Computer Engineering, Rice University