Scholar
Malte Ostendorff
Google Scholar ID: 8WfhSIcAAAAJ
University of Göttingen / German Research Center for Artificial Intelligence
Large language models
Recommender systems
Information retrieval
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
913
H-index
15
i10-index
20
Publications
20
Co-authors
5
list available
Contact
Twitter
Open ↗
GitHub
Open ↗
LinkedIn
Open ↗
Publications
6 items
CommonLID: Re-evaluating State-of-the-Art Language Identification Performance on Web Data
2026
Cited
0
SciLaD: A Large-Scale, Transparent, Reproducible Dataset for Natural Scientific Language Processing
2025
Cited
0
Table Understanding and (Multimodal) LLMs: A Cross-Domain Case Study on Scientific vs. Non-Scientific Data
2025
Cited
0
MMTEB: Massive Multilingual Text Embedding Benchmark
2025
Cited
0
Data Processing for the OpenGPT-X Model Family
arXiv.org · 2024
Cited
2
Teuken-7B-Base & Teuken-7B-Instruct: Towards European LLMs
arXiv.org · 2024
Cited
8
Resume (English only)
Co-authors
5 total
Georg Rehm
Principal Researcher and Research Fellow, DFKI GmbH
Co-author 2
Pedro Ortiz Suarez
Principal Research Scientist, Common Crawl Foundation
Terry Ruas
University of Göttingen (Prev: Uni. of Michigan, NII Tokyo, Uni. of Wuppertal, UFABC)
Co-author 5
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up