Scholar
Alexandre Marques
Google Scholar ID: p9zb2Y0AAAAJ
Neural Magic
deep learning
pruning
quantization
multi-fidelity analysis
embedded boundary
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
541
H-index
13
i10-index
17
Publications
20
Co-authors
0
Contact
No contact links provided.
Publications
4 items
An Interpretable Latency Model for Speculative Decoding in LLM Serving
2026
Cited
0
Training Machine Learning Models on Encrypted Data: A Privacy-Preserving Framework using Homomorphic Encryption
2026
Cited
0
Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization
2025
Cited
0
"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization
arXiv.org · 2024
Cited
0
Resume (English only)
Co-authors
0 total
Co-authors: 0 (list not available)
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up