Follow
Andrea Santilli
Andrea Santilli
Sr. Research Engineer @ NVIDIA
Verified email at nvidia.com - Homepage
Title
Cited by
Cited by
Year
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models
A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ...
TMLR 2022 - Transactions on Machine Learning Research, 2023
26572023
Multitask prompted training enables zero-shot task generalization
V Sanh, A Webson, C Raffel, SH Bach, L Sutawika, Z Alyafeai, A Chaffin, ...
ICLR 2022 - International Conference on Learning Representations, 2022
25502022
Bloom: A 176b-parameter open-access multilingual language model
BS Workshop, TL Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, ...
arXiv preprint arXiv:2211.05100, 2022
24712022
Promptsource: An integrated development environment and repository for natural language prompts
SH Bach, V Sanh, ZX Yong, A Webson, C Raffel, NV Nayak, A Sharma, ...
ACL 2022 (Demo) - Proceedings of the 60th Annual Meeting of the Association …, 2022
4532022
Accelerating Transformer Inference for Translation via Parallel Decoding
A Santilli, S Severino, E Postolache, V Maiorca, M Mancusi, R Marin, ...
ACL 2023 (Main) - Proceedings of the 61st Annual Meeting of the Association …, 2023
1372023
KERMIT: Complementing transformer architectures with encoders of explicit syntactic interpretations
FM Zanzotto, A Santilli, L Ranaldi, D Onorati, P Tommasino, F Fallucchi
EMNLP 2020 (Main) - Proceedings of the 2020 conference on empirical methods …, 2020
732020
Camoscio: An italian instruction-tuned llama
A Santilli, E Rodolà
🏆 CLiC-it 2023 - Proceedings of the 9th Italian Conference on Computational …, 2023
552023
Preserving privacy in large language models: A survey on current threats and solutions
M Miranda, ES Ruzzetti, A Santilli, FM Zanzotto, S Bratières, E Rodolà
TMLR 2024 - Transactions on Machine Learning Research, 2024
492024
Fauno: The Italian Large Language Model that will leave you senza parole!
A Bacciu, G Trappolini, A Santilli, E Rodolà, F Silvestri
IIR 2023 - Proceedings of the 13th Italian Information Retrieval Workshop, 2023
372023
Efficient and effective uncertainty quantification for LLMs
M Xiong, A Santilli, M Kirchhof, A Golinski, S Williamson
Neurips Safe Generative AI Workshop 2024, 2024
252024
Multimodal Neural Databases
G Trappolini, A Santilli, E Rodolà, A Halevy, F Silvestri
ACM SIGIR 2023 - Proceedings of the 46th International ACM SIGIR Conference …, 2023
202023
Latent autoregressive source separation
E Postolache, G Mariani, M Mancusi, A Santilli, L Cosmo, E Rodola
AAAI 2023 - Proceedings of the AAAI Conference on Artificial Intelligence 37 …, 2023
182023
Revisiting Uncertainty Quantification Evaluation in Language Models: Spurious Interactions with Response Length Bias Results
A Santilli, A Golinski, M Kirchhof, F Danieli, A Blaas, M Xiong, L Zappella, ...
ACL 2025 (Main), 2025
162025
Language models are injective and hence invertible
G Nikolaou, T Mencattini, D Crisostomi, A Santilli, Y Panagakis, E Rodolà
ICLR 2026 - International Conference on Learning Representations, 2025
132025
MERGE : Efficient Evolutionary Merging on Consumer-grade GPUs
T Mencattini, AR Minut, D Crisostomi, A Santilli, E Rodola
ICML 2025, 2025
132025
A kernel-based approach for irony and sarcasm detection in Italian
A Santilli, D Croce, R Basili
EVALITA Evaluation of NLP and Speech Tools for Italian 12, 146, 2018
92018
KERMITviz: Visualizing Neural Network Activations on Syntactic Trees
L Ranaldi, F Fallucchi, A Santilli, FM Zanzotto
Research Conference on Metadata and Semantics Research, 139-147, 2021
82021
Unsupervised source separation via Bayesian inference in the latent domain
M Mancusi, E Postolache, G Mariani, M Fumero, A Santilli, L Cosmo, ...
arXiv preprint arXiv:2110.05313, 2021
82021
Escaping Plato's Cave: Towards the Alignment of 3D and Text Latent Spaces
S Hadgi, L Moschella, A Santilli, D Gomez, Q Huang, E Rodolà, S Melzi, ...
CVPR 2025 - Proceedings of the IEEE/CVF Conference on Computer Vision and …, 2025
62025
Explanatory learning: Beyond empiricism in neural networks
A Norelli, G Mariani, L Moschella, A Santilli, G Parascandolo, S Melzi, ...
arXiv preprint arXiv:2201.10222, 2022
62022
The system can't perform the operation now. Try again later.
Articles 1–20