Professor Aline Villavicencio

MPhil, PhD

School of Computer Science

Chair in Natural Language Processing

Director of Equality, Diversity and Inclusion

Member of the Natural Language Processing research group

a.villavicencio@sheffield.ac.uk
+44 114 222 1860

Full contact details

Professor Aline Villavicencio
School of Computer Science
Regent Court (DCS)
211 Portobello
Sheffield
S1 4DP
Profile

Aline Villavicencio received her PhD and MPhil degrees from the University of Cambridge (UK) and an MSc in Computer Science from the Federal University of Rio Grande do Sul (Brazil).

She was a Visiting Scholar at the Massachusetts Institute of Technology (USA), in the Department of Linguistics and Philosophy in 2014/2015 and in the Laboratory for Information and Decision Systems in 2011/2012; at the Laboratoire LaTTiCe at the École Normale Supérieure (France) in 2014; an Erasmus-Mundus Visiting Scholar at Saarland University (Germany) in 2012/2013; and at the University of Bath in 2006-2009.

From 2007 to 2017 she held a Research Fellowship from the Brazilian Scientific Research Council (CNPq). She is also affiliated to the Federal University of Rio Grande do Sul (Brazil).

Research interests

Her research interests are in lexical semantics, multilinguality, and cognitively motivated NLP. This work includes techniques for Multiword Expression treatment using statistical methods and distributional semantic models, and applications like Text Simplification and Question Answering, for languages like English and Portuguese.

Publications

Books

  • Poibeau T & Villavicencio A (2017) . Cambridge University Press.
  • Villavicencio A, Poibeau T, Korhonen A & Alishahi A (2013) Cognitive Aspects of Computational Language Acquisition. Springer Science & Business Media.
  • Caseli HDM, Villavicencio A, Teixeira A & Perdigão F (2012) Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics): Preface.
  • (2002) . Vandenhoeck & Ruprecht.

Edited books

  • Bansal M & Villavicencio A (Eds.) (2019) Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL). Association for Computational Linguistics.
  • Villavicencio A, Moreira V, Abad A, Caseli H, Gamallo P, Ramisch C, Oliveira H & Paetzold G (Eds.) (2018) . Springer.

Journal articles

  • Peng B, He W, Chen B, Villavicencio A & Wu C (2024) . Pattern Recognition Letters, 178, 84-90.
  • Gow-Smith E, Phelps D, Madabushi HT, Scarton C & Villavicencio A (2024) Word Boundary Information Isn't Useful for Encoder Language Models. CoRR, abs/2401.07923.
  • Yamaguchi A, Villavicencio A & Aletras N (2024) Vocabulary Expansion for Low-resource Cross-lingual Transfer. CoRR, abs/2406.11477.
  • He W, Idiart M, Scarton C & Villavicencio A (2024) Enhancing Idiomatic Representation in Multiple Languages via an Adaptive Contrastive Triplet Loss. CoRR, abs/2406.15175.
  • Yamaguchi A, Villavicencio A & Aletras N (2024) An Empirical Study on Cross-lingual Vocabulary Adaptation for Efficient Generative LLM Inference. CoRR, abs/2402.10712.
  • He W, Farrahi K, Chen B, Peng B & Villavicencio A (2023) . Pattern Recognition Letters.
  • Villavicencio A & Van Durme B (2020) Introduction. EMNLP 2020 - Conference on Empirical Methods in Natural Language Processing, Tutorial Abstracts, III.
  • Villavicencio A & Idiart M (2019) . Natural Language Engineering, 25(6), 715-733.
  • Idiart MAP, Villavicencio A, Katz B, Rennó-Costa C & Lisman J (2019) . Frontiers in Computational Neuroscience, 13.
  • Cordeiro S, Villavicencio A, Idiart M & Ramisch C (2019) . Computational Linguistics, 45(1), 1-57.
  • Wilkens R, Vecchia AD, Boito MZ, Padró M & Villavicencio A (2014) , 129-140.
  • Becker N, de Lima Müller J, de Carvalho Rodrigues J, Villavicencio A & de Salles JF (2014) . Letrônica, 7(1), 325-347.
  • Zortea M, Menegola B, Villavicencio A & de Salles JF (2014) . Psicologia: Reflexão e Crítica, 27(1), 90-99.
  • Ramisch C, Villavicencio A & Kordoni V (2013) . ACM Transactions on Speech and Language Processing, 10(2), 1-10.
  • Villavicencio A (2012) . Natural Language Engineering, 18(4), 575-579.
  • De Almeida L, Idiart M, Villavicencio A & Lisman J (2012) . Hippocampus, 22(8), 1647-1651.
  • de Caseli HM, Ramisch C, das Graças Volpe Nunes M & Villavicencio A (2010) . Language Resources and Evaluation, 44(1-2), 59-77.
  • Baldwin T, Kordoni V & Villavicencio A (2009) . Computational Linguistics, 35(2), 119-149.
  • Villavicencio A (2005) . Computer Speech & Language, 19(4), 415-432.
  • Villavicencio A, Bond F, Korhonen A & McCarthy D (2005) . Computer Speech & Language, 19(4), 365-377.
  • He W, Vieira TK, Garcia M, Scarton C, Idiart M & Villavicencio A () . Computational Linguistics, 1-48.
  • He W, Vieira TK, Gonzalez MG, Scarton C, Idiart M & Villavicencio A () Finding Idiomaticity in Word Representations. Computational Linguistics.
  • Soroka G, Idiart M & Villavicencio A () . PLOS ONE, 19(2), e0296217-e0296217.
  • Wilkens R, Zilio L & Villavicencio A () . Language Resources and Evaluation.
  • Salle A & Villavicencio A () Understanding the Effects of Negative (and Positive) Pointwise Mutual Information on Word Vectors. Journal of Experimental and Theoretical Artificial Intelligence.
  • Villavicencio A, Sadler L & Arnold D () . Proceedings of the International Conference on Head-Driven Phrase Structure Grammar.
  • Villavicencio A & Copestake A () . Proceedings of the International Conference on Head-Driven Phrase Structure Grammar.
  • Boito MZ, Villavicencio A & Besacier L () Investigating Alignment Interpretability for Low-resource NMT. Machine Translation.

Chapters

  • Villavicencio A (2020) , IVITRA Research in Linguistics and Literature (pp. viii-xi). John Benjamins Publishing Company
  • Poibeau T & Villavicencio A (2018) , Language, Cognition, and Computational Models (pp. 3-24).
  • Ramisch C & Villavicencio A (2018) , The Oxford Handbook of Computational Linguistics 2nd edition
  • Boos R, Prestes K, Villavicencio A & Padró M (2014) , Lecture Notes in Computer Science (pp. 201-206). Springer International Publishing
  • Parente MA, Villavicencio A, Siqueira M, Chen P & Tonietto L (2013) The Lexical Bootstrapping Hypothesis and conventionality: A crosslinguistic study on verb acquisition by Chinese Mandarin- and Brazilian Portuguese-speaking children, Lexical Bootstrapping: The Role of Lexis and Semantics in Child Language Development (pp. 73-97).
  • Poibeau T, Villavicencio A, Korhonen A & Alishahi A (2013) , Cognitive Aspects of Computational Language Acquisition (pp. 1-25). Springer Berlin Heidelberg
  • Villavicencio A (2011) , Non-Transformational Syntax (pp. 404-442).
  • Arnold D, Sadler L & Villavicencio A (2008) Portuguese: Corpora, coordination and agreement, Roots: Linguistics in Search of its Evidential Base (pp. 9-28).
  • Villavicencio A (2006) , Syntax and Semantics of Prepositions (pp. 115-130).

Conference proceedings papers

  • Phelps D, Pickard T, Mi M, Gow-Smith E & Villavicencio A (2024) Sign of the Times: Evaluating the use of Large Language Models for Idiomaticity Detection. Joint Workshop on Multiword Expressions and Universal Dependencies, MWE-UD 2024 at LREC-COLING 2024 - Workshop Proceedings (pp 178-187)
  • He W, Idiart M, Scarton C & Villavicencio A (2024) Enhancing Idiomatic Representation in Multiple Languages via an Adaptive Contrastive Triplet Loss. ACL (Findings) (pp 12473-12485)
  • Gibbons M, Mi M, Villavicencio A & Song X (2024) ShefCDTeam at SemEval-2024 Task 4: A Text-to-Text Model for Multi-Label Classification. Proceedings of the 18th International Workshop on Semantic Evaluation, SemEval-2024 (pp 1860-1867)
  • Zhao K, Yang B, Lin C, Rong W, Villavicencio A & Cui X (2023) Evaluating Open-Domain Dialogues in Latent Space with Next Sentence Prediction and Mutual Information. Proceedings of the Annual Meeting of the Association for Computational Linguistics, Vol. 1 (pp 562-574)
  • Peng B, Wu C, He W, Thorne W, Villavicencio A, Wang Y & Paes A (2023) FLYPE: Multitask Prompt Tuning for Multimodal Human Understanding of Social Media. CEUR Workshop Proceedings, Vol. 3566 (pp 18-33)
  • Phelps D, Fan X-R, Gow-Smith E, Madabushi HT, Scarton C & Villavicencio A (2022) Sample Efficient Approaches for Idiomaticity Detection. Proceedings of the 18th Workshop on Multiword Expressions (MWE 2022)
  • Madabushi HT, Gow-Smith E, Garcia M, Scarton C, Idiart M & Villavicencio A (2022) SemEval-2022 Task 2: Multilingual Idiomaticity Detection and Sentence Embedding. Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022)
  • Tayyar Madabushi H, Gow-Smith E, Garcia M, Scarton C, Idiart M & Villavicencio A (2022) . Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022), July 2022 - July 2022.
  • Boito MZ, Yusuf B, Ondel L, Villavicencio A & Besacier L (2022) Unsupervised Word Segmentation from Discrete Speech Units in Low-Resource Settings. 1st Annual Meeting of the ELRA/ISCA Special Interest Group on Under-Resourced Languages, SIGUL 2022 - held in conjunction with the International Conference on Language Resources and Evaluation, LREC 2022 - Proceedings (pp 1-9)
  • Muresan S, Nakov P & Villavicencio A (2022) Message from the Program Chairs. Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp vii-xi)
  • Muresan S, Nakov P & Villavicencio A (2022) Message from the Program Chairs. Proceedings of the Annual Meeting of the Association for Computational Linguistics, Vol. 1 (pp vii-xi)
  • Phelps D, Fan XR, Gow-Smith E, Madabushi HT, Scarton C & Villavicencio A (2022) Sample Efficient Approaches for Idiomaticity Detection. LREC 2022 Workshop - Language Resources and Evaluation Conference, 18th Workshop on Multiword Expressions, MWE 2022 - Proceedings (pp 105-111)
  • Madabushi HT, Gow-Smith E, García M, Scarton C, Idiart M & Villavicencio A (2022) SemEval-2022 Task 2: Multilingual Idiomaticity Detection and Sentence Embedding. SemEval@NAACL (pp 107-121)
  • Boito MZ, Yusuf B, Ondel L, Villavicencio A & Besacier L (2021) Unsupervised Word Segmentation from Discrete Speech Units in Low-Resource Settings. Proceedings of 1st Annual Meeting of the ELRA/ISCA Special Interest Group on Under-Resourced Languages (SIGUL 2022), 24 June 2022 - 25 June 2022.
  • Garcia M, Kramer Vieira T, Scarton C, Idiart M & Villavicencio A (2021) . Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), August 2021 - August 2021.
  • Villavicencio A (2021) What if the whole is greater than the sum of the parts? Modelling Complex (Multiword) Expressions. CEUR Workshop Proceedings, Vol. 2944 (pp 1-10)
  • Hürriyetoğlu A, Tanev H, Zavarella V, Piskorski J, Yeniterzi R, Yörük E, Mutlu O, Yüret D & Villavicencio A (2021) Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE 2021): Workshop and Shared Task Report. 4th Workshop on Challenges and Applications of Automated Extraction of Socio-Political Events from Text, CASE 2021 - Proceedings (pp 1-9)
  • Tayyar Madabushi H, Gow-Smith E, Scarton C & Villavicencio A (2021) . Findings of the Association for Computational Linguistics: EMNLP 2021. Punta Cana, Dominican Republic, 7 November 2021 - 11 November 2021.
  • Madabushi HT, Gow-Smith E, Scarton C & Villavicencio A (2021) AStitchInLanguageModels: Dataset and Methods for the Exploration of Idiomaticity in Pre-Trained Language Models. EMNLP (Findings) (pp 3464-3477)
  • Gamallo P, Garcia M, Martín-Rodilla P, Pereira-Fariña M, Real L, Tonelli S, Quaresma P, Vieira R, Dias G, Oostdijk N, Villavicencio A et al (2020) Preface. CEUR Workshop Proceedings, Vol. 2693
  • Villavicencio A (2019) . Proceedings of the Joint Workshop on Multiword Expressions and WordNet (MWE-WN 2019). Florence, Italy, 2 August 2019 - 2 August 2019.
  • Villavicencio A & Bansal M (2019) Introduction. CoNLL 2019 - 23rd Conference on Computational Natural Language Learning, Proceedings of the Conference (pp iii-iv)
  • Wagner Filho JA, Wilkens R, Idiart M & Villavicencio A (2019) The BRWAC corpus: A new open resource for Brazilian Portuguese. LREC 2018 - 11th International Conference on Language Resources and Evaluation (pp 4339-4344)
  • Godard P, Boito MZ, Ondel L, Berard A, Yvon F, Villavicencio A & Besacier L (2018) . Proceedings of Interspeech 2018 (pp 2678-2682), 2 September 2018 - 6 September 2018.
  • Ramisch C, Ramisch R, Zilio L, Villavicencio A & Cordeiro S (2018) . Computational Processing of the Portuguese Language (pp 24-34). Canela, Brazil, 24 September 2018 - 26 September 2018.
  • Salle A & Villavicencio A (2018) . ACL 2018 - 56th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, Vol. 2 (pp 8-13)
  • Paula F, Wilkens R, Idiart M & Villavicencio A (2018) . Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers), 1 June 2018 - 6 June 2018.
  • Boito MZ, Berard A, Villavicencio A & Besacier L (2017) . 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 16 December 2017 - 20 December 2017.
  • Wilkens R, Zilio L, Cordeiro S, Paula FSF, Ramisch C, Idiart M & Villavicencio A (2017) LexSubNC: A dataset of lexical substitution for nominal compounds. 12th International Conference on Computational Semantics, IWCS 2017 - Short Papers
  • Cordeiro S, Ramisch C, Idiart M & Villavicencio A (2016) . Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (pp 1986-1997). Berlin, Germany, 7 August 2016 - 12 August 2016.
  • Salle A, Villavicencio A & Idiart M (2016) . Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers) (pp 419-424), 7 August 2016 - 12 August 2016.
  • Wilkens R, Idiart M & Villavicencio A (2016) Multiword expressions in child language. Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016 (pp 2307-2311)
  • Zilio L, Finatto MJB & Villavicencio A (2016) Verblexpor: A lexical resource with semantic roles for Portuguese. Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016 (pp 2656-2661)
  • Ramisch C, Cordeiro S, Zilio L, Idiart M & Villavicencio A (2016) . Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), August 2016 - August 2016.
  • Ramisch C, Cordeiro S & Villavicencio A (2016) . Proceedings of the 12th Workshop on Multiword Expressions, August 2016 - August 2016.
  • Wilkens R, Zilio L, Ferreira E & Villavicencio A (2016) (pp 333-339)
  • Cordeiro S, Ramisch C & Villavicencio A (2016) . Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016), June 2016 - June 2016.
  • Wilkens R, Zilio L, Ferreira E & Villavicencio A (2016) B2SG: A TOEFL-like task for Portuguese. Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016 (pp 3659-3662)
  • Zilio L, Wilkens R, Möllmann L, Wehrli E, Cordeiro S & Villavicencio A (2016) (pp 233-238)
  • Cordeiro S, Ramisch C & Villavicencio A (2016) Mwetoolkit+sem: Integrating word embeddings in the mwetoolkit for semantic MWE processing. Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016 (pp 1221-1225)
  • Filho JAW, Wilkens R, Zilio L, Idiart M & Villavicencio A (2016) (pp 306-318)
  • Boos RAS, Prestes KV & Villavicencio A (2014) Identification of multiword expressions in the brWaC. Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014 (pp 728-735)
  • Padró M, Idiart M, Villavicencio A & Ramisch C (2014) . Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), October 2014 - October 2014.
  • Laranjeira BR, Moreira VP, Villavicencio A, Ramisch C & Finatto MJ (2014) Comparing the quality of focused crawlers and of the translation resources obtained from them. Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014 (pp 3572-3578)
  • Padró M, Idiart M, Villavicencio A & Ramisch C (2014) Comparing similarity measures for distributional thesauri. Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014 (pp 2964-2971)
  • Villavicencio A, Idiart M, Berwick R & Malioutov I (2013) Language acquisition and probabilistic models: Keeping it simple. ACL 2013 - 51st Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, Vol. 1 (pp 1321-1330)
  • Kordoni V, Ramisch C & Villavicencio A (2013) Introduction. Proceedings of the 9th Workshop on Multiword Expressions, MWE 2013 - in conjunction with the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2013 (pp III-IV)
  • Villavicencio A, Yankama B, Idiart MAP & Berwick R (2012) A large scale annotated child language construction database. Proceedings of the 8th International Conference on Language Resources and Evaluation, LREC 2012 (pp 2370-2374)
  • Gonçalves G, Wilkens R & Villavicencio A (2011) Semi-automatic acquisition system of ontologies. CEUR Workshop Proceedings, Vol. 776 (pp 189-194)
  • Prestes K, Wilkens R, Zillio L & Villavicencio A (2011) Extraction and validation of ontologies from digital resources. CEUR Workshop Proceedings, Vol. 776 (pp 183-188)
  • Acosta OC, Villavicencio A & Moreira VP (2011) Identification and treatment of multiword expressions applied to information retrieval. Workshop on Multiword Expressions: From Parsing and Generation to the Real World, MWE 2011 at the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, ACL-HLT 2011 - Proceedings (pp 101-109)
  • De Araujo V, Ramisch C & Villavicencio A (2011) Fast and flexible MWE candidate generation with the MWE toolkit. Workshop on Multiword Expressions: From Parsing and Generation to the Real World, MWE 2011 at the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, ACL-HLT 2011 - Proceedings (pp 134-136)
  • Duran MS, Ramisch C, Aluísio SM & Villavicencio A (2011) Identifying and analyzing Brazilian Portuguese complex predicates. Workshop on Multiword Expressions: From Parsing and Generation to the Real World, MWE 2011 at the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, ACL-HLT 2011 - Proceedings (pp 74-82)
  • Kordoni V, Ramisch C & Villavicencio A (2011) Introduction. Workshop on Multiword Expressions: From Parsing and Generation to the Real World, MWE 2011 at the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, ACL-HLT 2011 - Proceedings (pp III-IV)
  • Ramisch C, de Medeiros Caseli H, Villavicencio A, Machado A & Finatto MJ (2010) (pp 65-74)
  • Wilkens R & Villavicencio A (2010) (pp 173-182)
  • Ramisch C, Villavicencio A & Boitet C (2010) Web-based and combined language models: A case study on noun compound identification. Coling 2010 - 23rd International Conference on Computational Linguistics, Proceedings of the Conference, Vol. 2 (pp 1041-1049)
  • Ramisch C, Villavicencio A & Boitet C (2010) Multiword expressions in the wild? the mwetoolkit comes in handy. Coling 2010 - 23rd International Conference on Computational Linguistics, Proceedings of the Conference, Vol. 2 (pp 57-60)
  • Wilkens R, Villavicencio A, Muller D, Wives L, De Silva F & Loh S (2010) COMUNICA - A question answering system for Brazilian Portuguese. Coling 2010 - 23rd International Conference on Computational Linguistics, Proceedings of the Conference, Vol. 2 (pp 21-24)
  • Germann DC, Villavicencio A & Siqueira M (2010) An investigation on the influence of frequency on the lexical organization of verbs. ACL 2010 - TextGraphs 2010: 2010 Workshop on Graph-Based Methods for Natural Language Processing, Proceedings of the Workshop (pp 19-23)
  • Ramisch C, Villavicencio A & Boitet C (2010) Mwetoolkit: A framework for multiword expression identification. Proceedings of the 7th International Conference on Language Resources and Evaluation, LREC 2010 (pp 662-669)
  • Germann DC, Villavicencio A & Siqueira M (2010) An investigation on the influence of frequency on the lexical organization of verbs. Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp 19-23)
  • Villavicencio A, Caseli HDM & Machado A (2009) . 2009 Seventh Brazilian Symposium in Information and Human Language Technology, 8 September 2009 - 11 September 2009.
  • (2009) . 2009 Seventh Brazilian Symposium in Information and Human Language Technology, 8 September 2009 - 11 September 2009.
  • Caseli HDM, Villavicencio A, Machado A & Finatto MJ (2009) . Proceedings of the Workshop on Multiword Expressions Identification, Interpretation, Disambiguation and Applications - MWE '09, 6 August 2009 - 6 August 2009.
  • Ramisch C, Villavicencio A, Moura L & Idiart M (2008) Picking them up and figuring them out: Verb-particle constructions, noise and idiomaticity. CoNLL 2008 - Proceedings of the Twelfth Conference on Computational Natural Language Learning (pp 49-56)
  • Ramisch C, Villavicencio A, Moura L & Idiart M (2008) . Proceedings of the Twelfth Conference on Computational Natural Language Learning - CoNLL '08, 16 August 2008 - 17 August 2008.
  • Acosta OC, Geraldo AP, Orengo VM & Villavicencio A (2008) UFRGS@CLEF2008: Indexing Multiword Expressions for Information Retrieval. CEUR Workshop Proceedings, Vol. 1174
  • Villavicencio A, Kordoni V, Zhang Y, Idiart M & Ramisch C (2007) Validation and evaluation of automatically acquired multiword expressions for grammar engineering. EMNLP-CoNLL 2007 - Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (pp 1034-1043)
  • Zhang Y, Kordoni V, Villavicencio A & Idiart M (2006) . Proceedings of the Workshop on Multiword Expressions Identifying and Exploiting Underlying Properties - MWE '06, 23 July 2006 - 23 July 2006.
  • Villavicencio A, Copestake A, Waldron B & Lambeau F (2004) . Proceedings of the Workshop on Multiword Expressions Integrating Processing - MWE '04, 26 July 2004 - 26 July 2004.
  • Villavicencio A, Baldwin T & Waldron B (2004) A multilingual database of idioms. Proceedings of the 4th International Conference on Language Resources and Evaluation, LREC 2004 (pp 1127-1130)
  • Copestake A, Lambeau F, Villavicencio A, Bond F, Baldwin T, Sag IA & Flickinger D (2002) Multiword expressions: Linguistic precision and reusability. Proceedings of the 3rd International Conference on Language Resources and Evaluation, LREC 2002 (pp 1941-1947)
  • Baldwin T & Villavicencio A (2002) Extracting the Unextractable: A Case Study on Verb-particles. Proceedings of the Annual Meeting of the Association for Computational Linguistics
  • Villavicencio A (2002) Learning to Distinguish PP Arguments from Adjuncts. Proceedings of the Annual Meeting of the Association for Computational Linguistics
  • Villavicencio A (2000) The acquisition of word order by a computational learning system. Proceedings of the 4th Conference on Computational Natural Language Learning, CoNLL 2000 and of the 2nd Learning Language in Logic Workshop, LLL 2000 - Held in cooperation with ICGI 2000 (pp 209-218)
  • Villavicencio A (1999) Representing a system of lexical types using default unification. 9th Conference of the European Chapter of the Association for Computational Linguistics, EACL 1999 (pp 261-264)
  • McFetridge P & Villavicencio A (1995) (pp 302-311)
  • Villavicencio A, Lopes JGP, Marques NMC & Villavicencio F (1995) (pp 323-332)
  • Yamaguchi A, Villavicencio A & Aletras N () An empirical study on cross-lingual vocabulary adaptation for efficient language model inference. Findings of the Association for Computational Linguistics: EMNLP 2024. Miami, Florida, 12 November 2024 - 12 November 2024.
  • Gow-Smith E, Madabushi HT, Scarton C & Villavicencio A () Improving Tokenisation by Alternative Treatment of Spaces. Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
  • Bigoulaeva I, Sachdeva RS, Madabushi HT, Villavicencio A & Gurevych I () Effective Cross-Task Transfer Learning for Explainable Natural Language Inference with T5. Proceedings of the 3rd Workshop on Figurative Language Processing
  • Boito MZ, Villavicencio A & Besacier L () . Interspeech 2019
  • Paula F, Wilkens R, Idiart M & Villavicencio A () . LatinX in AI at Neural Information Processing Systems Conference 2018
  • Zanon Boito M, Anastasopoulos A, Villavicencio A, Besacier L & Lekakou M () . The 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages
  • Garcia M, Vieira TK, Scarton C, Idiart M & Villavicencio A () Probing for idiomaticity in vector space models. Proceedings of the 16th conference of the European Chapter of the Association for Computational Linguistics
  • Boito MZ, Villavicencio A & Besacier L () Investigating Language Impact in Bilingual Approaches for Computational Language Documentation. 1st Joint Workshop of Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages (SLTU-CCURL 2020)
  • Villavicencio A, Garcia M, Idiart M, Kramer Vieira T & Scarton C () Assessing Idiomaticity Representations in Vector Models with a Noun Compound Dataset Labeled at Type and Token Levels. Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021)
  • Vickers P, Wainwright R, Tayyar Madabushi H & Villavicencio A () CogNLP-Sheffield at CMCL 2021 Shared Task: Blending Cognitively Inspired Features with Transformer-based Language Models for Predicting Eye Tracking Patterns. Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics (CMCL 2021)

Preprints

  • He W, Vieira TK, Garcia M, Scarton C, Idiart M & Villavicencio A (2024) Investigating Idiomaticity in Word Representations, arXiv.
  • Mi M, Villavicencio A & Moosavi NS (2024) , arXiv.
  • Ribeiro M, Malcorra B, Mota NB, Wilkens R, Villavicencio A, Hubner LC & Rennó-Costa C (2024) A Methodology for Explainable Large Language Models with Integrated Gradients and Linguistic Analysis in Text Classification, arXiv.
  • He W, Idiart M, Scarton C & Villavicencio A (2024) , arXiv.
  • Yamaguchi A, Villavicencio A & Aletras N (2024) , arXiv.
  • Phelps D, Pickard T, Mi M, Gow-Smith E & Villavicencio A (2024) , arXiv.
  • Knietaite A, Allsebrook A, Minkov A, Tomaszewski A, Slinko N, Johnson R, Pickard T, Phelps D & Villavicencio A (2024) , arXiv.
  • Yamaguchi A, Villavicencio A & Aletras N (2024) , arXiv.
  • Gow-Smith E, Phelps D, Madabushi HT, Scarton C & Villavicencio A (2024) Word Boundary Information Isn't Useful for Encoder Language Models, arXiv.
  • Wilkens R, Zilio L & Villavicencio A (2023) Assessing Linguistic Generalisation in Language Models: A Dataset for Brazilian Portuguese, arXiv.
  • Bigoulaeva I, Sachdeva R, Madabushi HT, Villavicencio A & Gurevych I (2022) , arXiv.
  • Gow-Smith E, Madabushi HT, Scarton C & Villavicencio A (2022) Improving Tokenisation by Alternative Treatment of Spaces, arXiv.
  • Madabushi HT, Gow-Smith E, Scarton C & Villavicencio A (2021) AStitchInLanguageModels: Dataset and Methods for the Exploration of Idiomaticity in Pre-Trained Language Models.
  • Boito MZ, Villavicencio A & Besacier L (2020) Investigating Language Impact in Bilingual Approaches for Computational Language Documentation, arXiv.
  • Boito MZ, Villavicencio A & Besacier L (2019) How Does Language Influence Documentation Workflow? Unsupervised Word Discovery Using Translations in Multiple Languages, arXiv.
  • Boito MZ, Villavicencio A & Besacier L (2019) Empirical Evaluation of Sequence-to-Sequence Models for Word Discovery in Low-resource Settings, arXiv.
  • Boito MZ, Anastasopoulos A, Lekakou M, Villavicencio A & Besacier L (2018) A small Griko-Italian speech translation corpus, arXiv.
  • Godard P, Zanon-Boito M, Ondel L, Berard A, Yvon F, Villavicencio A & Besacier L (2018) Unsupervised Word Segmentation from Speech with Attention, arXiv.
  • Boito MZ, Berard A, Villavicencio A & Besacier L (2017) Unwritten Languages Demand Attention Too! Word Discovery with Encoder-Decoder Models, arXiv.
  • Salle A, Idiart M & Villavicencio A (2016) Enhancing the LexVec Distributed Word Representation Model Using Positional Contexts and External Memory, arXiv.
  • Salle A, Idiart M & Villavicencio A (2016) Matrix Factorization using Window Sampling and Negative Sampling for Improved Word Representations, arXiv.
  • Soares F () .
Grants

Current Grants

Previous grants

  • Modelling the link between working memory and language deficits in schizophrenia, Royal Society, 12/2020 - 09/2024, £74,000, as Co-PI
  • , EPSRC, 12/2020 - 11/2024, £446,163, as PI
Professional activities and memberships

Some of her recent activities include being the PC co-chair of the Conference on Computational Natural Language Learning (CoNLL-2019), Area Chair for events such as ACL-2019, and General co-chair of PROPOR 2018.

She is a member of the advisory board of WiNLP, of the editorial board of TACL, JNLE, Journal of Language Modelling and Linguamatica, and a reviewer for various conferences, in addition to having co-chaired numerous *ACL workshops on Cognitive Aspects of Computational Language Acquisition and on Multiword Expressions. She has also co-edited special issues and books dedicated to these topics.

She is a member of the Natural Language Processing group at the University of Sheffield and of the  of the  (Brazil).