Dr Xingyi Song
School of Computer Science
Lecturer in Computational Media Analysis, Natural Language Processing
Member of the Natural Language Processing research group
+44 114 222 1867
Full contact details
School of Computer Science
Regent Court (DCS)
211 Portobello
91Ö±²¥
S1 4DP
- Profile
-
Dr Xingyi Song, a Lecturer in Computational Media Analysis at the Department of Computer Science, University of 91Ö±²¥. He is a member of the Natural Language Processing group and GATE team ()
Previously he worked as a machine translation specialist at Iconic Translation Machine (2015-2016) and Research Associate for several EU funded projects such as Kconnect, Knowmak and Risis2 (from 2016-2021)) at the University of 91Ö±²¥.
He completed his MSc and PhD in Natural Language Processing group at the University of 91Ö±²¥. His research interests are in Natural Language Processing, Computational Social Science, sentiment analysis and Bio-medical text processing.
- Publications
-
Journal articles
- . PLoS ONE, 19(5).
- Examining Temporalities on Stance Detection Towards COVID-19 Vaccination. 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings, 6732-6738.
- Navigating Prompt Complexity for Zero-Shot Classification: A Study of Large Language Models in Computational Social Science. 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings, 12074-12086.
- Examining the Limitations of Computational Rumor Detection Models Trained on Static Datasets. 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings, 6739-6751.
- Large Language Models Offer an Alternative to the Traditional Approach of Topic Modelling. 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings, 10160-10171.
- . Information Sciences, 647, 119446-119446.
- . International Conference Recent Advances in Natural Language Processing, RANLP, 648-657.
- . Findings of the Association for Computational Linguistics: EMNLP 2023.
- VaxxHesitancy: A Dataset for Studying Hesitancy Towards COVID-19 Vaccination on Twitter.. CoRR, abs/2301.06660.
- Finding Already Debunked Narratives via Multistage Retrieval: Enabling Cross-Lingual, Cross-Dataset and Zero-Shot Learning.. CoRR, abs/2308.05680.
- . SN Computer Science, 4(1).
- . BMJ Open, 11(3).
- . Scientometrics.
- . Structural and Multidisciplinary Optimization.
- . BMC Medical Informatics and Decision Making, 18.
- .
- . Proceedings of the International AAAI Conference on Web and Social Media, 17, 1052-1062.
- . SSRN Electronic Journal.
- . PLOS ONE, 16(2), e0247086-e0247086.
Chapters
- , Lecture Notes in Computer Science (pp. 449-458). Springer Nature Switzerland
- , Lecture Notes in Computer Science (pp. 28-52). Springer Nature Switzerland
Conference proceedings papers
- Identifying and Aligning Medical Claims Made on Social Media with Medical Evidence. 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings (pp 8580-8593)
- Navigating Prompt Complexity for Zero-Shot Classification: A Study of Large Language Models in Computational Social Science.. LREC/COLING (pp 12074-12086)
- Large Language Models Offer an Alternative to the Traditional Approach of Topic Modelling.. LREC/COLING (pp 10160-10171)
- Examining the Limitations of Computational Rumor Detection Models Trained on Static Datasets.. LREC/COLING (pp 6739-6751)
- Examining Temporalities on Stance Detection towards COVID-19 Vaccination.. LREC/COLING (pp 6732-6738)
- . Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024), June 2024 - June 2024.
- Overview of the CLEF-2024 CheckThat! Lab Task 6 on Robustness of Credibility Assessment with Adversarial Examples (InCrediblAE). CEUR Workshop Proceedings, Vol. 3740 (pp 321-338)
- . Proceedings of the Ninth Conference on Machine Translation (pp 1004-1010), November 2024 - November 2024.
- . Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (pp 12477-12492), November 2024 - November 2024.
- GATE Teamware 2: An open-source tool for collaborative document classification annotation. EACL 2023 - 17th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of System Demonstrations (pp 145-151)
- . Proceedings of the The 17th International Workshop on Semantic Evaluation (SemEval-2023), July 2023 - July 2023.
- . International Conference Recent Advances in Natural Language Processing, RANLP (pp 666-672)
- . International Conference Recent Advances in Natural Language Processing, RANLP (pp 556-567)
- A Large-Scale Comparative Study of Accurate COVID-19 Information versus Misinformation.. CoRR, Vol. abs/2304.04811
- Classifying COVID-19 Vaccine Narratives.. RANLP (pp 648-657)
- Don't waste a single annotation: improving single-label classifiers through soft labels.. EMNLP (Findings) (pp 5347-5355)
- (pp 128-143)
- Comparative Analysis of Engagement, Themes, and Causality of Ukraine-Related Debunks and Disinformation.. SocInfo, Vol. 13618 (pp 128-143)
- . Proceedings of 24th European Conference on Artificial Intelligence (ECAI 2020), Vol. 325 (pp 2054-2061). Santiago de Compostela, Spain, 29 August 2020 - 2 September 2020.
- Using deep neural networks with intra- And inter-sentence context to classify suicidal behaviour. LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings (pp 1303-1310)
- RP-DNN: A tweet level propagation context based deep neural networks for early rumor detection in social media. LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings (pp 6094-6105)
- . Proceedings of the 13th International Workshop on Semantic Evaluation, June 2019 - June 2019.
- Team Bertha von Suttner at SemEval-2019 Task 4: Hyperpartisan News Detection using ELMo Sentence Representation Convolutional Network. Proceedings of the 13th International Workshop on Semantic Evaluation. Minneapolis, Minnesota, USA, 6 June 2019 - 7 June 2019.
- . Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (pp 900-904). Brussels, Belgium, 31 October 2018 - 4 November 2018.
- A Deep Neural Network Sentence Level Classification Method with Context Information. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018) (pp 900-904)
- . Proceedings of the 2017 EMNLP Workshop: Natural Language Processing meets Journalism, September 2017 - September 2017.
- 91Ö±²¥ Systems for the English-Romanian WMT Translation Task. Proceedings of the First Conference on Machine Translation
- Data selection for discriminative training in statistical machine translation. Proceedings of the 17th Annual Conference of the European Association for Machine Translation, EAMT 2014 (pp 45-52)
- BLEU deconstructed: Designing a Better MT Evaluation Metric. Proceedings of the 14th International Conference on Intelligent Text Processing and Computational Linguistics (CICLING)
- Regression and Ranking based Optimisation for Sentence Level Machine Translation Evaluation. Proceedings of the Sixth Workshop on Statistical Machine Translation. Edinburgh, UK
Datasets
Preprints
- Enhancing Data Quality through Simple De-duplication: Navigating Responsible Computational Social Science Research, arXiv.
- , arXiv.
- , arXiv.
- , arXiv.
- , Cold Spring Harbor Laboratory.
- , arXiv.
- , arXiv.
- , arXiv.
- , arXiv.
- , arXiv.
- , arXiv.
- , arXiv.
- , arXiv.
- , arXiv.
- , arXiv.
- , arXiv.
- , Research Square Platform LLC.
- , arXiv.
- , arXiv.
- , arXiv.
- Grants
-
ASIMOV: AI-as-a-service, Innovate UK, 01/2024 - 03/2025, £142,691, as PI.