Dr Diana Maynard
School of Computer Science
Senior Research Fellow
Deputy Head of the Natural Language Processing research group
d.maynard@sheffield.ac.uk
+44 114 222 1938
Regent Court (DCS)
Full contact details
Dr Diana Maynard
School of Computer Science
Regent Court (DCS)
211 Portobello
Sheffield
S1 4DP
- Research interests
- Information extraction
- GATE
- Social media analysis
- Sentiment analysis
- Online abuse and misinformation detection
- Term recognition
- Ontologies and semantic web
- Freedom of the media
- NLP for scientometrics
- Publications
Books
- The Chilling: A global study of online violence against women journalists. ICFJ.
- . Springer International Publishing.
- . Morgan & Claypool Publishers.
- Text Processing with GATE (Version 6). GATE.
- Preface.
- Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics): Preface.
- Preface.
- Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics): Preface.
Journal articles
- . Information Sciences, 647, 119446-119446.
- . Frontiers in Artificial Intelligence, 3.
- . PLoS ONE, 16(2).
- . Scientometrics.
- . Media and Communication, 8(1), 89-100.
- . Technological Forecasting and Social Change.
- Pro-Environmental Campaigns via Social Media: Analysing Awareness and Behaviour Patterns. Journal of Web Science, 3(1), 1-15.
- . Journal of Web Semantics, 44, 75-88.
- . Semantic Web, 7(4), 335-349.
- , 139-155.
- , 65-86.
- . Information Processing & Management, 51(2), 32-49.
- . Journal of Web Semantics, 24, 1-2.
- . Future Internet, 6(3), 433-456.
- . Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 7117 LNCS, 88-99.
- . Journal of Web Semantics.
- . Journal of Web Semantics, 9(3), 315.
- Automatic detection of political opinions in tweets. CEUR Workshop Proceedings, 718, 81-92.
- Using lexico-syntactic ontology design patterns for ontology creation and population. CEUR Workshop Proceedings, 516, 39-52.
- NLP-based support for ontology lifecycle development. CEUR Workshop Proceedings, 514.
- Information extraction: Algorithms and prospects in a retrieval context. Computational Linguistics, 34(2), 315-317.
- NLP techniques for term extraction and ontology population. Frontiers in Artificial Intelligence and Applications, 167(1), 107-127.
- . New Review of Hypermedia and Multimedia, 13(2), 211-237.
- Preface. IBM Syst. J., 45, 3-6.
- . Natural Language Engineering, 10(3-4), 349-373.
- . Literary and Linguistic Computing, 19(4), 509-524.
- . Journal of Natural Language Engineering, 8(2-3), 257-274.
- . Journal of Natural Language Processing, 8(1), 101-125.
- . SSRN Electronic Journal.
- . Future Internet, 6(3), 457-481.
Chapters
- , European Language Equality (pp. 127-130). Springer International Publishing
- Preface (pp. V-VII).
- Challenges in Analysing Social Media. In Dusa A, Nelle D, Stock G & Wagner G (Eds.), Facing the Future: European Research Infrastructures for the Humanities and Social Sciences. Berlin: SCIVERO Verlag.
- Natural language processing, Perspectives on Ontology Learning (pp. 51-67).
- In Weller K, Bruns A, Burgess J, Mahrt M & Puschmann C (Eds.), Twitter and Society. USA: Peter Lang.
Conference proceedings papers
- . Findings of the Association for Computational Linguistics: EMNLP 2023, December 2023.
- Development of a Benchmark Corpus to Support Entity Recognition in Job Descriptions. 2022 Language Resources and Evaluation Conference, LREC 2022 (pp 1201-1208)
- . Proceedings of the 13th International Workshop on Semantic Evaluation, June 2019.
- Using ontologies to map between research and policy data: opportunities and challenges. Proceedings of the 17th International Conference on Scientometrics & Informetrics, Vol. 1 (pp 535-540). Rome, Italy, 2 September 2019 - 5 September 2019.
- Team Bertha von Suttner at SemEval-2019 Task 4: Hyperpartisan News Detection using ELMo Sentence Representation Convolutional Network. Proceedings of the 13th International Workshop on Semantic Evaluation. Minneapolis, Minnesota, USA, 6 June 2019 - 7 June 2019.
- Exploring knowledge production in Europe. The KNOWMAK tool. 17th International Conference on Scientometrics and Informetrics, ISSI 2019 - Proceedings, Vol. 2 (pp 2561-2562)
- . Procedia Computer Science, Vol. 137 (pp 102-108), 10 September 2018 - 13 September 2018.
- . The Semantic Web – ISWC 2018, Vol. 11136 (pp 617-633). Monterey, CA, USA, 8 October 2018 - 12 October 2018.
- Helping crisis responders find the informative needle in the tweet haystack. Proceedings of the 15th ISCRAM Conference (pp 649-662). Rochester, NY, USA, 20 May 2018 - 23 May 2018.
- Twits, twats and twaddle: Trends in online abuse towards UK politicians. 12th International AAAI Conference on Web and Social Media, ICWSM 2018 (pp 600-603)
- Ontologies as bridges between data sources and user queries: the KNOWMAK project experience. Proc. of STI 2017
- . Proceedings of the 2017 EMNLP Workshop: Natural Language Processing meets Journalism, September 2017.
- Towards an Infrastructure for Understanding and Interlinking Knowledge Co-Creation in European research. Proceedings of ESWC 2017 Workshop on Scientometrics. Portoroz, 28 May 2017 - 1 June 2017.
- GATE-time: Extraction of temporal expressions and events. Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016 (pp 3702-3708)
- . Proceedings of the 8th ACM Conference on Web Science - WebSci '16, 22 May 2016 - 25 May 2016.
- Challenges of Evaluating Sentiment Analysis Tools on Social Media. Proceedings of the Tenth International Conference on Language Resources and Evaluation, 23 May 2016 - 28 May 2016.
- . Proceedings of the 10th International Conference on Ubiquitous Information Management and Communication - IMCOM '16, 4 January 2016 - 6 January 2016.
- . Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, September 2015.
- . Proceedings of the ACM Web Science Conference - WebSci '15, 28 June 2015 - 1 July 2015.
- . Proceedings of EnviroInfo and ICT for Sustainability 2015, 7 September 2015 - 9 September 2015.
- Who cares about sarcastic tweets? Investigating the impact of sarcasm on sentiment analysis. Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14). Reykjavik, 26 May 2014 - 31 May 2014.
- (pp 26-41)
- Introduction. SWAIE 2014 - 3rd Workshop on Semantic Web and Information Extraction, Proceedings of the Workshop (pp III)
- . HT 2013 - Proceedings of the 24th ACM Conference on Hypertext and Social Media (pp 21-30)
- TwitIE: An Open-Source Information Extraction Pipeline for Microblog Text. Proceedings of the International Conference on Recent Advances in Natural Language Processing
- . Procedia Computer Science, Vol. 22 (pp 231-240)
- Multimodal sentiment analysis of social media. CEUR Workshop Proceedings, Vol. 1110 (pp 47-58)
- Knowledge extraction and consolidation from social media (KECSM 2012): Preface. CEUR Workshop Proceedings, Vol. 895
- Large Scale Semantic Annotation, Indexing, and Search at The National Archives. LREC 2012 - Eighth International Conference on Language Resources and Evaluation (pp 3487-3494)
- Entity extraction and consolidation for social web content preservation. CEUR Workshop Proceedings, Vol. 912 (pp 18-29)
- Using events for content appraisal and selection in Web archives. CEUR Workshop Proceedings, Vol. 779 (pp 98-107)
- . 2009 IEEE Conference on Commerce and Enterprise Computing, CEC 2009 (pp 476-482)
- Evaluating Evaluation Metrics for Ontology-Based Applications: Infinite Reflection. LREC
- Benchmarking Textual Annotation Tools for the Semantic Web. Sixth International Conference on Language Resources and Evaluation, LREC 2008 (pp 20-25)
- Ontology-based information extraction for business intelligence. Semantic Web, Proceedings, Vol. 4825 (pp 843-856)
- Natural language technology for information integration in business intelligence. Business Information Systems, Proceedings, Vol. 4439 (pp 366-380)
- Creating tools for morphological analysis of Sumerian. Proceedings of the 5th International Conference on Language Resources and Evaluation, LREC 2006 (pp 1762-1765)
- Metrics for evaluation of ontology-based information extraction. EON 2006 - Evaluation of Ontologies for the Web: 4th International Workshop - Located at the 15th International World Wide Web Conference, WWW 2006
- Metrics for evaluation of ontology-based information extraction. CEUR Workshop Proceedings, Vol. 179
- Ontology-based information extraction for market monitoring and technology watch. CEUR Workshop Proceedings, Vol. 137 (pp 33-42)
- Extracting a domain ontology from linguistic resource based on relatedness measurements. 2005 IEEE/WIC/ACM International Conference on Web Intelligence, Proceedings (pp 345-351)
- . Data & Knowledge Engineering, Vol. 48(2) (pp 247-264)
- A lightweight approach to coreference resolution for named entities in text. Anaphora Processing, Vol. 263 (pp 97-111)
- Populating a database from parallel texts using ontology-based information extraction. Natural Language Processing and Information Systems, Vol. 3136 (pp 254-264)
- Automatic language-independent induction of gazetteer lists. Proceedings of the 4th International Conference on Language Resources and Evaluation, LREC 2004 (pp 709-712)
- Creation of reusable components and language resources for Named Entity Recognition in Russian. Proceedings of the 4th International Conference on Language Resources and Evaluation, LREC 2004 (pp 309-312)
- Using parallel texts to improve recall in botany. Recent Advances in Natural Language Processing III, Vol. 260 (pp 237-246)
- Automatic creation and monitoring of semantic metadata in a dynamic knowledge portal. Artificial Intelligence: Methodology, Systems, and Applications, Proceedings, Vol. 3192 (pp 65-74)
- . Proceedings of the HLT-NAACL 2003 workshop on Analysis of geographic references, 31 May 2003.
- Rapid customization of an information extraction system for a surprise language. ACM Trans. Asian Lang. Inf. Process., Vol. 2 (pp 295-300)
- . Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - EACL '03, 12 April 2003 - 17 April 2003.
- NE recognition without training data on a language you don’t speak. ACL Workshop on Multilingual and Mixed-language Named Entity Recognition: Combining Statistical and Symbolic Models. Sapporo, Japan
- GATE: A Unicode-based Infrastructure Supporting Multilingual Information Extraction. Proceedings of Workshop on Information Extraction for Slavonic and other Central and Eastern European Languages (IESL’03). Borovets, Bulgaria
- . Proceedings of the HLT-NAACL 2003 workshop on Software engineering and architecture of language technology systems - SEALTS '03, 31 May 2003.
- GATE: A Framework and Graphical Development Environment for Robust NLP Tools and Applications. Proceedings of the 40th Anniversary Meeting of the Association for Computational Linguistics (ACL’02). Philadelphia, USA
- . Proceedings of the ACL-02 Workshop on Automatic Summarization, 11 July 2002 - 12 July 2002.
- A framework and graphical development environment for robust NLP tools and applications. ACL (pp 168-175)
- (pp 613-625)
- A unicode-based environment for creation and use of language resources. Proceedings of the 3rd International Conference on Language Resources and Evaluation, LREC 2002 (pp 66-71)
- . Proceedings of the ACL-02 Workshop on Effective tools and methodologies for teaching natural language processing and computational linguistics, 7 July 2002.
- Extracting information for automatic indexing of multimedia material. Proceedings of the 3rd International Conference on Language Resources and Evaluation, LREC 2002 (pp 669-676)
- How feasible is the reuse of grammars for Named Entity Recognition? Proceedings of the 3rd International Conference on Language Resources and Evaluation, LREC 2002 (pp 1412-1418)
- GATE: an architecture for development of robust HLT applications. 40th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (pp 168-175)
- Developing reusable and robust language processing components for information systems using GATE. 13th International Workshop on Database and Expert Systems Applications, Proceedings (pp 223-227)
- Adapting a robust multi-genre NE system for automatic content extraction. Artificial Intelligence: Methodology, Systems and Applications, Proceedings, Vol. 2443 (pp 264-273)
- Access to multimedia information through multisource and multilanguage information extraction. Natural Language Processing and Information Systems, Vol. 2553 (pp 160-171)
- . Proceedings of the 40th Annual Meeting on Association for Computational Linguistics - ACL '02, 7 July 2002 - 12 July 2002.
- Named Entity Recognition from Diverse Text Types. Recent Advances in Natural Language Processing 2001 Conference (pp 257-274). Tzigov Chark, Bulgaria
- . Proceedings of the 18th conference on Computational linguistics, 31 July 2000 - 4 August 2000.
- Experience of using GATE for NLP R&D. Proceedings of the Workshop on Using Toolsets and Architectures To Build NLP Systems at COLING-2000. Luxembourg
- Creating and using domain-specific ontologies for terminological applications. 2nd International Conference on Language Resources and Evaluation, LREC 2000
- Comparing Topic-Aware Neural Networks for Bias Detection of News. Proceedings of ECAI 2020, 29 August 2020 - 2 September 2020.
- Combining expert knowledge with NLP for specialised applications. Proc. of 23rd International Conference on Text, Speech and Dialogue. Brno, 29 August 2020 - 2 September 2020.
- Climate Change: A Chance for Political Re-Engagement? Political Studies Association 65th Annual International Conference. Sheffield
Preprints
- , arXiv.
- Local Media and Geo-situated Responses to Brexit: A Quantitative Analysis of Twitter, News and Survey Data, arXiv.
- Helping Crisis Responders Find the Informative Needle in the Tweet Haystack, arXiv.
- Analysis of Named Entity Recognition and Linking for Tweets, arXiv.
- Online Abuse of UK MPs in 2015 and 2017: Perpetrators, Targets, and Topics.
- Research group
Member of the Natural Language Processing research group.
- Grants
Current grants
- Influencing policy work on human rights violations against journalists, Research England, 09/2024 - 06/2025, £34,667, as PI
- Toolkit for Analysing and Visualising Online Violence Against Female Journalists, EPSRC, 04/2024 - 03/2025, £45,363, as PI
- Atrium: Advancing FronTier Research In the Arts and hUManities, Horizon Europe, 01/2024 - 12/2027, £370,950, as PI
Previous grants
- RISIS2: , EC H2020, 01/2019 - 12/2022, £476,741, as Co-PI
- Visualising the environmental impacts of plant-based recipes in Europe, Research England, 12/2021 - 05/2022, £18,407, as PI
- Calculating the environmental impact of plant based recipes, Industrial, 01/2021 - 12/2021, £2,500, as PI
- Pilot project on developing and trialling a toolkit for strengthening national context monitoring of violations against journalists, Free Press, 06/2020 - 12/2020, £29,094, as Co-PI
- Pilot project on developing a database for the improved collection and systematisation of information on incidents of violations against journalists, Free Press, 04/2019 - 11/2019, £29,030, as Co-I
- The Intelligent Automation of Contract Analysis of Collateral Warranties, Innovate UK, 03/2019 - 08/2020, £114,552, as PI
- Social Understandings of Scale: The role of Print and Social Media in the EU Referendum Debate, British Academy, 01/2018 - 06/2019, £49,716, as Co-PI
- Improving the monitoring of violence against journalists, Free Press, 12/2017 - 10/2018, £26,589, as Co-I
- KNOWMAK: , EC H2020, 01/2017 - 12/2019, £196,654, as PI
- COMRADES: , EC H2020, 01/2016 - 12/2018, £257,000, as PI