Dimosthenis Antypas

I am a part-time PhD Student at Cardiff University with my main research revolving around NLP and social media. By utilising soa NLP techniques I aim to develop tools and methodologies that can help us understand and explain human behaviour in social media, as well as identify common issues that are present in them such as misinformation. I also work part-time as a Research Assistant at Cardiff University and I am a member of the Cardiff NLP team.


Education

Cardiff University

PhD in Natural Language Processing

Thesis: ”Identifying misinformation in social media.”
Utilising data extracted from social media (i.e., Twitter, Reddit) and state-of-the-art machine learning models (e.g. BERT, RoBERTa), a framework is aimed to be created that will help identify online posts that contain misinformation as well as quantify the influence they have on their respected social networks.

(part-time) January 2021 - present

Cardiff University

MSc in Artificial Intelligence

Distinction

Dissertation: ”Analysis of Misinformation in COVID-19 Tweets.”
In this project, Twitter data related to the COVID-19 pandemic were gathered and analysed with primary objective to discover patterns of misinformation in tweets and develop machine learning models (SVM, Naive Bayes, CNN) to identify ”fake news” using lexical features

September 2019 - September 2020

Cardiff University

BSc in Computer Science

Honours

Final Year Project : ”Tracking Beliefs and Trust in Social Networks.”
Simulated social networks as Belief Revision Games (BRGs) in order to study the propagation of beliefs in them. By using BRGs we are able to formalise the problem and with the combination of propositional logic and belief revision theory to study the dynamics and development of the network.The project was implemented using Java

September 2016 - August 2019

Publications

SuperTweetEval: A Challenging, Unified and Heterogeneous Benchmark for Social Media NLP Research.
Dimosthenis Antypas, Asahi Ushio, Francesco Barbieri, Leonardo Neves, Kiamehr Rezaee, Luis Espinosa-Anke, Jiaxin Pei, Jose Camacho-Collados
EMNLP 2023 Findings

Robust hate speech detection in social media: A cross-dataset empirical evaluation.
Dimosthenis Antypas and Jose Camacho-Collados
WOAH, ACL 2023

TweetNLP: Cutting-Edge Natural Language Processing for Social Media
Jose Camacho-Collados, Kiamehr Rezaee, Talayeh Riahi, Asahi Ushio, Daniel Loureiro, Dimosthenis Antypas, Joanne Boisson, Luis Espinosa-Anke, Fangyu Liu, Eugenio Martínez-Cámara, Gonzalo Medina, Thomas Buhrmann, Leonardo Neves, Francesco Barbieri
EMNLP Demo 2022

Twitter Topic Classification
Dimosthenis Antypas*, Asahi Ushio*, Jose Camacho-Collados, Leonardo Neves, Vítor Silva, Francesco Barbieri
COLING 2022

Politics and Virality in the Time of Twitter: A Large-Scale Cross-Party Sentiment Analysis in Greece, Spain and United Kingdom
Dimosthenis Antypas, Alun Preece, Jose Camacho-Collados
Online Social Networks and Media 2023

Deriving Disinformation Insights from Geolocalized Twitter Callouts
David Tuxworth, Dimosthenis Antypas, Luis Espinosa-Anke, Jose Camacho-Collados, Alun Preece, David Rogers
WIT: Workshop On Deriving Insights From User-Generated Text
KDD 2021

COVID-19 and Misinformation: A Large-Scale Lexical Analysis on Twitter
Dimosthenis Antypas, Jose Camacho-Collados, Alun Preece, David Rogers
Student Research Workshop, ACL 2021


Experience

Research Assistant

Cardiff University

• Contributed and supported research publications made within the Cardiff NLP team.
• Prepared material and presented research at meetings/events organised.
• Maintained and kept up-to-date existing NLP tools created.

(part-time) November 2022 - present

Teaching Associate

• Contributed in the delivery of lectures and tutorials for MSc and BSc modules related with Data Science & AI.
• Heavily involved in designing & creating an online course about Applications of Machine Learning.
• Supervised and guided BSc & MSc students for their dissertation projects.

(part-time) January 2021 - October 2022

Machine Learning Software Engineer

Hypercascade

• Worked as part of a small, fast paced team.
• Contributed on improving existing codebase and machine learning models.
• Utilised tools such as Python (pandas, scikit-learn, gensim), ReactJS, and GIT.
• Worked with large amount of real world data (free-form text & web crawl dumps).

September 2020 - December 2020

Research Assistant

Cardiff University

• Worked as part of a team to develop an application for Automating regulatory compliance in the construction sector.
• Conducted extensive background research on relevant scientific work and produced necessary documentation.
• Utilised ReactJS and nodeJS along with orientDB to develop a robust and easy to use web application.

June 2018 - August 2018

Skills

Programming Languages & Tools
Data Science & AI
  • scikit-learn
  • tensorflow-keras
  • pytorch
  • transformers
Open source contributions