joaoaleite

Pinned

  1. ToLD-Br Public

    Toxic Language Detection in Social Media for Brazilian Portuguese: New Dataset and Multilingual Analysis

    Jupyter Notebook 47 9

  2. PASTEL Public

    PASTEL (Prompted weAk Supervision wiTh crEdibility signaLs) is a weakly supervised approach that leverages large language models to extract credibility signals from web content, then further combin…

    Jupyter Notebook 2

  3. euvsdisinfo Public

    This repository allows collecting the EUvsDisinfo dataset and reproducing the research experiments presented in the paper 'EUvsDisinfo: A Dataset for Multilingual Detection of Pro-Kremlin Disinform…

    Python 1 1

  4. pmd Public

    Repository for the WWW 2025 paper 'A Cross-Domain Study of The Use of Persuasion Techniques in Online Disinformation'

    Jupyter Notebook 1

  5. simple-twitter-collector Public

    A simple tweet collector wrapper built on top of tweepy and optionally integrated with Google Drive. It collects all the tweet, user, media, and location fields and dumps them to a JSON file.

    Python 7 1