Podcasts have become daily companions for half a billion users. Given the enormous amount of podcast content available, highlights provide a valuable signal that helps listeners get the gist of an episode and decide whether to invest in listening to it in its entirety. However, identifying highlights automatically is challenging due to the unstructured and long-form nature of the content. We introduce Rhapsody, a dataset of 13K podcast episodes paired with segment-level highlight scores derived from YouTube's 'most replayed' feature. We frame podcast highlight detection as a segment-level binary classification task. We explore various baseline approaches, including zero-shot prompting of language models and lightweight language models finetuned with segment-level classification heads. Our experimental results indicate that even state-of-the-art language models like GPT-4o and Gemini struggle with this task, while models finetuned with in-domain data significantly outperform their zero-shot counterparts. The finetuned model benefits from leveraging both speech signal features and transcripts. These findings highlight the challenges of fine-grained information access in long-form spoken media.
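As a concrete illustration of the segment-level setup described above, here is a minimal, hypothetical sketch of a binary classification head over per-segment embeddings; the class name, dimensions, and toy data are assumptions for illustration, not Rhapsody's released code.

```python
import torch
import torch.nn as nn

class SegmentHighlightHead(nn.Module):
    """Hypothetical head: maps one segment embedding to one highlight logit."""
    def __init__(self, hidden_dim: int = 768):
        super().__init__()
        self.classifier = nn.Linear(hidden_dim, 1)

    def forward(self, segment_embeddings: torch.Tensor) -> torch.Tensor:
        # segment_embeddings: (num_segments, hidden_dim), e.g., pooled
        # transcript and/or speech-feature representations per segment.
        return self.classifier(segment_embeddings).squeeze(-1)

head = SegmentHighlightHead()
segments = torch.randn(12, 768)             # 12 episode segments (toy data)
labels = torch.zeros(12); labels[3] = 1.0   # e.g., segment 3 is a highlight
loss = nn.BCEWithLogitsLoss()(head(segments), labels)
probs = torch.sigmoid(head(segments))       # per-segment highlight probabilities
```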
                    
                            
iDRAMA-rumble-2024: A Dataset of Podcasts from Rumble Spanning 2020 to 2022
                        
                    
    
Rumble has emerged as a prominent platform hosting controversial figures facing restrictions on YouTube. Despite this, the academic community's engagement with Rumble has been minimal. To help researchers address this gap, we introduce a comprehensive dataset of about 6.7K podcast videos from August 2020 to December 2022, amounting to over 5.6K hours of content. Besides covering metadata of these podcast videos, we provide speech-to-text transcriptions for future analysis. We also provide speaker diarization information, a collection of 168K unique representative images from podcast videos, and face embeddings of more than 400K extracted faces. With the rising influence of podcasts and populist figures, this dataset provides a rich resource for identifying challenges in cyber social threats in a relatively underexplored space.
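As a hypothetical illustration of how the released face embeddings might be used, the sketch below runs a cosine-similarity search to find recurring faces across videos; the array shapes and random stand-in data are assumptions, not the dataset's actual format.

```python
import numpy as np

# Random stand-in for the released face embeddings (the actual release
# contains 400K+ faces); real data would be loaded from disk instead.
embeddings = np.random.randn(10_000, 512).astype(np.float32)
embeddings /= np.linalg.norm(embeddings, axis=1, keepdims=True)

query = embeddings[0]              # a face of interest
scores = embeddings @ query        # cosine similarity against all faces
top10 = np.argsort(-scores)[:10]   # indices of the ten most similar faces
```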
- Award ID(s): 2046590
- PAR ID: 10586413
- Publisher / Repository: International Workshop on Cyber Social Threats 2024
- Date Published:
- Format(s): Medium: X
- Sponsoring Org: National Science Foundation
More Like this
- We address the problem of human action classification in drone videos. Due to the high cost of capturing and labeling large-scale drone videos with diverse actions, we present unsupervised and semi-supervised domain adaptation approaches that leverage both existing fully annotated action recognition datasets and unannotated (or only sparsely annotated) videos from drones. To study the emerging problem of drone-based action recognition, we create a new dataset, NEC-DRONE, containing 5,250 videos to evaluate the task. We tackle both problem settings with 1) the same and 2) different action label sets for the source (e.g., the Kinetics dataset) and target domains (drone videos). We present a combination of video- and instance-based adaptation methods, paired with either a classifier- or an embedding-based framework, to transfer knowledge from source to target. Our results show that the proposed adaptation approach substantially improves performance on these challenging and practical tasks. We further demonstrate the applicability of our method to learning cross-view action recognition on the Charades-Ego dataset, and provide qualitative analysis to understand the behavior of our approaches. (A minimal adaptation sketch appears after this list.)
- Understanding the relationship between figures and text is key to scientific document understanding. Medical figures in particular are quite complex, often consisting of several subfigures (75% of figures in our dataset), with detailed text describing their content. Previous work studying figures in scientific papers focused on classifying figure content rather than understanding how images relate to the text. To address challenges in figure retrieval and figure-to-text alignment, we introduce MedICaT, a dataset of medical images in context. MedICaT consists of 217K images from 131K open-access biomedical papers, and includes captions, inline references for 74% of figures, and manually annotated subfigures and subcaptions for a subset of figures. Using MedICaT, we introduce the task of subfigure-to-subcaption alignment in compound figures and demonstrate the utility of inline references in image-text matching. Our data and code can be accessed at https://github.com/allenai/medicat. (An alignment sketch appears after this list.)
- We present a new multimodal, context-based dataset for continuous authentication. The dataset contains 27 subjects, with an age range of [8, 72], where data has been collected across multiple sessions while the subjects watch videos meant to elicit an emotional response. Collected data includes accelerometer data, heart rate, electrodermal activity, skin temperature, and face videos. We also propose a baseline approach for fair comparisons when using the proposed dataset. The approach combines a pretrained backbone network with a supervised contrastive loss for the face videos; time-series features are extracted from the physiological signals and used for classification. On the proposed dataset, this approach achieves an average accuracy, precision, and recall of 76.59%, 88.90%, and 53.25%, respectively, on the physiological signals, and 90.39%, 98.77%, and 75.71%, respectively, on the face videos. (A sketch of the supervised contrastive loss appears after this list.)
- In this research, we take an innovative approach to the Video Corpus Visual Answer Localization (VCVAL) task using the MedVidQA dataset, extending it with causal inference for medical videos, a novel approach in this field. By leveraging the state-of-the-art GPT-4 and Gemini Pro 1.5 models, the system localizes temporal segments in videos and analyzes cause-effect relationships from subtitles to enhance medical decision-making. This work extends the MedVidQA challenge by introducing causality extraction to improve the interpretability of localized video content. Subtitles are segmented to identify causal units such as cause, effect, condition, action, and signal. Prompts guide GPT-4 and Gemini Pro 1.5 in detecting and quantifying causal structures while analyzing explicit and implicit relationships, including those spanning multiple subtitle fragments. Our results reveal that both models perform better when handling queries individually but face challenges in batch processing for both temporal localization and causality extraction, and preliminary results vary considerably across videos. The successful integration of temporal localization with causal inference can significantly improve the scalability and overall performance of medical video analysis, demonstrating how AI systems can uncover valuable insights from medical videos and drive progress in medical AI applications. (A hypothetical prompting sketch appears after this list.)
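For the drone action-recognition item above, here is a minimal sketch of embedding-level source-to-target transfer, assuming per-clip feature vectors; the alignment objective is an illustrative stand-in, not the paper's exact video- and instance-level adaptation losses.

```python
import torch
import torch.nn as nn

feat_dim, num_classes = 512, 10
classifier = nn.Linear(feat_dim, num_classes)

source_feats = torch.randn(32, feat_dim)              # labeled source clips (e.g., Kinetics)
source_labels = torch.randint(0, num_classes, (32,))
target_feats = torch.randn(32, feat_dim)              # unlabeled drone clips

# Supervised loss on the source domain.
cls_loss = nn.CrossEntropyLoss()(classifier(source_feats), source_labels)
# Align mean feature statistics across domains -- a simple stand-in for
# the paper's adaptation objectives.
align_loss = (source_feats.mean(0) - target_feats.mean(0)).pow(2).sum()
loss = cls_loss + 0.1 * align_loss
```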
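For the MedICaT item, a hedged sketch of subfigure-to-subcaption alignment via a shared embedding space (CLIP-style scoring); the function name and the use of precomputed embeddings are assumptions, not necessarily the paper's method.

```python
import torch

def align_subfigures(subfig_embs: torch.Tensor, subcap_embs: torch.Tensor) -> torch.Tensor:
    # Normalize, then score every subfigure against every subcaption.
    f = subfig_embs / subfig_embs.norm(dim=-1, keepdim=True)
    c = subcap_embs / subcap_embs.norm(dim=-1, keepdim=True)
    similarity = f @ c.T              # (num_subfigs, num_subcaps)
    return similarity.argmax(dim=-1)  # best-matching subcaption per subfigure

# Toy example: 3 subfigures scored against 4 candidate subcaptions.
matches = align_subfigures(torch.randn(3, 512), torch.randn(4, 512))
```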
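For the continuous-authentication item, a minimal sketch of the supervised contrastive loss that abstract names, applied to face embeddings; the temperature, normalization, and toy data are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def supcon_loss(features: torch.Tensor, labels: torch.Tensor, tau: float = 0.1) -> torch.Tensor:
    # features: (batch, dim) face embeddings; labels: (batch,) subject IDs.
    features = F.normalize(features, dim=1)
    sim = features @ features.T / tau
    self_mask = torch.eye(len(labels), dtype=torch.bool)
    sim = sim.masked_fill(self_mask, float("-inf"))   # drop self-pairs
    log_prob = sim - sim.logsumexp(dim=1, keepdim=True)
    log_prob = log_prob.masked_fill(self_mask, 0.0)   # avoid -inf * 0 below
    positives = (labels.unsqueeze(0) == labels.unsqueeze(1)) & ~self_mask
    # Average log-probability of same-subject pairs for each anchor.
    per_anchor = (log_prob * positives).sum(1) / positives.sum(1).clamp(min=1)
    return -per_anchor.mean()

loss = supcon_loss(torch.randn(8, 128), torch.randint(0, 3, (8,)))
```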
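For the VCVAL item, a hypothetical sketch of prompting for causal-unit extraction from a subtitle segment; the prompt wording and example subtitle are invented for illustration, and no specific model API is assumed.

```python
subtitle_segment = (
    "If the swelling persists, apply a cold compress to reduce inflammation."
)

prompt = (
    "From the subtitle below, extract causal units as JSON with the keys "
    "cause, effect, condition, action, and signal (use null when absent).\n\n"
    f"Subtitle: {subtitle_segment}"
)
# The abstract reports better results when each query is handled individually,
# so one would send `prompt` to GPT-4 or Gemini Pro 1.5 per subtitle segment.
print(prompt)
```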