Direkt zum Inhalt

Balluff, Maximilian ; Auch, Maximilian ; Mandl, Peter ; Wolff, Christian

Lyriccovers 2.0: An Enhanced Dataset for Cover Song Analysis

Balluff, Maximilian, Auch, Maximilian , Mandl, Peter und Wolff, Christian (2024) Lyriccovers 2.0: An Enhanced Dataset for Cover Song Analysis. IADIS International Journal on WWW/Internet 22 (2), S. 75-92.

Veröffentlichungsdatum dieses Volltextes: 20 Mrz 2025 17:40
Artikel
DOI zum Zitieren dieses Dokuments: 10.5283/epub.75238


Zusammenfassung

This research offers a detailed examination of a novel dataset that collates original musical compositions alongside their derivative cover versions. Unique in its inclusion of both links to YouTube as well as and lyrical content, the dataset enlists more than 78,000 tracks, encompassing more than 24,000 cover song groupings. It stands as the most diverse compendium of cover songs currently ...

This research offers a detailed examination of a novel dataset that collates original musical compositions alongside their derivative cover versions. Unique in its inclusion of both links to YouTube as well as and lyrical content, the dataset enlists more than 78,000 tracks, encompassing more than 24,000 cover song groupings. It stands as the most diverse compendium of cover songs currently available for study. The characteristics of the LyricCovers dataset are thoroughly analyzed through its metadata, and empirical evaluations in the subsequent experimental lyrics analysis section suggest that lyrical analysis is a fundamental component in the identification and study of cover songs. This work presents a baseline approach to cover song detection, with an emphasis on lyrical content processing. It describes the extraction of lyrics from the audio files and the application of the Jina Embeddings 2 Model, fine-tuned with a hard triplet-loss objective, which successfully exploits lyric similarity to accurately identify cover songs.



Beteiligte Einrichtungen


Details

DokumentenartArtikel
Titel eines Journals oder einer ZeitschriftIADIS International Journal on WWW/Internet
Verlag:IADIS
Band:22
Nummer des Zeitschriftenheftes oder des Kapitels:2
Seitenbereich:S. 75-92
Datum2024
InstitutionenSprach- und Literatur- und Kulturwissenschaften > Institut für Information und Medien, Sprache und Kultur (I:IMSK) > Lehrstuhl für Medieninformatik (Prof. Dr. Christian Wolff)
Informatik und Data Science > Fachbereich Menschzentrierte Informatik > Lehrstuhl für Medieninformatik (Prof. Dr. Christian Wolff)
Stichwörter / KeywordsCover Song Detection, Music Information Retrieval, Dataset, Lyrics
Dewey-Dezimal-Klassifikation000 Informatik, Informationswissenschaft, allgemeine Werke > 004 Informatik
StatusVeröffentlicht
BegutachtetJa, diese Version wurde begutachtet
An der Universität Regensburg entstandenJa
URN der UB Regensburgurn:nbn:de:bvb:355-epub-752386
Dokumenten-ID75238

Bibliographische Daten exportieren

Nur für Besitzer und Autoren: Kontrollseite des Eintrags

nach oben