| License: Creative Commons Attribution Non-commercial No Derivatives 4.0 PDF - Published Version (1MB) |
- URN to cite this document:
- urn:nbn:de:bvb:355-epub-774466
- DOI to cite this document:
- 10.5283/epub.77446
Alternative links to fulltext:DOI
Abstract
This study introduces a new approach to determine whether GitHub repositories are professional or exploratory by analyzing README.md files. We crawled and manually labeled a dataset that contains over 200 repositories to evaluate various classification methods. We compared state-of-the-art Large Language Models (LLM) against traditional Natural Language Processing (NLP) techniques, including term ...

Owner only: item control page

Download Statistics