Skip to content

EX3exp/Kpop-lyric-datasets

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

14 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

🎡 Kpop-lyric-datasets

There are json-formated datas of 25696 k-pop songs, which was from Melon's Monthly Chart Ranking 100 (2000 ~ 2023 Oct.).
Also providing python functions for data handling.

I DO NOT claim any ownership of this dataset, All copyrights belong to the authors, of each song.
You can freely use this dataset on RESEARCH PURPOSE, but if you want to use COMMERCIALLY this dataset, You should Talk with Lyricists, Artists, Composers, etc.

πŸ€” 1. How to Use

Clone this repository into your workspace.

git clone https://github.com/EX3exp/Kpop-lyric-datasets.git

πŸ“– 2. Usage

πŸ€” A. Getting 2023's whole data into Dictionary

from utils import data_parser
your_dict = data_parser.get_dict(2023)

πŸ€” B. Getting whole data from 2010 to 2022, into Pandas.DataFrame

from utils import data_parser
your_dataframe = data_parser.get_df(2010, 2022)

πŸ“– 3. Structure

πŸ”½ Introducing data's structure with melon-monthly_2023-07_16.json - πŸ‡ (Attention - New Jeans).

πŸ”– 0. Path of jsons

melon\monthly-chart\melon-<year>\melon-<year>-<month>\melon-monthly_<year>-<month>_<chart rank>.json

πŸ”– 1. info

Metadata of this song.

{
    "info": [
        {
            "year": 2023,
            "month": 7,
            "rank": 16,
            "type": "μ›”λ³„μ°¨νŠΈ",
            "website": "Melon"
        },
        ""
    ],

πŸ”– 2. song_id

Id of this song, which is in Melon Database.

"song_id": "35454425",

πŸ”– 3. song_name

Name of this song.

"song_name": "Attention",

πŸ”– 4. album

Album's name.

"album": "NewJeans 1st EP 'New Jeans'",

πŸ”– 5. release_date

Song's released date.

"release_date": "2022.08.01",

πŸ”– 6. artist

Artist of this song.

"artist": "NewJeans",

πŸ”– 7. genre

Genre of this song, which was specified in Melon.

"genre": "λŒ„μŠ€",

πŸ”– 8. lyric_writer

Lyricist of this song.

"lyric_writer": "Gigi",

πŸ”– 9. composer

Composer of this song.

"composer": "Duckbay (Cosmos Studios Stockholm)",

πŸ”– 10. arranger

Arranger of this song.

"arranger": "λ‹€λ‹ˆμ—˜(DANIELLE)",

πŸ”– 11. lyrics

Lyrics are separated per line.

1) row_num

Number of lyrics' rows.

 "lyrics": {
    "row_num": 79,

2) lines

Whole lyrics of this song.

    "lines": [
            "You and me",
            "λ‚΄ 맘이 보이지",
            "ν•œμ°Έμ„ 쳐닀봐",
            "κ°€κΉŒμ΄ λ‹€κ°€κ°€",
            "You see",
            "You see, ey ey ey ey",
            "",
            "One, two, three",
            "μš©κΈ°κ°€ 생겼지",
            "이미 μ•„λŠ” λ„€ 눈치",
            "고개λ₯Ό 돌렀 천천히",
            "μ—¬κΈ°",
            "You see",
            "μ—¬κΈ° λ³΄μ΄λ‹ˆ",
            "",
            "Looking for attention λ„ˆμ•Όκ² μ–΄",
            "ν™•μ‹€ν•˜κ²Œ λ‚˜λ‘œ λ§Œλ“€κ² μ–΄",
            "Stop, eyyy",
            "Drop the question",
            "Drop the, drop the question",
            "Want attention",
            "Wanna want attention",
            "",
            "You give me butterflies you know",
            "λ‚΄ λ§˜μ€ μ˜¨ν†΅ paradise",
            "κΏˆμ—μ„œ κΉ¨μ›Œμ£Όμ§€ 마",
            "",
            "You got me looking for attention",
            "You got me looking for attention",
            "가끔은 정말",
            "ν—·κ°ˆλ¦¬μ§€λ§Œ",
            "λΆ„λͺ…ν•œ 건",
            "Got me looking for attention",
            "",
            "널 μš°μ—°νžˆ 마주친 μ²™ν• λž˜",
            "λͺ» λ³Έ μ²™ μ§€λ‚˜κ°ˆλž˜",
            "You’re so fine",
            "Gotta gotta get to know ya",
            "λ‚˜μ™€ λ‚˜μ™€ κ±Έμ–΄κ°€ 쀘",
            "",
            "μ§€κΈˆ λŒμ•„μ„œλ©΄",
            "I need ya, need ya, need ya",
            "To look at me back",
            "Hey λ‹€ λ“€μΌ°μ—ˆλ‚˜",
            "널 보면 ν•˜νŠΈκ°€ νŠ€μ–΄λ‚˜μ™€",
            "",
            "λ‚œ 사탕을 μ°ΎλŠ” baby (baby)",
            "λ‚΄ λ§˜μ€ μ„€λ ˆμ΄μ§€",
            "Eyyy, drop the question",
            "Drop the, drop the question",
            "Want attention",
            "Wanna want attention",
            "",
            "You give me butterflies you know",
            "λ‚΄ λ§˜μ€ μ˜¨ν†΅ paradise",
            "κΏˆμ—μ„œ κΉ¨μ›Œμ£Όμ§€ 마",
            "",
            "You got me looking for attention",
            "You got me looking for attention",
            "가끔은 정말",
            "ν—·κ°ˆλ¦¬μ§€λ§Œ",
            "λΆ„λͺ…ν•œ 건",
            "Got me looking for attention",
            "",
            "You got me looking for attention",
            "You got me looking for attention",
            "가끔은 정말",
            "ν—·κ°ˆλ¦¬μ§€λ§Œ",
            "λΆ„λͺ…ν•œ 건",
            "Got me looking for attention",
            "",
            "A T T E N T I on",
            "Attention is what I want",
            "A T T E N T I on",
            "Attention is what I want",
            "A T T E N T I on",
            "Attention is what I want",
            "A T T E N T I on",
            "You got me looking for attention"
        ]

πŸ”– Whole Json πŸ‡

{
    "info": [
        {
            "year": 2023,
            "month": 7,
            "rank": 16,
            "type": "μ›”λ³„μ°¨νŠΈ",
            "website": "Melon"
        },
        ""
    ],
    "song_id": "35454425",
    "song_name": "Attention",
    "album": "NewJeans 1st EP 'New Jeans'",
    "release_date": "2022.08.01",
    "artist": "NewJeans",
    "genre": "λŒ„μŠ€",
    "lyric_writer": "Gigi",
    "composer": "Duckbay (Cosmos Studios Stockholm)",
    "arranger": "λ‹€λ‹ˆμ—˜(DANIELLE)",
    "lyrics": {
        "row_num": 79,
        "lines": [
            "You and me",
            "λ‚΄ 맘이 보이지",
            "ν•œμ°Έμ„ 쳐닀봐",
            "κ°€κΉŒμ΄ λ‹€κ°€κ°€",
            "You see",
            "You see, ey ey ey ey",
            "",
            "One, two, three",
            "(... etc.)"
        ]
    }
}

Releases

No releases published

Packages

No packages published

Languages