MultiMed-ST

Published in Hugging Face, 2025

This dataset is an extended version of leduckhai/MultiMed.

Dataset Details

Dataset Description

  • Languages (NLP): Vietnamese (vn), English (en), Traditional Chinese (zh_TW), Simplified Chinese (zh_CN), German (de), French (fr)

Dataset Sources

Dataset Creation

Who are the developers?

Bui Nguyen Kim Hai (Eötvös Loránd University, Hungary) -- Main Developer
Bach Phan Tat (KU Leuven, Belgium) -- Advisor
Khai Le-Duc (University of Toronto, Canada) -- Advisor

Who are the annotators?

Thanh-Thuy Nguyen (HCMC Open University, Vietnam)
Ly Nguyen (IÉSEG School of Management, France)
Tuan-Minh Phan (Technical University Dortmund, Germany)
Phuong Tran (University of Hertfordshire, UK)

Dataset Card Contact / Dataset Card Author

Bui Nguyen Kim Hai
Eötvös Loránd University, Hungary
Email: htlulem185@gmail.com