# DISTINT_open_data

**Repository Path**: distint_JNU/DISTINT_open_data

## Basic Information

- **Project Name**: DISTINT_open_data
- **Description**: The repository of open-source datasets created by the DISTINT group, JNU
- **Primary Language**: Unknown
- **License**: Apache-2.0
- **Default Branch**: main
- **Homepage**: None
- **GVP Project**: No

## Statistics

- **Stars**: 0
- **Forks**: 0
- **Created**: 2025-04-14
- **Last Updated**: 2025-04-14

## Categories & Tags

**Categories**: Uncategorized

**Tags**: None

## README

# DISTINT_open_data

A collection of open-source datasets created by the DISTINT group, JNU.

## Dataset I: Human vs. AI-translated Multilingual Corpora (HAM)

**Description**: High-quality multilingual parallel corpora (mostly public) paired with LLM-generated translations in both directions. The dataset supports tasks such as machine translation detection and translation fidelity evaluation.

**Portal**: https://github.com/wingter562/DISTINT_open_datasets-HAM.git
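
To illustrate the machine translation detection use case, here is a minimal sketch of how human and LLM translations of the same source sentence could be turned into labeled examples. The record layout (`ParallelRecord` and its field names) and the helper `to_detection_examples` are assumptions made for illustration only; the actual file format and schema of HAM are described in the portal repository and may differ.

```python
from dataclasses import dataclass
from typing import Iterable, List, Tuple


@dataclass(frozen=True)
class ParallelRecord:
    """Hypothetical record layout; the real HAM schema may differ."""
    source: str             # sentence in the source language
    human_translation: str  # human translation from the parallel corpus
    llm_translation: str    # LLM-generated translation of the same source


def to_detection_examples(
    records: Iterable[ParallelRecord],
) -> List[Tuple[str, str, int]]:
    """Flatten records into (source, translation, label) triples.

    Label 0 marks a human translation, label 1 an LLM-generated one.
    """
    examples: List[Tuple[str, str, int]] = []
    for rec in records:
        examples.append((rec.source, rec.human_translation, 0))
        examples.append((rec.source, rec.llm_translation, 1))
    return examples
```

Each source sentence yields one example per label, so the resulting set is balanced by construction, which is a convenient starting point for training or evaluating a binary detector.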