What types of AI datasets does CCC provide?

CCC provides multilingual AI datasets including conversational text data, speech data collection and transcription, parallel corpora (MTPE), domain-specific datasets, structured knowledge corpora, and scripted or synthetic datasets for AI training and evaluation.

By |January 29th, 2024|Categories: |

Which languages do you support for AI data projects?

We support Southeast Asian, Japanese, and global languages, including Tagalog, Cebuano, Indonesian, Malaysian, Japanese, Vietnamese, Thai, Tamil, Bengali, French, Italian, and Russian. We also provide rare and low-resource language support at scale for emerging markets, including Armenian, Georgian, Telugu, and more.

By |January 29th, 2024|Categories: |
Go to Top