Czech Large Language Model CSMPT-7B
Czech Large Language Model CSMPT-7B In March 2024, we publicly released the first Czech-only large language model csmpt7b. Our language model was trained on dataset collected from Czech internet, Internet Archive, and also on publicly available historical texts ranging from the year 1850 until now. The texts were transcribed using our Pero OCR system. Training […]
Czech Large Language Model CSMPT-7B Read More »