continued-pretraining

Here are 11 public repositories matching this topic...

This project evaluates continued pre-training of Llama 3.2 3B for the Serbian language, using a custom cloze-style benchmark. The benchmark covers grammatical, lexical, semantic, idiomatic, and factual sentence-completion tasks. The evaluation script computes model accuracy from log-likelihood scores over the candidate tokens for each masked position.

  • Updated Jun 19, 2025
  • Python
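
The log-likelihood scoring described for the cloze benchmark above could look roughly like the sketch below. This is not the repository's code: the `___` blank marker, the function names, and the toy word-level scorer are all assumptions made for illustration. A real evaluation would replace `toy_log_likelihood` with the summed token log-probabilities from the language model.

```python
import math
from typing import Callable, Sequence, Tuple

# Hypothetical item format: (sentence with a "___" blank, candidates, index of gold candidate).
ClozeItem = Tuple[str, Sequence[str], int]

def pick_choice(template: str, choices: Sequence[str],
                log_likelihood: Callable[[str], float]) -> int:
    """Return the index of the candidate whose filled-in sentence scores highest."""
    scores = [log_likelihood(template.replace("___", c)) for c in choices]
    return max(range(len(choices)), key=scores.__getitem__)

def cloze_accuracy(items: Sequence[ClozeItem],
                   log_likelihood: Callable[[str], float]) -> float:
    """Fraction of items where the top-scoring candidate is the gold one."""
    correct = sum(pick_choice(t, ch, log_likelihood) == gold for t, ch, gold in items)
    return correct / len(items)

# Toy stand-in for a model (assumption): fixed per-word log-probabilities.
TOY_LOGPROBS = {"mat": math.log(0.5), "sky": math.log(0.1)}
DEFAULT_LOGPROB = math.log(0.2)

def toy_log_likelihood(sentence: str) -> float:
    words = [w.strip(".,!?").lower() for w in sentence.split()]
    return sum(TOY_LOGPROBS.get(w, DEFAULT_LOGPROB) for w in words)

ITEMS = [
    ("The cat sat on the ___.", ["mat", "sky"], 0),
    ("Birds fly in the ___.", ["mat", "sky"], 1),
]

if __name__ == "__main__":
    print(cloze_accuracy(ITEMS, toy_log_likelihood))
```

The toy scorer always prefers "mat", so it gets the first item right and the second wrong. Summed log-likelihood also tends to favor shorter candidates when choices span different numbers of tokens, so a length-normalized score is a common alternative.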

Controlled study: does continued pre-training on SEC 10-K filings help downstream financial QA? A clean negative result on a fair evaluation instrument. Qwen2.5-3B; FinQA/TAT-QA; CPT (LoRA and full-parameter), SFT, DPO.

  • Updated Jun 18, 2026
  • Python
