Skip to content

Add Training Script for Model Fine-tuning #3

@noor05-creator

Description

@noor05-creator

Problem

The repository currently only contains pre-trained LoRA adapter weights but no training code. This makes it impossible to reproduce the reported 41% GSM8K accuracy or experiment with different configurations.

Proposed Solution

I would like to contribute a complete training script (train.py) that implements:

  • QLoRA fine-tuning on GSM8K dataset
  • 4-bit quantization for memory efficiency
  • Works on free Colab T4 GPU
  • Matches existing adapter_config.json parameters
  • Configurable command-line arguments

What I'll Add

  • train.py - Complete training pipeline with QLoRA

Expected Outcome

Users can reproduce the 41% accuracy in ~1.5 hours on T4 GPU.

I would like to work on this under OScG'26.

@AshChadha-iitg Please assign this issue to me. Thank you!

Metadata

Metadata

Labels

No labels
No labels

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions