Add Training Script for Model Fine-tuning

## Problem
The repository currently only contains pre-trained LoRA adapter weights but no training code. This makes it impossible to reproduce the reported 41% GSM8K accuracy or experiment with different configurations.

## Proposed Solution
I would like to contribute a complete training script (train.py) that implements:
- QLoRA fine-tuning on GSM8K dataset
- 4-bit quantization for memory efficiency  
- Works on free Colab T4 GPU
- Matches existing adapter_config.json parameters
- Configurable command-line arguments

## What I'll Add 
- `train.py` - Complete training pipeline with QLoRA


## Expected Outcome
Users can reproduce the 41% accuracy in ~1.5 hours on T4 GPU.

I would like to work on this under **OScG'26**.

@AshChadha-iitg  Please assign this issue to me. Thank you!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Training Script for Model Fine-tuning #3

Problem

Proposed Solution

What I'll Add

Expected Outcome

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Add Training Script for Model Fine-tuning #3

Description

Problem

Proposed Solution

What I'll Add

Expected Outcome

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions