DB Diagram to JSON

Why This Project?

While experimenting with Qwen2.5-VL across various vision-language tasks, I discovered that the base model struggled significantly with structured diagram interpretation. When I fed it ER diagrams and database schemas, the results were inconsistent and often incomplete - missing tables, incorrectly identifying relationships, or producing malformed JSON outputs.

Let's look at the overview of the qwen model.

Model Architecture Overview

Before diving into the solution, let's peek under the hood of what we're working with:

The architecture is quite complex:

A Vision Encoder that "sees" the diagram
A Vision-Language Adapter that translates visual patterns into something the language model understands
The Qwen2.5 transformer backbone that generates the structured output
All working together through multi-modal fusion

Onto the finetuning part

The strategy

I could have gone for full model fine-tuning, but that would be like using a sledgehammer to crack a nut. Instead, I opted for LoRA (Low-Rank Adaptation) – a clever technique that adds small, trainable modules to the existing model. It's efficient, effective, and doesn't require a supercomputer.

My configuration:

Target Modules: The attention layers (q_proj, v_proj, k_proj, o_proj)
LoRA rank: 16 (a sweet spot between performance and efficiency)
LoRA alpha: 32 (for stable training)

The Training Process

I monitored everything using Comet ML.

You can find the complete training process in my finetuning.ipynb notebook if you want to replicate the results.

The Results

The fine-tuned model shows significant improvements over the base model:

Metric	Base Qwen2.5-VL	Fine-tuned Model
Table Detection Accuracy	0.0%	89.2%
Relationship Accuracy	0%	90%

Nearly 90% accurate across the board.
The fine-tuned model achieved significantly higher accuracy in identifying the number of tables (approximately 89.2%) and relationships (90.0%) compared to the base model.
The base model was unable to produce valid output for this task, resulting in 0.0% accuracy for both table and relationship counts on the benchmark subset.

Comparision between Base Model vs Fine-tuned Model

Our input:

Base Model Output

{
  "tables": [
    {
      "name": "grades",
      "columns": [
        { "name": "id", "type": "INTEGER", "is_pk": true },
        { "name": "salary", "type": "DECIMAL_12_2" },
        { "name": "quantity", "type": "INTEGER" },
        { "name": "phone", "type": "VARCHAR_20" },
        { "name": "name", "type": "VARCHAR_255", "is_not_null": true },
        { "name": "date_of_birth", "type": "DATE" },
        { "name": "rating", "type": "FLOAT" },
        { "name": "enrollment_id", "type": "INTEGER", "is_not_null": true }
      ],
      "primary_key": "id"
    },
    {
      "name": "enrollments",
      "columns": [
        { "name": "id", "type": "INTEGER", "is_pk": true },
        { "name": "updated_at", "type": "TIMESTAMP" },
        { "name": "date_of_birth", "type": "DATE" },
        { "name": "description", "type": "TEXT" },
        { "name": "email", "type": "VARCHAR_255", "is_unique": true },
        { "name": "price", "type": "DECIMAL_10_2" },
        { "name": "is_active", "type": "BOOLEAN" },
        { "name": "student_id", "type": "INTEGER", "is_not_null": true },
        { "name": "course_id", "type": "INTEGER", "is_not_null": true }
      ],
      "foreign_keys": [
        { "table": "courses", "column": "course_id", "foreign_table": "enrollments", "foreign_column": "id" },
        { "table": "students", "column": "student_id", "foreign_table": "enrollments", "foreign_column": "id" }
      ]
    },
    {
      "name": "students",
      "columns": [
        { "name": "id", "type": "INTEGER", "is_pk": true },
        { "name": "date_of_birth", "type": "DATE" },
        { "name": "email", "type": "VARCHAR_255", "is_unique": true },
        { "name": "is_active", "type": "BOOLEAN" },
        { "name": "rating", "type": "FLOAT" },
        { "name": "name", "type": "VARCHAR_255", "is_not_null": true },
        { "name": "updated_at", "type": "TIMESTAMP" },
        { "name": "address", "type": "TEXT" }
      ],
      "primary_key": "id"
    },
    {
      "name": "courses",
      "columns": [
        { "name": "id", "type": "INTEGER", "is_pk": true },
        { "name": "created_at", "type": "TIMESTAMP" },
        { "name": "address", "type": "TEXT" },
        { "name": "salary", "type": "DECIMAL_12_2" },
        { "name": "price", "type": "DECIMAL_10_2" },
        { "name": "name", "type": "VARCHAR_255", "is_not_null": true },
        { "name": "email", "type": "VARCHAR_255", "is_unique": true },
        { "name": "teacher_id", "type": "INTEGER", "is_not_null": true },
        { "name": "department_id", "type": "INTEGER", "is_not_null": true }
      ],
      "foreign_keys": [
        { "table": "teachers", "column": "teacher_id", "foreign_table": "courses", "foreign_column": "id" },
        { "table": "departments", "column": "department_id", "foreign_table": "courses", "foreign_column": "id" }
      ]
    },
    {
      "name": "teachers",
      "columns": [
        { "name": "id", "type": "INTEGER", "is_pk": true },
        { "name": "name", "type": "VARCHAR_255", "is_not_null": true },
        { "name": "description", "type": "TEXT" },
        { "name": "quantity", "type": "INTEGER" },
        { "name": "is_active", "type": "BOOLEAN" },
        { "name": "date_of_birth", "type": "DATE" },
        { "name": "address", "type": "TEXT" },
        { "name": "rating", "type": "FLOAT" }
      ]
    },
    {
      "name": "departments",
      "columns": [
        { "name": "id", "type": "INTEGER", "is_pk": true },
        { "name": "salary", "type": "DECIMAL_12_2" },
        { "name": "date_of_birth", "type": "DATE" },
        { "name": "is_active", "type": "BOOLEAN" },
        { "name": "quantity", "type": "INTEGER" }
      ]
    }
  ],
  "data_constraints": [
    { "table": "students", "column": "name", "constraint_type": "NOT_NULL" },
    { "table": "courses", "column": "name", "constraint_type": "NOT_NULL" },
    { "table": "teachers", "column": "name", "constraint_type": "NOT_NULL" },
    { "table": "departments", "column": "name", "constraint_type": "NOT_NULL" }
  ]
}

The base model produced a verbose, confused output with self-referential foreign keys and inconsistent constraint definitions. It was trying to be helpful but clearly didn't understand the visual language of database diagrams.

Fine-tuned Model Output

{
  "domain": "school",
  "source": "programmatic",
  "tables": [
    {
      "name": "students",
      "columns": [
        {
          "name": "id",
          "type": "INTEGER",
          "primary_key": true
        },
        {
          "name": "date_of_birth",
          "type": "DATE"
        },
        {
          "name": "email",
          "type": "VARCHAR(255)",
          "unique": true
        },
        {
          "name": "is_active",
          "type": "BOOLEAN",
          "default": "true"
        },
        {
          "name": "rating",
          "type": "FLOAT"
        },
        {
          "name": "name",
          "type": "VARCHAR(255)",
          "nullable": false
        },
        {
          "name": "updated_at",
          "type": "TIMESTAMP",
          "default": "CURRENT_TIMESTAMP"
        },
        {
          "name": "address",
          "type": "TEXT"
        }
      ]
    }
  ],
  "relationships": [
    {
      "from_table": "enrollments",
      "to_table": "students",
      "from_column": "student_id",
      "to_column": "id",
      "type": "many_to_one"
    },
    {
      "from_table": "courses",
      "to_table": "teachers",
      "from_column": "teacher_id",
      "to_column": "id",
      "type": "many_to_one"
    }
  ]
}

Clean. Accurate!

Differences

The first output (Base Model) provides a relatively structured JSON schema of the database, detailing tables, columns, data types, and some constraints. However, it suffers from several issues.
For example, foreign key relationships are inaccurately defined, with several self-references that misrepresent actual relational links between tables. Also, Constraints like NOT_NULL and UNIQUE are inconsistently applied.
In contrast, the second output is more logically consistent, cleanly structured, and directly usable. It embeds constraints such as primary_key, nullable, default, and unique within each column definition, making the schema easier to read and maintain. Foreign key relationships are explicitly and correctly defined under a relationships section using a normalized format, improving both accuracy and semantic transparency. Overall, the second schema is more complete, logically valid, and production-ready, making it the superior of the two.

Inference

Try the model yourself with this notebook:

Quick Start with Colab

Get the Model

The fine-tuned model is available on Hugging Face: zodiac2525/Qwen2.5-VL-Diagrams2SQL

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
configs		configs
data		data
inference		inference
notebooks		notebooks
public		public
src/data		src/data
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DB Diagram to JSON

Why This Project?

Model Architecture Overview