SQLAlchemy destination + duckdb_engine fails with "Schema with name "{name}" already exists!"

### dlt version

1.26.0

### Describe the problem

When using the SQLAlchemy destination to write to DuckDB (via duckdb_engine), pipeline runs fail with the error:

```
<class 'dlt.destinations.exceptions.DatabaseTransientException'>
(_duckdb.CatalogException) Catalog Error: Schema with name "myschema" already exists!
[SQL: CREATE SCHEMA myschema]
```

This appears to happen because [SqlalchemyClient.has_dataset](https://github.com/dlt-hub/dlt/blob/89d59d4470716e26b2cf114d61a8e4ada5dbe0ad/dlt/destinations/impl/sqlalchemy/db_api_client.py#L201-L204) does not account for duckdb_engine returning schema names qualified by the database that holds them (see [duckdb_engine.Dialect.get_schema_names](https://github.com/Mause/duckdb_engine/blob/414afdf19c06c3bfa4c23e9892440f9294fbc35b/duckdb_engine/__init__.py#L377-L398)). For example, if my pipeline name (and, in the repro below, the database file name) is `mydata` and my dataset name is `myschema`, duckdb_engine returns `mydata.myschema` in its list of schemas, while dlt looks only for `myschema`. As a result, dlt concludes the schema is missing and tries to create a schema that already exists, which produces the error above.
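To make the mismatch concrete, here is a minimal sketch in plain Python. It is not dlt's actual code: `schema_exists` is a hypothetical helper, and the `reported` list is an assumption based on the behavior described above (the exact strings duckdb_engine returns, including any quoting, may differ). It only illustrates why a plain membership test misses a database-qualified name, and one possible way to tolerate the prefix.

```python
def schema_exists(dataset_name: str, schema_names: list[str]) -> bool:
    """Hypothetical check that accepts bare or database-qualified schema names."""
    for name in schema_names:
        # Compare against the last dot-separated component so that
        # "mydata.myschema" matches "myschema". This assumes schema
        # names themselves contain no dots and are not quoted.
        if name == dataset_name or name.rsplit(".", 1)[-1] == dataset_name:
            return True
    return False


# Assumed shape of the list returned for a database file named mydata.db.
reported = ["mydata.main", "mydata.myschema"]

# A plain membership test does not find the schema.
naive = "myschema" in reported

# The prefix-tolerant check does.
tolerant = schema_exists("myschema", reported)

print(naive, tolerant)
```

A real fix would probably need to consider which database is the current one, since two attached databases can each hold a schema with the same name.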

### Expected behavior

dlt should detect that the schema already exists and not attempt to create it again, allowing the pipeline to complete successfully.

### Steps to reproduce

The following Python program reproduces the problem:

```python
import os
import shutil

import dlt
from sqlalchemy import create_engine

# ENGINE = "sqlite"
ENGINE = "duckdb"
PIPELINE = "mydata"
DATASET = "myschema"

if __name__ == '__main__':
    dbpath = f"{PIPELINE}.db"
    if os.path.exists(dbpath):
        os.remove(dbpath)
    if os.path.isdir(PIPELINE):
        shutil.rmtree(PIPELINE)

    engine = create_engine(f"{ENGINE}:///{dbpath}")

    pipeline = dlt.pipeline(
        destination=dlt.destinations.sqlalchemy(engine),
        pipeline_name=PIPELINE,
        dataset_name=DATASET,
        pipelines_dir=".",
    )

    info = pipeline.run([
        {"id": 1},
        {"id": 2},
        {"id": 3},
    ], table_name="numbers")

    print(info)
```

Here are the contents of my pyproject.toml:

```toml
[project]
name = "minimal_dlt_schema_problem_repro"
version = "0.1.0"
description = "Add your description here"
requires-python = ">=3.14"
dependencies = [
    "dlt[duckdb,sqlalchemy]>=1.26.0",
    "duckdb-engine>=0.17.0",
]
```

### Operating system

macOS

### Runtime environment

Local

### Python version

3.13

### dlt data source

It doesn't matter. I can reproduce this with a custom data source as well as with a static list of Python dicts.

### dlt destination

_No response_

### Other deployment details

_No response_

### Additional information

While we could sidestep this problem by using the [DuckDB destination](https://dlthub.com/docs/dlt-ecosystem/destinations/duckdb), we're using SQLAlchemy because the DuckDB destination does not support [encrypted DuckDB files](https://duckdb.org/docs/lts/sql/statements/attach#database-encryption). SQLAlchemy is our best option for supporting them, since it lets us execute the SQL needed to `ATTACH` an encrypted database on every new connection, which isn't possible with the DuckDB destination.
