Dataframe doctests in the main branch are taking very long to run (over 60 seconds) #5347

@iajoiner

Describe the bug
Doctests in dataframe.rs take very long to run on the main branch. Moreover, the tests likely consume so many resources that it is hard even to submit this issue or switch to another tab while they are running.

test src/dataframe.rs - dataframe::DataFrame (line 62) has been running for over 60 seconds
test src/dataframe.rs - dataframe::DataFrame::aggregate (line 189) has been running for over 60 seconds
test src/dataframe.rs - dataframe::DataFrame::cache (line 860) has been running for over 60 seconds
test src/dataframe.rs - dataframe::DataFrame::collect (line 471) has been running for over 60 seconds
test src/dataframe.rs - dataframe::DataFrame::collect_partitioned (line 550) has been running for over 60 seconds
test src/dataframe.rs - dataframe::DataFrame::count (line 438) has been running for over 60 seconds
test src/dataframe.rs - dataframe::DataFrame::distinct (line 287) has been running for over 60 seconds
test src/dataframe.rs - dataframe::DataFrame::except (line 719) has been running for over 60 seconds
test src/dataframe.rs - dataframe::DataFrame::execute_stream (line 530) has been running for over 60 seconds
test src/dataframe.rs - dataframe::DataFrame::execute_stream_partitioned (line 569) has been running for over 60 seconds
test src/dataframe.rs - dataframe::DataFrame::explain (line 658) has been running for over 60 seconds
test src/dataframe.rs - dataframe::DataFrame::filter (line 169) has been running for over 60 seconds
test src/dataframe.rs - dataframe::DataFrame::intersect (line 696) has been running for over 60 seconds
test src/dataframe.rs - dataframe::DataFrame::join (line 330) has been running for over 60 seconds
test src/dataframe.rs - dataframe::DataFrame::join_on (line 371) has been running for over 60 seconds
test src/dataframe.rs - dataframe::DataFrame::limit (line 221) has been running for over 60 seconds
test src/dataframe.rs - dataframe::DataFrame::registry (line 678) has been running for over 60 seconds
test src/dataframe.rs - dataframe::DataFrame::repartition (line 417) has been running for over 60 seconds
test src/dataframe.rs - dataframe::DataFrame::schema (line 590) has been running for over 60 seconds
test src/dataframe.rs - dataframe::DataFrame::select (line 124) has been running for over 60 seconds

To Reproduce
Steps to reproduce the behavior:
cargo test --doc DataFrame
Expected behavior
The tests should be faster and shouldn't cause my machine to hang.
Additional context
I'm on a fairly new and capable Ubuntu 22.04/AMD64 machine.

Labels: bug
