TST: Make test_sql.py parallelizable #60595

UmbertoFasci · 2024-12-20T19:51:17Z

closes TST: Make test_sql.py parallelizable #60378
All code checks passed.

Description:

Generated a unique UUID-based name for each hardcoded table name throughout the test cases, as per the original feature description. Test cases for connections with shared database state, such as those that use the iris or type tables defined and created in fixtures, would need more refactoring to become parallelizable, if that turns out to be necessary.

On that same note, the parallel safety of these particular connections should be further tested to see if they do require unique table names.
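
For illustration, a minimal sketch of the per-test naming idea described above. The helper name `unique_table_name` and the prefix are hypothetical and are not taken from this PR's diff; the sketch only assumes the standard-library `uuid` module.

```python
import uuid


def unique_table_name(prefix: str) -> str:
    """Return a table name that is very unlikely to collide across workers.

    Hypothetical helper for illustration, not the function added in this PR.
    """
    # uuid4().hex is 32 lowercase hex characters without dashes, so the
    # result remains a plain identifier as long as the prefix is one.
    return f"{prefix}_{uuid.uuid4().hex}"


name_a = unique_table_name("test_frame")
name_b = unique_table_name("test_frame")
```

Each call yields a different name, so two tests that previously shared a hardcoded name no longer touch the same table. One thing to watch with this approach is identifier length: the suffix adds 33 characters, and PostgreSQL truncates identifiers longer than 63 bytes, so prefixes should stay short.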

AdLThinhRose

???

pandas/tests/io/test_sql.py

UmbertoFasci · 2024-12-21T00:02:47Z

@WillAyd There seems to be an issue with the way the drop_table function is currently set up when running the future infer strings (without pyarrow) test job. Simply put, when the tests run in parallel, the cleanup operations can fail because one test process may try to inspect a table's structure right after another process has already dropped that table.

I am thinking this can be handled by simply catching the error that is raised if the table has already been dropped.
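
A rough sketch of what "catching the error" could look like, using the standard-library `sqlite3` module so it runs on its own. The function `drop_table_if_present` is a hypothetical stand-in, not the suite's actual `drop_table`, and the SQLAlchemy reflection path mentioned above would raise a different exception type.

```python
import sqlite3


def drop_table_if_present(conn: sqlite3.Connection, table_name: str) -> None:
    """Drop table_name, treating 'already gone' as success.

    Hypothetical stand-in for the test suite's drop_table helper.
    """
    # Quote the identifier and double any embedded double quotes.
    quoted = '"' + table_name.replace('"', '""') + '"'
    try:
        conn.execute(f"DROP TABLE {quoted}")
    except sqlite3.OperationalError as err:
        # sqlite3 reports a missing table as "no such table: <name>";
        # any other operational error is re-raised unchanged.
        if "no such table" not in str(err):
            raise


conn = sqlite3.connect(":memory:")
conn.execute('CREATE TABLE "t1" (a INTEGER)')
drop_table_if_present(conn, "t1")  # drops the table
drop_table_if_present(conn, "t1")  # second call is a no-op instead of an error
remaining = conn.execute(
    "SELECT name FROM sqlite_master WHERE type = 'table'"
).fetchall()
conn.close()
```

For plain SQL, `DROP TABLE IF EXISTS` gives the same effect without a `try` block. Swallowing the error also hides the underlying problem that two tests share a table, which unique names address more directly.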

mroeschke · 2024-12-21T18:49:44Z

With this change, each test should truly create and drop a unique table. There may be some routines that need to adapt to this (either drop being called more consistently or table names being passed through from the create to the drop process).

AdLThinhRose · 2024-12-21T18:54:35Z

So, taking an example, how exactly would this be made consistent in one way, when no more specific evidence can be given? (Consider this a suggestion.) Perhaps in 2 hours the same error will occur twice over. Or, at this hour, I am pinning a map out in the Pacific Ocean one more time.

WillAyd · 2024-12-27T15:59:57Z

Yeah, sorry @UmbertoFasci, I suppose this is a little more complicated than I originally envisioned. You might be able to have the individual fixtures generate both a connection and a table name instead of just a connection. That way the fixture can just drop the table at the end, if it exists.

There might be some nuance with views but I'd start with tables first and see how far you get
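
One way to picture this suggestion, as a sketch only: a generator that yields both a connection and a table name, then drops the table on exit. The name `conn_and_table` is hypothetical, and `contextlib.contextmanager` with an in-memory `sqlite3` database is used here so the example runs standalone; in the test suite the same generator body could be adapted into a pytest fixture.

```python
import contextlib
import sqlite3
import uuid


@contextlib.contextmanager
def conn_and_table(prefix: str = "test"):
    """Yield (connection, table_name) and drop the table on exit.

    Hypothetical sketch, not code from this PR.
    """
    conn = sqlite3.connect(":memory:")
    table_name = f"{prefix}_{uuid.uuid4().hex}"
    try:
        yield conn, table_name
    finally:
        # IF EXISTS keeps cleanup safe even if the test never created the table.
        conn.execute(f'DROP TABLE IF EXISTS "{table_name}"')
        conn.close()


with conn_and_table("frame") as (conn, table_name):
    conn.execute(f'CREATE TABLE "{table_name}" (a INTEGER)')
    conn.execute(f'INSERT INTO "{table_name}" VALUES (1)')
    rows = conn.execute(f'SELECT a FROM "{table_name}"').fetchall()
```

Because the name is created and dropped in the same place, the test body never needs to know how cleanup works. Views would need their own `DROP VIEW` handling, which this sketch leaves out.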

UmbertoFasci and others added 10 commits December 8, 2024 16:56

Make test_sql.py all_connectable tests parallelizable

0fe741b

Make test_sql.py sqlalchemy_connectable tests parallelizable

60f1a14

Make test_sql.py postgresql_connectable tests parallelizable

cc35414

make test_sql.py mysql_connectable tests parallelizable

ecb6c4d

Make test_sql.py sqlite_engine tests parallelizable

0c51e04

Make test_sql.py sqlite_conn tests parallelizable

17cf723

Make test_sql.py other connectable tests parallelizable

4147a77

Make test_sql.py other connectable tests parallelizable 2

5250e1b

Make test_sql.py other connectable tests parallelizable 3

6d73fa0

Merge branch 'main' into make_parallelizable

9c9f7e7

AdLThinhRose reviewed Dec 20, 2024

UmbertoFasci requested a review from AdLThinhRose December 20, 2024 19:55

WillAyd requested changes Dec 20, 2024

pandas/tests/io/test_sql.py (resolved)

pandas/tests/io/test_sql.py (outdated, resolved)

pandas/tests/io/test_sql.py (outdated, resolved)

UmbertoFasci added 2 commits December 20, 2024 14:43

Remove single CPU marker and update table name creation function name

a354c59

resolve test_api_to_sql_index_label_multiindex error

b18c79c

mroeschke removed the request for review from AdLThinhRose December 20, 2024 22:03

UmbertoFasci added 2 commits December 20, 2024 16:47

resolve tests which create table with create_and_load_postgres_datetz

2a856db

resolve regex error for test_xsqlite_if_exists

7a31438

UmbertoFasci added 3 commits December 20, 2024 18:13

resolve sqlalchemy reflection drop_table race condition

b5a60d9

resolve drop_table sqlalchemy reflection error handling

be28090

revert drop_table error handling

e62a848

rhshadrach added the Testing (pandas testing functions or related to the test suite) and IO SQL (to_sql, read_sql, read_sql_query) labels Dec 27, 2024
