TST: assert reading of legacy pickles against current data #61792

jorisvandenbossche · 2025-07-07T07:15:47Z

While reviewing #61770, I noticed that we didn't actually compare the read pickle data to some ground truth expected value, but just to itself (we were essentially doing assert_equal(result, result) ..), due to some accidental change in a clean-up many years ago in f2246cf)

Fixing that here by again creating the expected unpickled data with create_pickle_data() during the test run, to compare with the data from the older pickled files.

simonjayhawkins · 2025-07-07T09:07:16Z

pandas/tests/io/generate_legacy_storage_files.py

+        # "cat_onecol": DataFrame({"A": Categorical(["foo", "bar"])}),
+        "cat_onecol": DataFrame(
+            {
+                "A": Categorical.from_codes(
+                    [1, 0], categories=Index(["bar", "foo"], dtype="object")
+                )
+            }
+        ),


@jorisvandenbossche to get the old behavior here, the code changes are a bit more involved. I've not got round to reviewing all the migration guides/release notes yet. Is this included? if not, should it be?

simonjayhawkins · 2025-07-07T09:11:21Z

pandas/tests/io/test_pickle.py

+                    and legacy_version < Version("1.3.0")
+                ):
+                    # convert to wall time
+                    # (bug since pandas 2.0 that tz gets dropped for older pickle files)


is there an issue ref for this

jbrockmendel · 2025-07-10T16:47:16Z

can you merge main and see if the pyarrow decimal issue resolves itself?

github-actions · 2025-08-10T00:10:01Z

This pull request is stale because it has been open for thirty days with no activity. Please update and respond to this comment if you're still interested in working on this.

TST: assert reading of legacy pickles against current data

6d249e8

jorisvandenbossche added Testing pandas testing functions or related to the test suite IO Pickle read_pickle, to_pickle labels Jul 7, 2025

simonjayhawkins reviewed Jul 7, 2025

View reviewed changes

mroeschke added this to the 3.0 milestone Jul 7, 2025

github-actions bot added the Stale label Aug 10, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

TST: assert reading of legacy pickles against current data #61792

TST: assert reading of legacy pickles against current data #61792

Uh oh!

jorisvandenbossche commented Jul 7, 2025 •

edited

Loading

Uh oh!

simonjayhawkins Jul 7, 2025

Uh oh!

simonjayhawkins Jul 7, 2025

Uh oh!

jbrockmendel commented Jul 10, 2025

Uh oh!

github-actions bot commented Aug 10, 2025

Uh oh!

Uh oh!

Uh oh!

TST: assert reading of legacy pickles against current data #61792

Are you sure you want to change the base?

TST: assert reading of legacy pickles against current data #61792

Uh oh!

Conversation

jorisvandenbossche commented Jul 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

simonjayhawkins Jul 7, 2025

Choose a reason for hiding this comment

Uh oh!

simonjayhawkins Jul 7, 2025

Choose a reason for hiding this comment

Uh oh!

jbrockmendel commented Jul 10, 2025

Uh oh!

github-actions bot commented Aug 10, 2025

Uh oh!

Uh oh!

jorisvandenbossche commented Jul 7, 2025 •

edited

Loading