-
-
Notifications
You must be signed in to change notification settings - Fork 18.8k
Insights: pandas-dev/pandas
Overview
Could not load contribution data
Please try again later
18 Pull requests merged by 11 people
-
BUG: Add min/max methods to ArrowExtensionArray GH#61311
#61924 merged
Aug 5, 2025 -
REF: simplify mask_missing
#62049 merged
Aug 5, 2025 -
DOC: Add SSLCertVerificationError warning message for documentation b…
#62015 merged
Aug 5, 2025 -
REF: Avoid/defer
dtype=object
containers in plotting#62042 merged
Aug 5, 2025 -
API: improve dtype in df.where with EA other
#62038 merged
Aug 5, 2025 -
REF: make copy keyword in recode_for_categories keyword only
#62019 merged
Aug 4, 2025 -
API: rank with nullable dtypes preserve NA
#62043 merged
Aug 4, 2025 -
[pre-commit.ci] pre-commit autoupdate
#62041 merged
Aug 4, 2025 -
TST: nan->NA in non-construction tests
#62021 merged
Aug 4, 2025 -
EHN: return early when the result is None
#62032 merged
Aug 4, 2025 -
Bump pypa/cibuildwheel from 3.1.1 to 3.1.3
#62039 merged
Aug 4, 2025 -
TST: Speed up hypothesis and slow tests
#62028 merged
Aug 3, 2025 -
BUG: Change default of observed in Series.groupby
#62025 merged
Aug 2, 2025 -
BUG: Avoid copying categorical codes if
copy=False
#62000 merged
Aug 1, 2025 -
DOC: Improve documentation for DataFrame.__setitem__ and .loc assignment from Series
#61804 merged
Aug 1, 2025 -
DOC: rephrase CoW ChainedAssignmentError message now CoW is always enabled
#61970 merged
Jul 30, 2025 -
TST: Raise on
pytest.PytestWarning
#61996 merged
Jul 30, 2025 -
BUG/DEPR: logical operation with bool and string
#61995 merged
Jul 29, 2025
17 Pull requests opened by 15 people
-
DOC: documenting pandas.MultIndex.argsort
#62005 opened
Jul 30, 2025 -
DOC: Standardize noncompliant docstrings in pandas/io/html.py (flake8-docstrings) #61944
#62007 opened
Jul 31, 2025 -
BUG FIX: pandas.arrays.IntervalArray.overlaps() incorrectly documents that it accepts IntervalArray.
#62009 opened
Jul 31, 2025 -
BUG: Fix assert_series_equal for categoricals with nulls and check_category_order=False (#62008)
#62011 opened
Jul 31, 2025 -
Fix cbusday calendar Typecheck v2
#62014 opened
Jul 31, 2025 -
DOC: Add example for multi-column joins using `merge`
#62016 opened
Aug 1, 2025 -
BUG: Fix assert_series_equal with check_category_order=False for categoricals with nulls
#62017 opened
Aug 1, 2025 -
BUG: Fix is_signed_integer_dtype to handle abstract floating types (GH 62018)
#62020 opened
Aug 1, 2025 -
continue from #61957 which closed with unmerged commit
#62023 opened
Aug 2, 2025 -
BUG: groupby.idxmin/idxmax with all NA values should raise
#62026 opened
Aug 2, 2025 -
DOC: fix mask/where docstring alignment note (#61781)
#62029 opened
Aug 2, 2025 -
BUG: Catch TypeError in _is_dtype_type when converting abstract numpy types (#62018)
#62030 opened
Aug 3, 2025 -
API: timestamp resolution inference: default to microseconds when possible
#62031 opened
Aug 3, 2025 -
BUG: raise a proper exception when str.rsplit is passed a regex and clarify the docs
#62035 opened
Aug 3, 2025 -
API: mode.nan_is_na to consistently distinguish NaN-vs-NA
#62040 opened
Aug 4, 2025 -
DOC: updated BooleanDType docstring
#62045 opened
Aug 4, 2025 -
BUG Updated border attribute to in-line CSS
#62046 opened
Aug 5, 2025
72 Issues closed by 8 people
-
QST: How to convert large queryset into dataframes in optimized way?
#55371 closed
Aug 5, 2025 -
BUG: replace value failed
#61643 closed
Aug 5, 2025 -
DOC: Issue with the general expressiveness of the docs
#61392 closed
Aug 5, 2025 -
ENH: fillna enhancement with method='nearest'
#61124 closed
Aug 5, 2025 -
PERF: why nlargest is so slower?
#61076 closed
Aug 5, 2025 -
DOC: Pivot() example call incorrectly used and would give "error: duplicate index"
#61058 closed
Aug 5, 2025 -
BUILD: Trouble installing pandas on Windows
#60947 closed
Aug 5, 2025 -
QST: Should the absence of tzdata package affect the performance in any way ?
#61218 closed
Aug 5, 2025 -
PERF: Setting an item of incompatible dtype
#61456 closed
Aug 5, 2025 -
BUG: "python make.py html" for documentation is giving error
#60039 closed
Aug 5, 2025 -
BUG: DataFrame Dict not callabe
#60086 closed
Aug 5, 2025 -
ENH: Restore the functionality of `.fillna`
#59831 closed
Aug 5, 2025 -
ENH: Add a clear option to interpret strings as Pandas dtypes specifically.
#59777 closed
Aug 5, 2025 -
BUG: QST: `pd.read_html` gives tables with duplicated columns
#56337 closed
Aug 5, 2025 -
BUG: memory leak when using read_csv with python engine
#59694 closed
Aug 5, 2025 -
Bug in Chunk Processing: Non-NULL IDs Become NULL During IterationBUG:
#59477 closed
Aug 5, 2025 -
BUG: ArrowNotImplementedError: Unsupported cast from int64 to null using function cast_null
#59497 closed
Aug 5, 2025 -
BUG: import has error
#59527 closed
Aug 5, 2025 -
Title: Feature Request: Improve diff Function to Support Forward and Backward CompletionENH:
#59465 closed
Aug 5, 2025 -
BUG: diff method incorrect on datetime64[s]
#59650 closed
Aug 5, 2025 -
ENH: Make merge_asof preserve the index
#60919 closed
Aug 5, 2025 -
BUG: Memory leak when creating a df inside a loop
#60897 closed
Aug 5, 2025 -
BUG: NameError: name 'pa' is not defined despite `pyarrow` is installed
#60573 closed
Aug 5, 2025 -
PERF: index.unique much slower than get_level_values.drop_duplicates
#60213 closed
Aug 5, 2025 -
BUG: reading long csv with high numeric values displays a "mixed types" message
#60082 closed
Aug 5, 2025 -
BUG: Warning when compiling pandas/_libs/algos.c
#58829 closed
Aug 5, 2025 -
ENH: In pandas.testing.assert_frame_equal, support per-column configuration
#59548 closed
Aug 5, 2025 -
'base' argument when resampling has no effect
#22855 closed
Aug 5, 2025 -
ENH: PyPy Support
#42509 closed
Aug 5, 2025 -
ENH: Allow storing timezone-aware datetimes in a series with a datetime64 dtype
#46998 closed
Aug 5, 2025 -
Decrypting file in read_csv
#44097 closed
Aug 5, 2025 -
BUG: StringArray is a subclass of PandasArray
#48638 closed
Aug 5, 2025 -
BUG: 2.2.2 pep517 build fails
#58274 closed
Aug 5, 2025 -
ENH: Exp function for Series and DataFrame
#56748 closed
Aug 5, 2025 -
ENH: Add first_inverted and last_inverted options to keep in DataFrame.duplicated
#60394 closed
Aug 5, 2025 -
BUG: Timestamp.tz and DatetimeIndex.tz are inconsistent when pytz 2024.2 is installed
#59833 closed
Aug 5, 2025 -
BUG: "Python int too large" in maybe_convert_objects with numpy 1.26
#60023 closed
Aug 5, 2025 -
ENH: A .chi2() method on the DataFrame and Series class that will resemble the .corr() methods
#60111 closed
Aug 5, 2025 -
ENH: Decimal year
#60391 closed
Aug 5, 2025 -
PERF: Extreme regression in 2.2 when aggregating a DataFrame with .attrs data
#60070 closed
Aug 5, 2025 -
ENH: Make pd.Timestamp.astimezone() default to local timezone
#60692 closed
Aug 5, 2025 -
ENH: Automate reading of data in chunks
#61110 closed
Aug 5, 2025 -
ENH: The method of obtaining a certain cell or slice of the dataframe is confusing and unclear
#61232 closed
Aug 5, 2025 -
ENH: read_csv tz option
#61535 closed
Aug 5, 2025 -
ENH: New method "ends" as a combination of “head” and "tail"
#61860 closed
Aug 5, 2025 -
ENH: Images embedded in cells. The DISPIMG function of WPS
#61888 closed
Aug 5, 2025 -
ENH: Add a safe Option to hash_pandas_object with Default Value Set to True
#60428 closed
Aug 5, 2025 -
BUG: alpine 3.21 install issue
#60554 closed
Aug 5, 2025 -
ENH: Add an iterdicts() function
#60901 closed
Aug 5, 2025 -
BUG: Pandas resets counter when using filterwarning "once"
#60664 closed
Aug 5, 2025 -
ENH: generic `save` and `read` methods for DataFrame
#60786 closed
Aug 5, 2025 -
ENH: Add a function like PYQT signal
#61829 closed
Aug 5, 2025 -
BUG: 'Sphinx parallel build error' when building docs locally prevents index.html creation
#61975 closed
Aug 5, 2025 -
DOC: Series and DataFrame.reindex accepts Timedelta as tolerance, which is not documented
#62010 closed
Aug 5, 2025 -
QST: best way to extend/subclass pandas.DataFrame
#61362 closed
Aug 5, 2025 -
ENH: Add a name parameter to value_counts()
#55121 closed
Aug 5, 2025 -
BUG: Can't convert `int64[pyarrow]` series with missing values to legacy numpy `float` series
#57093 closed
Aug 5, 2025 -
issubdtype(<categorical>, np.bool_) raises error
#9581 closed
Aug 2, 2025 -
API: Expand read_csv dtype for categoricals
#14503 closed
Aug 2, 2025 -
Dask shuffle performance help
#43155 closed
Aug 2, 2025 -
ENH: to_offset "year"/"quarter"/"month"/"day"/...?
#34586 closed
Aug 2, 2025 -
inf comparison doesn't emulate NaN with use_inf_as_null
#16756 closed
Aug 1, 2025 -
DOC: Improve documentation for DataFrame.__setitem__ and .loc assignment from Series
#61662 closed
Aug 1, 2025 -
BUG: Inconsistent `datetime` dtype based on how the dataframe gets initialized
#61993 closed
Jul 31, 2025 -
BUG: the behavior of DataFrameGroupBy.apply(..., include_groups=True) breaks post-mortem debugging
#61627 closed
Jul 31, 2025 -
BUG: Writing UUIDs fail
#61602 closed
Jul 31, 2025 -
DOC:
#55125 closed
Jul 30, 2025 -
BUG: Python Package fails to load for some users, but not others.
#61991 closed
Jul 30, 2025 -
BUG (string dtype): logical operation with bool and string failing
#60234 closed
Jul 29, 2025
14 Issues opened by 10 people
-
BUG: ArrowEA.astype to categorical returning all-NaT
#62051 opened
Aug 5, 2025 -
ENH: String functions for df.aggregate()
#62050 opened
Aug 5, 2025 -
API: Series[Float64] == False
#62048 opened
Aug 5, 2025 -
BUG: failing when groupby on data containing bytes
#62047 opened
Aug 5, 2025 -
BUG: rank with object dtype and small values
#62036 opened
Aug 3, 2025 -
BUG: Wrong inferred type in case of a mixture of boolean, float and integers
#62033 opened
Aug 3, 2025 -
BUG: NaN vs NA in mixed reduction
#62024 opened
Aug 2, 2025 -
DEPR: convert_dtypes keywords
#62022 opened
Aug 2, 2025 -
BUG: pd.api.types.is_signed_integer_dtype(np.floating) throws TypeError
#62018 opened
Aug 1, 2025 -
BUG: Sql select from database type cast issue
#62013 opened
Jul 31, 2025 -
BUG: `assert_series_equal` broken with `check_category_order=False` for arrays with null values
#62008 opened
Jul 31, 2025 -
BUG: `IntervalArray.overlaps()` documents that it accepts another `IntervalArray`, but it is not implemented
#62004 opened
Jul 30, 2025 -
ENH: error messages for missing performance dependencies should point out the options
#62001 opened
Jul 30, 2025 -
DOC: documenting pandas.MultIndex.argsort
#61998 opened
Jul 29, 2025
78 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
API: offsets.Day is always calendar-day
#61985 commented on
Aug 5, 2025 • 22 new comments -
BUG: Fix ExtensionArray binary op protocol
#61990 commented on
Jul 31, 2025 • 10 new comments -
BUG: Fix Series.str.contains with compiled regex on Arrow string dtype (#61942)
#61946 commented on
Jul 30, 2025 • 10 new comments -
BUG FIX: Using Series.str.fullmatch() and Series.str.match() with a compiled regex fails with arrow strings
#61964 commented on
Jul 30, 2025 • 5 new comments -
ENH: usecols takes input order for read_csv implementation review
#61967 commented on
Aug 5, 2025 • 4 new comments -
BUG: IntervalIndex.unique() only contains the first interval if all interval borders are negative
#61920 commented on
Aug 1, 2025 • 4 new comments -
BUG: Add fillna at the beginning of _where not to fill NA. #60729
#60772 commented on
Jul 30, 2025 • 3 new comments -
ENH: Implement PDEP-17
#61468 commented on
Aug 4, 2025 • 2 new comments -
BUG: Fix Series.reindex losing values when reindexing to MultiIndex
#61969 commented on
Aug 1, 2025 • 1 new comment -
dtype-unaware (empty) objects ("any" dtype)
#19647 commented on
Aug 4, 2025 • 0 new comments -
BUG/API: preserve dtype in Index `append()`
#60159 commented on
Aug 4, 2025 • 0 new comments -
BUG: Empty list passed to `Series` returns `object` dtype, but via `DataFrame` returns `float64`
#56679 commented on
Aug 4, 2025 • 0 new comments -
BUG: Different initialization methods lead to different dtypes (DataFrame)
#42971 commented on
Aug 4, 2025 • 0 new comments -
Dtype inconsistency when appending to empty dataframe
#22621 commented on
Aug 4, 2025 • 0 new comments -
DOC: Fix docstring validation errors for pandas.core.groupby
#60365 commented on
Aug 4, 2025 • 0 new comments -
DOC: Sort the pandas API reference navbar in alphabetical order
#59164 commented on
Aug 4, 2025 • 0 new comments -
Understanding string interning in pd.read_csv vs. other methods of creating large object columns
#52639 commented on
Aug 5, 2025 • 0 new comments -
ENH: Implement `skiptrailingspace` parameter for `read_csv`
#60813 commented on
Aug 5, 2025 • 0 new comments -
Follow-up on basic FloatingArray implementation
#38110 commented on
Aug 5, 2025 • 0 new comments -
Should IntegerArray provide data / mask through an API?
#34873 commented on
Aug 5, 2025 • 0 new comments -
ENH: Support for list of intervals / IntervalArray in to_parquet
#34644 commented on
Aug 5, 2025 • 0 new comments -
ENH: unset_index method
#60869 commented on
Aug 5, 2025 • 0 new comments -
DOC: use shared_docs for isna/notna & isnull/notnull
#17096 commented on
Aug 5, 2025 • 0 new comments -
REF: dtype_backend argument in sql module mixes lib.no_default and numpy
#60326 commented on
Aug 5, 2025 • 0 new comments -
BUG: read_csv loses precision when `engine='pyarrow'` and dtype Int64
#56136 commented on
Aug 5, 2025 • 0 new comments -
Add `strict_na` keyword to the assert_.._equal methods for object dtype to help with deprecation
#58072 commented on
Aug 5, 2025 • 0 new comments -
ENH: support reading directory in read_csv
#61275 commented on
Aug 2, 2025 • 0 new comments -
BUG: fix .str.isdigit to honor unicode superscript for older pyarrow
#61962 commented on
Jul 29, 2025 • 0 new comments -
Implemented NumbaExecutionEngine
#61487 commented on
Jul 31, 2025 • 0 new comments -
ENH: Add `force_suffixes` boolean argument to `pd.merge`
#61498 commented on
Aug 2, 2025 • 0 new comments -
BUG: Fix GroupBy aggregate coersion of outputs inconsistency for pyarrow dtypes
#61640 commented on
Aug 2, 2025 • 0 new comments -
API: consistent NaN treatment for pyarrow dtypes
#61732 commented on
Aug 2, 2025 • 0 new comments -
BUG: fillna with DataFrame input should preserve dtype when possible
#61742 commented on
Jul 31, 2025 • 0 new comments -
DOC: Clarify broadcasting behavior when using lists in DataFrame arithmetic (GH18857)
#61820 commented on
Jul 30, 2025 • 0 new comments -
DOC: mention .attrs are preserved in Parquet IO for pyarrow engine
#61912 commented on
Jul 31, 2025 • 0 new comments -
DOC: added button to edit on GitHub
#61956 commented on
Aug 4, 2025 • 0 new comments -
CoW: add readonly flag to ExtensionArrays, return read-only EA/ndarray in .array/EA.to_numpy()
#61925 commented on
Aug 3, 2025 • 0 new comments -
Updating error message to include more details about dataframe
#61929 commented on
Jul 30, 2025 • 0 new comments -
TST: run python-dev CI on 3.14-dev
#61950 commented on
Aug 3, 2025 • 0 new comments -
BUG: Instantiating pandas._libs.parsers.TextReader segfaults
#53131 commented on
Aug 4, 2025 • 0 new comments -
ENH: convert masked arrays for Series
#20427 commented on
Aug 1, 2025 • 0 new comments -
Read_csv leaks memory when used in multiple threads
#19941 commented on
Aug 1, 2025 • 0 new comments -
BUG: assert_frame_equal(check_dtype=False) fails when comparing two DFs containing pd.NA that only differ in dtype (object vs Int32)
#61473 commented on
Aug 1, 2025 • 0 new comments -
DOC: Docstring for BooleanDType inconsistent with the rest of repo
#61939 commented on
Aug 1, 2025 • 0 new comments -
BUG: read_sql no longer works simply with SqlAlchemy selectables and a quick fix
#35484 commented on
Jul 31, 2025 • 0 new comments -
BUG: Merge fails on pyarrow datetime columns
#61926 commented on
Jul 31, 2025 • 0 new comments -
DOC: warn about apply with raw=True, if function returns Optional[int]
#61632 commented on
Jul 31, 2025 • 0 new comments -
BUG: series.groupby(level=0) and series.groupby(pd.Grouper(level=0)) produce different orders
#61943 commented on
Jul 30, 2025 • 0 new comments -
BUG: The lines skipped by pd.read_csv(path, chunksize=n, on_bad_lines='skip') depends on `n`.
#61973 commented on
Jul 30, 2025 • 0 new comments -
DOC: add button to edit on GitHub
#39859 commented on
Jul 30, 2025 • 0 new comments -
ENH: error message context.
#61834 commented on
Jul 30, 2025 • 0 new comments -
FEEDBACK: PyArrow as a required dependency and PyArrow backed strings
#54466 commented on
Jul 30, 2025 • 0 new comments -
BUG: Inconsistent .values NA/NaN
#61856 commented on
Jul 30, 2025 • 0 new comments -
BUG: CustomBusinessDay not respecting calendar
#60647 commented on
Jul 30, 2025 • 0 new comments -
BUG: Series.str.isdigit with pyarrow dtype doesn't honor unicode superscripts
#61466 commented on
Jul 30, 2025 • 0 new comments -
DOC: Point out difference in usage of "str" dtype in constructor and astype member
#61992 commented on
Jul 30, 2025 • 0 new comments -
BUG: resampling with origin='end_day' raises ValueError: Values falls before first bin
#44957 commented on
Jul 30, 2025 • 0 new comments -
BUG: Operations not implemented for non-1D ExtensionArrays
#61866 commented on
Jul 30, 2025 • 0 new comments -
TRACKER: new default String dtype (pyarrow-backed, numpy NaN semantics)
#54792 commented on
Jul 29, 2025 • 0 new comments -
BUG: Python 3.14 may not increment refcount
#61368 commented on
Aug 3, 2025 • 0 new comments -
API: resolution for date_range, to_datetime, timedelta_range, to_timedelta
#49060 commented on
Aug 3, 2025 • 0 new comments -
BUG: pd.Timestamp() defaults to [s] resolution instead of [ns]
#52653 commented on
Aug 3, 2025 • 0 new comments -
BUG/API: round-tripping non-nano datetime64s with to_json/read_json
#55827 commented on
Aug 3, 2025 • 0 new comments -
API: timestamp resolution inference - default to one unit (if possible) instead of being data-dependent?
#58989 commented on
Aug 3, 2025 • 0 new comments -
BUG: groupby.idxmin/idxmax will all NA values
#57745 commented on
Aug 2, 2025 • 0 new comments -
BUG: ValueError: Buffer dtype mismatch, expected 'intp_t' but got 'long long' on ARMv7 32 bit
#55305 commented on
Aug 2, 2025 • 0 new comments -
BUG: Implicit upcast to float64 for large series (more than 1000000 rows)
#61951 commented on
Aug 2, 2025 • 0 new comments -
Add axis argument to DataFrame.corr
#35002 commented on
Aug 2, 2025 • 0 new comments -
ENH: .interval accessor
#16401 commented on
Aug 2, 2025 • 0 new comments -
Groupby.mode() - feature request
#19254 commented on
Aug 2, 2025 • 0 new comments -
ENH: Add Support for GroupBy Numeric Operations
#20060 commented on
Aug 2, 2025 • 0 new comments -
ENH: Implementing NEP 18's __array_function__
#26380 commented on
Aug 2, 2025 • 0 new comments -
Idea: Make pd.Index.values not writable
#33001 commented on
Aug 2, 2025 • 0 new comments -
QST: "Dummy" is rooted in ableist language
#35724 commented on
Aug 2, 2025 • 0 new comments -
ENH: Include df.attrs in to_csv output
#53577 commented on
Aug 2, 2025 • 0 new comments -
ENH: Disallow duplicate column names everywhere by default
#53217 commented on
Aug 2, 2025 • 0 new comments -
ENH: Include `df.attrs` in `to_json` output
#51012 commented on
Aug 2, 2025 • 0 new comments -
BUG: Read hdfs:/// is broken since 2.1.0
#58078 commented on
Aug 1, 2025 • 0 new comments