polar
Polars
It is Rust
a fast DataFrame
library based on writing.
It has the following properties:
- multi-core utilization
- Optimize queries to reduce unnecessary work and memory allocation
- Handle datasets larger than memory
- have a consistent and predictable
API
- have a strict format (the data type should be known before running the query)
Polars
Has C、C++
the performance of , and has full control over the performance-critical parts of the query engine.
Polars
Do your best:
- Reduce redundant copies
- Efficiently traverse the memory cache
- Minimize Parallelism Contention
- Process data in fast form
- reuse memory allocation
Polars
IO can also be controlled, allowing it to save redundant copies and reduce projections
and predicates
to scan level.
Polars
Written from scratch to parallelize DataFrame
queries on .
Polars
Has lazy
and semi-lazy
schemas.
In lazy
mode, we can optimize the entire query to further improve performance and memory pressure.
Pandas
Pandas
Also DataFrame
a library for handling .
Polars
Compared Pandas
with , it has the following advantages:
- Internal implementation results in less overhead
- Default parallel processing and optimization