Commit c157afa
ORC-1740: Avoid the dump tool repeatedly parsing ColumnStatistics
### What changes were proposed in this pull request?
This PR aims to avoid the dump tool repeatedly parsing ColumnStatistics.
### Why are the changes needed?
`org.apache.orc.StripeStatistics#getColumnStatistics` always generates the statistics for all columns on every call. When a file has many columns and the dump tool calls this method repeatedly, parsing performance degrades.
https://github.com/apache/orc/blob/c38e20d862ce19395558e092dd42033a000fe22d/java/core/src/java/org/apache/orc/StripeStatistics.java#L57-L66
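The cost described above can be illustrated with a minimal, self-contained sketch. It does not use the ORC library and is not the patch itself: `FakeStripeStats`, `dumpNaive`, and `dumpHoisted` are hypothetical names, and the stand-in class only assumes that every `getColumnStatistics()` call rebuilds the whole array. Under that assumption, calling the getter in the loop condition and in the loop body costs 2n + 1 full rebuilds for n columns, while fetching the array once costs a single rebuild.

```java
import java.util.ArrayList;
import java.util.List;

public class HoistStatsSketch {

  /** Hypothetical stand-in for a stripe's statistics; NOT the ORC class. */
  static final class FakeStripeStats {
    private final int columnCount;
    int parseCalls = 0; // how many times the full array was rebuilt

    FakeStripeStats(int columnCount) {
      this.columnCount = columnCount;
    }

    /** Assumption: every call rebuilds the statistics for all columns. */
    String[] getColumnStatistics() {
      parseCalls++;
      String[] result = new String[columnCount];
      for (int i = 0; i < columnCount; i++) {
        result[i] = "stats for column " + i;
      }
      return result;
    }
  }

  /** Calls the getter in the loop condition and again in the loop body. */
  static List<String> dumpNaive(FakeStripeStats ss) {
    List<String> out = new ArrayList<>();
    for (int i = 0; i < ss.getColumnStatistics().length; ++i) {
      out.add("Column " + i + ": " + ss.getColumnStatistics()[i]);
    }
    return out;
  }

  /** Fetches the array once and reuses it for every column. */
  static List<String> dumpHoisted(FakeStripeStats ss) {
    List<String> out = new ArrayList<>();
    String[] columnStatistics = ss.getColumnStatistics();
    for (int i = 0; i < columnStatistics.length; ++i) {
      out.add("Column " + i + ": " + columnStatistics[i]);
    }
    return out;
  }

  public static void main(String[] args) {
    FakeStripeStats naive = new FakeStripeStats(100);
    FakeStripeStats hoisted = new FakeStripeStats(100);
    dumpNaive(naive);
    dumpHoisted(hoisted);
    System.out.println("naive rebuilds:   " + naive.parseCalls);   // 201
    System.out.println("hoisted rebuilds: " + hoisted.parseCalls); // 1
  }
}
```

Both variants produce identical output lines; only the number of times the statistics are rebuilt differs.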
### How was this patch tested?
Local testing and the existing unit tests.
### Was this patch authored or co-authored using generative AI tooling?
No
Closes #1972 from cxzl25/ORC-1740.
Authored-by: sychen <sychen@ctrip.com>
Signed-off-by: William Hyun <william@apache.org>
1 parent c38e20d commit c157afa
2 files changed
Lines changed: 6 additions & 4 deletions
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 357 | 357 | |
| 358 | 358 | |
| 359 | 359 | |
| 360 | | - |
| | 360 | + |
| | 361 | + |
| 361 | 362 | |
| 362 | | - |
| | 363 | + |
| 363 | 364 | |
| 364 | 365 | |
| 365 | 366 | |
Lines changed: 3 additions & 2 deletions
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 112 | 112 | |
| 113 | 113 | |
| 114 | 114 | |
| 115 | | - |
| | 115 | + |
| | 116 | + |
| 116 | 117 | |
| 117 | 118 | |
| 118 | | - |
| | 119 | + |
| 119 | 120 | |
| 120 | 121 | |
| 121 | 122 | |