lpad, rpad, translate should use codepoints, not graphemes #21060

@neilconway

Description

Describe the bug

For consistency with most other SQL implementations (including Postgres) and with the rest of DataFusion, lpad, rpad, and translate should operate on Unicode codepoints, not grapheme clusters, when manipulating strings. #3054 switched most string functions from graphemes to codepoints, but AFAICS these remaining UDFs still need the same change.
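
To illustrate the difference, here is a minimal sketch of codepoint-based left padding using only the Rust standard library. The function name `lpad_codepoints` and its signature are hypothetical and are not DataFusion's actual implementation. The truncation and empty-fill behavior is meant to mirror Postgres's `lpad`.

```rust
/// Hypothetical helper (not DataFusion's code): left-pad `s` to `target_len`
/// Unicode codepoints using `fill`, truncating on the right if `s` is longer.
fn lpad_codepoints(s: &str, target_len: usize, fill: &str) -> String {
    // `chars()` iterates over Unicode scalar values (codepoints),
    // not grapheme clusters.
    let s_len = s.chars().count();

    // If the input is already long enough, keep only the first
    // `target_len` codepoints.
    if target_len <= s_len {
        return s.chars().take(target_len).collect();
    }

    // An empty fill string cannot pad anything; return the input unchanged.
    let fill_chars: Vec<char> = fill.chars().collect();
    if fill_chars.is_empty() {
        return s.to_string();
    }

    // Repeat the fill codepoints cyclically until the padding is complete,
    // then append the original string.
    let pad_len = target_len - s_len;
    let mut out = String::with_capacity(s.len() + pad_len * 4);
    out.extend(fill_chars.iter().cycle().take(pad_len));
    out.push_str(s);
    out
}
```

With the input `"e\u{301}"` (the letter `e` followed by U+0301 COMBINING ACUTE ACCENT), the string is two codepoints but a single grapheme cluster. Under this sketch, `lpad_codepoints("e\u{301}", 3, "x")` adds one `x`, whereas a grapheme-based implementation would treat the input as length 1 and add two.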

To Reproduce

No response

Expected behavior

No response

Additional context

No response

Metadata

Assignees

Labels

bug: Something isn't working
