pgsql: Speed up byteain by not parsing traditional-style input twice.

From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: pgsql-committers(at)lists(dot)postgresql(dot)org
Subject: pgsql: Speed up byteain by not parsing traditional-style input twice.
Date: 2025-07-18 20:42:18
Message-ID: E1ucruM-006yYH-2A@gemulon.postgresql.org
Views: Whole Thread | Raw Message | Download mbox | Resend email
Thread:
Lists: pgsql-committers

Speed up byteain by not parsing traditional-style input twice.

Instead of laboriously computing the exact output length, use strlen
to get an upper bound cheaply. (This is still O(N) of course, but
the constant factor is a lot less.) This will typically result in
overallocating the output datum, but that's of little concern since
it's a short-lived allocation in just about all use-cases.

A simple microbenchmark showed about 40% speedup for long input
strings.

While here, make some cosmetic cleanups and add a test case that
covers the double-backslash code path in byteain and byteaout.

Author: Steven Niu <niushiji(at)gmail(dot)com>
Reviewed-by: Kirill Reshke <reshkekirill(at)gmail(dot)com>
Reviewed-by: Stepan Neretin <slpmcf(at)gmail(dot)com>
Reviewed-by: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Discussion: https://postgr.es/m/ca315729-140b-426e-81a6-6cd5cfe7ecc5@gmail.com

Branch
------
master

Details
-------
https://git.postgresql.org/pg/commitdiff/3683af617044d271ab7486d43d06f9689ed4961d

Modified Files
--------------
src/backend/utils/adt/bytea.c | 61 +++++++++--------------------------
src/test/regress/expected/strings.out | 12 +++++++
src/test/regress/sql/strings.sql | 2 ++
3 files changed, 30 insertions(+), 45 deletions(-)

Browse pgsql-committers by date

  From Date Subject
Next Message Michael Paquier 2025-07-19 05:47:25 pgsql: Add PostgreSQL::Test::Cluster::is_alive()
Previous Message Nathan Bossart 2025-07-18 16:27:38 pgsql: Remove unused variable in generate-lwlocknames.pl.