Quick Links

Re: [PATCH] Hex-coding optimizations using SVE on ARM.

From:	"Chiranmoy(dot)Bhattacharya(at)fujitsu(dot)com" <Chiranmoy(dot)Bhattacharya(at)fujitsu(dot)com>
To:	Nathan Bossart <nathandbossart(at)gmail(dot)com>
Cc:	John Naylor <johncnaylorls(at)gmail(dot)com>, Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, David Rowley <dgrowleyml(at)gmail(dot)com>, "pgsql-hackers(at)postgresql(dot)org" <pgsql-hackers(at)postgresql(dot)org>, "Devanga(dot)Susmitha(at)fujitsu(dot)com" <Devanga(dot)Susmitha(at)fujitsu(dot)com>, "Ragesh(dot)Hajela(at)fujitsu(dot)com" <Ragesh(dot)Hajela(at)fujitsu(dot)com>
Subject:	Re: [PATCH] Hex-coding optimizations using SVE on ARM.
Date:	2025-09-11 10:43:56
Message-ID:	OS9PR01MB1518513BA12A54A243342A2999709A@OS9PR01MB15185.jpnprd01.prod.outlook.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

> Thanks. I noticed that this stuff is simple enough that we can use
> port/simd.h (with a few added functions). This is especially nice because
> it takes care of x86, too. The performance gains look similar to what you
> reported for v6:

This looks good, much cleaner.
One possible improvement would be to use a vectorized table lookup instead of compare and add. I compared v6 and v7 Neon versions, and v6 is always faster.
I’m not sure if SSE2 has a table lookup similar to Neon.

arm - m7g.4xlarge
buf | v6-Neon| v7-Neon| % diff
-------+--------+--------+--------
64 | 6.16 | 8.57 | 28.07
128 | 11.37 | 15.77 | 27.87
256 | 18.54 | 30.28 | 38.77
512 | 33.98 | 62.15 | 45.33
1024 | 64.46 | 117.55 | 45.16
2048 | 124.28 | 254.86 | 51.24
4096 | 243.47 | 509.23 | 52.19
8192 | 487.34 | 953.81 | 48.91

-----
Chiranmoy

In response to

Re: [PATCH] Hex-coding optimizations using SVE on ARM. at 2025-09-11 03:12:52 from Nathan Bossart

Responses

Re: [PATCH] Hex-coding optimizations using SVE on ARM. at 2025-09-11 15:24:27 from Nathan Bossart

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Vamshikrishna T	2025-09-11 10:48:57	Re: AIX support
Previous Message	Ashutosh Sharma	2025-09-11 09:45:48	Re: Clear logical slot's 'synced' flag on promotion of standby