| From: | Nathan Bossart <nathandbossart(at)gmail(dot)com> |
|---|---|
| To: | "Chiranmoy(dot)Bhattacharya(at)fujitsu(dot)com" <Chiranmoy(dot)Bhattacharya(at)fujitsu(dot)com> |
| Cc: | "Malladi, Rama" <ramamalladi(at)hotmail(dot)com>, pgsql-hackers <pgsql-hackers(at)postgresql(dot)org>, "Ragesh(dot)Hajela(at)fujitsu(dot)com" <Ragesh(dot)Hajela(at)fujitsu(dot)com>, Salvatore Dipietro <dipiets(at)amazon(dot)com>, "Devanga(dot)Susmitha(at)fujitsu(dot)com" <Devanga(dot)Susmitha(at)fujitsu(dot)com> |
| Subject: | Re: [PATCH] SVE popcount support |
| Date: | 2025-02-06 16:33:35 |
| Message-ID: | Z6TkX1Uo3kxeUlKp@nathan |
| Views: | Whole Thread | Raw Message | Download mbox | Resend email |
| Thread: | |
| Lists: | pgsql-hackers |
On Thu, Feb 06, 2025 at 08:44:35AM +0000, Chiranmoy(dot)Bhattacharya(at)fujitsu(dot)com wrote:
>> Does this hand-rolled loop unrolling offer any particular advantage? What
>> do the numbers look like if we don't do this or if we process, say, 4
>> vectors at a time?
>
> The unrolled version performs better than the non-unrolled one, but
> processing four vectors provides no additional benefit. The numbers
> and code used are given below.
Hm. Any idea why that is? I wonder if the compiler isn't using as many
SVE registers as it could for this.
--
nathan
| From | Date | Subject | |
|---|---|---|---|
| Next Message | Chris Cleveland | 2025-02-06 16:46:41 | "Type does not exist" error when returning array of type in non-public schema |
| Previous Message | Alvaro Herrera | 2025-02-06 16:28:43 | Re: Support NOT VALID / VALIDATE constraint options for named NOT NULL constraints |