| From: | Nathan Bossart <nathandbossart(at)gmail(dot)com> | 
|---|---|
| To: | Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us> | 
| Cc: | Alvaro Herrera <alvherre(at)alvh(dot)no-ip(dot)org>, Ants Aasma <ants(dot)aasma(at)cybertec(dot)at>, "Amonson, Paul D" <paul(dot)d(dot)amonson(at)intel(dot)com>, David Rowley <dgrowleyml(at)gmail(dot)com>, Andres Freund <andres(at)anarazel(dot)de>, "Shankaran, Akash" <akash(dot)shankaran(at)intel(dot)com>, Noah Misch <noah(at)leadboat(dot)com>, Matthias van de Meent <boekewurm+postgres(at)gmail(dot)com>, "pgsql-hackers(at)lists(dot)postgresql(dot)org" <pgsql-hackers(at)lists(dot)postgresql(dot)org> | 
| Subject: | Re: Popcount optimization using AVX512 | 
| Date: | 2024-04-02 18:40:21 | 
| Message-ID: | 20240402184021.GA2802541@nathanxps13 | 
| Views: | Whole Thread | Raw Message | Download mbox | Resend email | 
| Thread: | |
| Lists: | pgsql-hackers | 
On Tue, Apr 02, 2024 at 01:43:48PM -0400, Tom Lane wrote:
> Alvaro Herrera <alvherre(at)alvh(dot)no-ip(dot)org> writes:
>> On 2024-Apr-02, Nathan Bossart wrote:
>>> Another idea I had is to turn pg_popcount() into a macro that just uses the
>>> pg_number_of_ones array when called for few bytes:
>>> 
>>> 	static inline uint64
>>> 	pg_popcount_inline(const char *buf, int bytes)
>>> 	{
>>> 		uint64		popcnt = 0;
>>> 
>>> 		while (bytes--)
>>> 			popcnt += pg_number_of_ones[(unsigned char) *buf++];
>>> 
>>> 		return popcnt;
>>> 	}
>>> 
>>> 	#define pg_popcount(buf, bytes) \
>>> 		((bytes < 64) ? \
>>> 		 pg_popcount_inline(buf, bytes) : \
>>> 		 pg_popcount_optimized(buf, bytes))
>>> 
>>> But again, I'm not sure this is really worth it for the current use-cases.
> 
>> Eh, that seems simple enough, and then you can forget about that case.
> 
> I don't like the double evaluation of the macro argument.  Seems like
> you could get the same results more safely with
> 
> 	static inline uint64
> 	pg_popcount(const char *buf, int bytes)
> 	{
> 		if (bytes < 64)
> 		{
> 			uint64		popcnt = 0;
> 
> 			while (bytes--)
> 				popcnt += pg_number_of_ones[(unsigned char) *buf++];
> 
> 			return popcnt;
> 		}
> 		return pg_popcount_optimized(buf, bytes);
> 	}
Yeah, I like that better.  I'll do some testing to see what the threshold
really should be before posting an actual patch.
-- 
Nathan Bossart
Amazon Web Services: https://aws.amazon.com
| From | Date | Subject | |
|---|---|---|---|
| Next Message | Daniel Gustafsson | 2024-04-02 18:55:41 | Re: Cutting support for OpenSSL 1.0.1 and 1.0.2 in 17~? | 
| Previous Message | David E. Wheeler | 2024-04-02 18:38:56 | RFC: Additional Directory for Extensions |