Quick Links

scaling up from t1n to 60 million records

From:	Martin Mueller <martinmueller(at)northwestern(dot)edu>
To:	"pgsql-general(at)postgresql(dot)org" <pgsql-general(at)postgresql(dot)org>
Subject:	scaling up from t1n to 60 million records
Date:	2026-05-19 14:27:28
Message-ID:	CY8PR05MB1010861EAD48ED098786C9690C4002@CY8PR05MB10108.namprd05.prod.outlook.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-general

I use Postgres with a GUI frontend (Aquafold) as a very large spreadsheet on steroids that analyzes rare or defective spellings in a corpus of 65,000 texts and1.5 billion words. I typically extract data from the corpus with python scripts, turn them into tables and load them into the database.

On my Mac with 32 GB of memory performance is OK with queries that typically within seconds extract data rows from tables with up to ten million rows. If the result set is large, I suspect that most of time machine's time is spent displaying result sets. I have used indexing sparingly. While it helps, the time savings often don't matter much.

I am thinking about scaling up to table with about 60 million rows. Are there things to do or watch out for? Or should I proceed on the assumption that that 60 million records are within scope and that the added timecost is roughly linear?

Martin Mueller
Professor emeritus of English and Classics
Northwestern University

Responses

Re: scaling up from t1n to 60 million records at 2026-05-19 14:32:39 from Jan Karremans
Re: scaling up from t1n to 60 million records at 2026-05-19 14:41:42 from Ron Johnson
Re: scaling up from t1n to 60 million records at 2026-05-19 14:44:57 from Adrian Klaver

Browse pgsql-general by date

	From	Date	Subject
Next Message	Jan Karremans	2026-05-19 14:32:39	Re: scaling up from t1n to 60 million records
Previous Message	Sandeep	2026-05-19 04:25:17	Re: REQ -POA on Minor Version upgrade