Skip site navigation (1) Skip section navigation (2)

Re: Stream Copy for 8.1 - 8.3dev

From: "Kalle Hallivuori" <kato(at)iki(dot)fi>
To: "Kris Jurka" <books(at)ejurka(dot)com>
Cc: pgsql-jdbc(at)postgresql(dot)org
Subject: Re: Stream Copy for 8.1 - 8.3dev
Date: 2007-07-16 07:15:34
Message-ID: c637d8bb0707160015l6ecf2bf1x5c89c5f85702f2f4@mail.gmail.com (view raw or flat)
Thread:
Lists: pgsql-jdbc
Hi Kris!

2007/7/16, Kris Jurka <books(at)ejurka(dot)com>:
> > http://kato.iki.fi/sw/db/postgresql/jdbc/copy/
>
> Some thoughts after giving this a read (but not a test).

Thanks a lot!

> 1) Your CopyObjects example pays no attention to encoding or escaping.
> Calling getBytes() will give you bytes in the JVM's character set which
> can easily be different than the client_encoding which the driver will
> always set to UTF-8 (for 7.3 and up).  You also have to think about what
> happens when your data contains a null escape, delimiter, or newline
> itself.

Yes, it's very quick and dirty example. I'll look into providing a
more thorough mediator class as you propose below.

> 2) I'm not sure it's helpful to have the copy methods throw both an IO and
> SQL exception.  Why not just wrap any IOExceptions inside a SQLException?

I thought SQLException means that the transaction failed, but
connection is still usable, whereas an IOException leaves the
connection in an unknown state so that it must be dropped. But since
that is not consistent with the rest of the driver, it's obviously
wrong thought :) I'll wrap'em.

> 3) What is the purpose of the reuse_buffer parameter?  What is the use
> case for someone wanting a "fresh" buffer every time?

A custom stream might store them internally. Obviously I should
document such details before giving the code out for review.

> 4) buildCopyQuery doesn't handle quoting/escaping of identifiers.  By
> providing something like this you're now responsible for all the "hard"
> stuff.  I would leave it out unless you're prepared to do a lot more
> thinking about it.  It also doesn't handle all the possible copy options.

I'll just drop the query building functionality. It doesn't really
offer much value, as the caller has to be familiar with the syntax
anyway.

> Perhaps you should split this into core copy functionality and a helper
> class that builds upon it and can provide other useful things (escaping /
> conversion of java objects to pg datatypes).

I'll look into providing a couple of helper classes to target these.

On one hand I think we should honor the copy philosophy of keeping the
raw data as flat as possible. On the other hand, providing a full
object interface on top of the former would be definitely sexy :)

> 5) The coding in QueryExecutorImpl is unsafe because once you get
> CopyInResponse you loop firing away data without listening for any return
> data.  Consider a table which had a trigger on it which issued a notice
> for each row it received.  If you don't read from the server the server
> will block and then you'll block sending it data.  See the comments in
> core/v3/QueryExecutorImpl near MAX_BUFFERED_QUERIES for more details.

I'll fix that. Thanks!

> 6) You mix warnings and errors together.  They should be kept separate.

Ok, I'll fix that too.

> 7) When you get CopyResponses and the user hasn't provided the appropriate
> stream you bail leaving the protocol in an unknown state.  You should
> issue a CopyFail and wait for ReadyForQuery so the whole connection isn't
> lost.

Oops, that's really lousy of me. Will fix.

I'll let you know once I have a fixed version available.

Cheers,

-- 
Kalle Hallivuori +358-41-5053073 http://korpiq.iki.fi/

In response to

pgsql-jdbc by date

Next:From: Marek LewczukDate: 2007-07-16 07:41:11
Subject: Re: patch - support for multi-dimensional arrays and NULL values
Previous:From: Marek LewczukDate: 2007-07-16 06:34:40
Subject: Re: patch - support for multi-dimensional arrays and NULL values

Privacy Policy | About PostgreSQL
Copyright © 1996-2014 The PostgreSQL Global Development Group