Re: getTables or code problem?

From: "Alessandro Depase" <alessandro(dot)depase(at)libero(dot)it>
To: "Kris Jurka" <books(at)ejurka(dot)com>, <pgsql-jdbc(at)postgresql(dot)org>
Subject: Re: getTables or code problem?
Date: 2004-01-17 11:39:07
Message-ID: 001b01c3dcee$8507e480$a4cf3052@dedalus1
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-jdbc

> On Fri, 16 Jan 2004, Alessandro Depase wrote:
>
> > > The way the postgresql server works is that you can only connect to
one
> > > database at a time. This means that is impossible to list the tables
in a
> > > database that you are not connected to. The thing is that you can get
a
> > > list of all the databases from any database because pg_database is a
> > > global table. It has been debated here before whether we should
return
> > > only the database you are connected to in the results of
getCatalogs().
> > > Another option would be to put in a check on a call like getTables to
see
> > > if a catalog was passed in that is different from the connected
catalog
> > > and either throw an Exception or return no results.
> > >
> > > What would make the most sense to you given the server's current
> > > restrictions?
> >
> > Thanks for your answer: at least now I know that there are such
> > restrictions.
> > I think that one of the two options (Exception or no result) will be
better
> > and less confusing that a list of tables belonging to another catalog.
Let's
> > suppose that we have several catalogs with almost the same tables (some
> > fields different, a table more, a table less as sometimes happens when
you
> > have different installation of the same system for different
customers...)
> > it could be difficult to understand that we are getting the wrong result
> > (because, for example, we are connected with a wrong connection string)
and
> > we could modify data where they should not be modified.
> >
> > My vote is for the Exception: in this way, with a clear message, I can
> > easily understand what the problem is, while an empty result could be
also a
> > bug in my code.
>
> Thinking about this some more, I don't think an Exception is correct. We
> don't throw an Exception if the given schema doesn't exist for example. I
> think we should just return no rows. The other problem is that some
> getXXX methods don't take a catalog name. For example getSchemas().

Yes, I too noted that getXXX methods don't take a catalog mail in my first
mail and it seemed to me a little odd, but thinking about this some more, as
you suggest, I think that your interpretation may be correct (also about the
implementation returning no rows, instead of the Exception). In fact,
reading better the javadocs it seems that the parameters for all those
methods are to be thought as search patterns (but maybe there are
specifications which define this better?). And when you don't find something
you don't have to throw Exceptions. However, how can we discriminate the
case in which we really have no tables, but the catalog is that
corresponding to the actual connection from the case in which the catalog is
not that of the actual connection (these are, in my opinion, very different
situations)? However, maybe the correct answer is, again, in the
specifications. Else both solutions are, in my opinion, better than the
actual implementation, with a so little difference that could be just a
matter of personal preference (and I vote again for the Exception, just to
be able to discriminate the error from the empty result, which are different
things - why don't throw an Exception also if the schema does not exist,
obviously if the specification permits this?).

> > I don't know what you decided at the end of the debate about the result
of
> > getCatalogs: my opinion (if you mind it) is that it's ok the actual
result
> > (because, at least, one can choose to create a new connection to deal
with
> > the other catalogues), above all if the user can see only catalogs to
which
> > he has access to (the user in my test can log on every db, so I cannot
tell
> > if the behaviour is that I just described).
> > Else, there could be a (very little) security problem.
>
> The current getCatalogs() returns all databases whether you can connect to
> them or not. At the moment I'm leaning towards making this return only
> the current database. The reason for this is that consider a tool like a
> cross platform database manager/viewer. (In particular I'm thinking of
> DBVisualizer.) It presents you with a hierarchy which you can navigate
> down catalog->schema->table type->... For an application like this how
> could it possibly know that it can't access other catalogs? It wouldn't
> be so bad if navigating into another database came up with nothing, but it
> will return values from the current db for schemas and possibly table
> types.

Well, I think that the issue is important also for the getSchemas() method,
because, as you noted before, it takes no catalog as parameter.
Which is its behaviour at the moment? Does it returns schemas from all
catalogs or just the schemas for the current connection? If the first is the
actual implementation, maybe this can lead to navigation problems.
However, the reason to return catalogs and schema only for the current
connection is not, in my opinion, the one you reported before (how could it
possibly know that it can't access other catalogs?), because you could
return catalogs for which the current user is enabled to, or, eventually, an
application like DBVisualizer or pgAdmin can simply try, for example, to
open a new connection with the actual data and, in case of error, using a
pop-up window to ask the user the correct parameters for that connection
(and maybe there are a number of other ways...). This, however, could be the
reason not to return schemas from other catalogs, because I don't think that
the same workaround apply well.
But, as we wrote before, the reasons not to return other catalogs could be:
- security (very little problem, because the real block is elsewhere, but
why help hackers know that there are other catalogs?)
- as a matter of congruence with the other getXXX methods (above all the
getSchema, which has the same signature, if it returns only the schemas in
the current connection)
- what do specifications say?

Just a last note: when I ask for tables, the 'table_cat' field is null, as I
wrote in my first mail. This seems to be incorrect to me (but, maybe this is
not the correct place to tell this, because it is very probable that this is
not a JDBC problem, but a general problem). However I cannot imagine a real
effective use for this, above all if we consider what we wrote before: in
fact this field should be filled only and always with the only catalog to
which we are connected to. Or am I missing something?

> Perhaps the core of this problem may be that Connection.setCatalog does
> not throw an Exception when given a catalog name other than the current
> one.

And what is the effect for this?

Bye
Alessandro Depase

In response to

Browse pgsql-jdbc by date

  From Date Subject
Next Message Barry Lind 2004-01-17 22:00:53 Re: getTables or code problem?
Previous Message Kris Jurka 2004-01-16 23:23:46 Re: getTables or code problem?