Re: Group by more efficient than distinct?

by matthew@[EMAIL PROTECTED] (Matthew Wakeling) Apr 22, 2008 at 01:22 PM

On Tue, 22 Apr 2008, Mark Mielke wrote:
> The poster I responded to said that the memory required for a hash join was
> relative to the number of distinct values, not the number of rows. They gave
> an example of millions of rows, but only a few distinct values. Above, you
> agree with me that it would include the rows (or at least references to
> the rows) as well. If it stores rows, or references to rows, then memory *is*
> relative to the number of rows, and millions of records would require
> millions of rows (or row references).

Yeah, I think we're talking at cross-purposes, due to hash tables being 
used in two completely different places in Postgres. Firstly, you have 
hash joins, where Postgres loads the references to the actual rows, and 
puts those in the hash table. For that situation, you want a small number 
of rows. Secondly, you have hash aggregates, where Postgres stores an 
entry for each "group" in the hash table, and does not store the actual 
rows. For that situation, you can have a bazillion individual rows, but 
only a small number of distinct groups.
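
To make the distinction concrete, here is a rough sketch in Python. It is only a toy model of the two uses of a hash table described above, not how Postgres implements either one; the function names, the sample rows and the "colour" column are all made up for illustration.

```python
from collections import defaultdict

def hash_join_build(rows, key):
    """Toy build phase of a hash join: every row (here, a reference to it)
    is kept in the table, so the table grows with the number of rows."""
    table = defaultdict(list)
    for row in rows:
        table[row[key]].append(row)
    return table

def hash_aggregate_count(rows, key):
    """Toy hash aggregate (a COUNT per group): only one running state per
    group is kept, so the table grows with the number of distinct groups."""
    groups = {}
    for row in rows:
        groups[row[key]] = groups.get(row[key], 0) + 1
    return groups

# Hypothetical data: many rows, only three distinct values.
colours = ("red", "green", "blue")
rows = [{"id": i, "colour": colours[i % 3]} for i in range(30_000)]

join_table = hash_join_build(rows, "colour")
agg_table = hash_aggregate_count(rows, "colour")

# Number of items each structure has to hold on to.
stored_in_join = sum(len(bucket) for bucket in join_table.values())
stored_in_agg = len(agg_table)

print("hash join build holds", stored_in_join, "row references")
print("hash aggregate holds", stored_in_agg, "group entries")
```

With this data the join-style table holds a reference for each of the 30,000 rows, while the aggregate-style table holds just three entries, one per group.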

Matthew

-- 
First law of computing:  Anything can go wro
sig: Segmentation fault.  core dumped.

 



