A simple way of speeding up impscheck for dense canditers #6847

monetdb-team · 2020-11-30T16:55:16Z

Date: 2020-04-06 02:24:45 +0200
From: Nenya Edjah <<nenya_edjah>>
To: GDK devs <>
Version: -- development

Last updated: 2020-06-03 16:58:52 +0200

Comment 27647

Date: 2020-04-06 02:24:45 +0200
From: Nenya Edjah <<nenya_edjah>>

User-Agent: Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/79.0.3945.79 Safari/537.36
Build Identifier:

In the impscheck() macro, there is an:

 if (im[icnt] & mask)

check to determine if the corresponding segments of the data should be scanned.

In the else case, we essentially skip over those irrelevant cache lines with a while loop:

 while (p < ci->ncand && o < e) {         
     p++;                                 
     o = canditer_next(ci);               
 }

For dense canditers, this is inefficient since the canditer_next(ci) operation basically just does some arithmetic operations. So, it doesn't make sense to waste CPU cycles by iterating through a while loop. Especially if the amount of data being skipped is large.

A simple optimization is to replace the snippet above with the following:

 if (ci->tpe == cand_dense) {                 
     BUN skip_sz = MIN(ci->ncand - p, e - o); 
     p += skip_sz;                            
     o += skip_sz;                            
     ci->next += skip_sz;                     
 } else {                                     
     while (p < ci->ncand && o < e) {         
         p++;                                 
         o = canditer_next(ci);               
     }                                        
 }

I implemented this change and ran a test on a nearly sorted column with 100m elements. When using a single CPU, this reduces the average query latency from 140ms to 4ms.

Since this is simple change, I just wanted to let you guys know about it in case you wanted to get it implemented in a future version MonetDB.

Best,
Nenya

Reproducible: Always

Comment 27648

Date: 2020-04-06 09:50:54 +0200
From: MonetDB Mercurial Repository <>

Changeset b0e2acf99f2f made by Sjoerd Mullender sjoerd@acm.org in the MonetDB repo, refers to this bug.

For complete details, see https//devmonetdborg/hg/MonetDB?cmd=changeset;node=b0e2acf99f2f

Changeset description:

Speed up imprints check for dense candidate lists.
Thanks lots to Nenya Edjah for the report and the core of the change.
This fixes bug #6847.

Comment 27649

Date: 2020-04-06 09:52:17 +0200
From: @sjoerdmullender

Thanks for the analysis and patch.
As you can see, I made some bigger changes, but in essence, it's your patch.

Comment 27650

Date: 2020-04-06 12:27:47 +0200
From: Nenya Edjah <<nenya_edjah>>

:D

The text was updated successfully, but these errors were encountered:

monetdb-team added enhancement New feature or request GDK Kernel labels Nov 30, 2020

monetdb-team closed this as completed Nov 30, 2020

sjoerdmullender added this to the Ancient Release milestone Feb 7, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

A simple way of speeding up impscheck for dense canditers #6847

A simple way of speeding up impscheck for dense canditers #6847

monetdb-team commented Nov 30, 2020

A simple way of speeding up impscheck for dense canditers #6847

A simple way of speeding up impscheck for dense canditers #6847

Comments

monetdb-team commented Nov 30, 2020

Comment 27647

Comment 27648

Comment 27649

Comment 27650