Darryl Gove's Blog: Finding zeros in an array

Friday, January 30, 2015

Finding zeros in an array

A common thing to want to do is to find zero values in an array. This is obviously necessary for string length. So we'll start out with a test harness and a simple implementation:

#include "timing.h"

unsigned int len(char* array)
{
  unsigned int length = 0;
  while( array[length] ) { length++; }
  return length;
}

#define COUNT 100000
void main()
{
  char array[ COUNT ];
  for (int i=1; i<COUNT; i++)
  {
    array[i-1] = 'a';
    array[i] = 0;
    if ( i != len(array) ) { printf( "Error at %i\n", i ); }
  }
  starttime();
  for (int i=1; i<COUNT; i++)
  {
    array[i-1] = 'a';
    array[i] = 0;
    len(array);
  }
  endtime(COUNT);
}

A chap called Alan Mycroft came up with a very neat algorithm to simultaneously examine multiple bytes and determine whether there is a zero in them. His algorithm starts off with the idea that there are two conditions that need to be true if a byte contains the value zero. First of all the upper bit of the byte must be zero, this is true for zero and all values less than 128, so on its own it is not sufficient. The second characteristic is that if one is subtracted from the value, then the upper bit must be one. This is true for zero and all values greater than 128. Although both conditions are individually satisfied by multiple values, the only value that satisfies both conditions is zero.

The following code uses the Mycroft test for a string length implementation. The code contains a pre-loop to get to an eight byte aligned address.

unsigned int len2(char* array)
{
  unsigned int length = 0;
  // Handle misaligned data
  while ( ( (unsigned long long) & array[length] ) &7 )
  {
    if ( array[length] == 0 ) { return length; }
    length++;
  }

  unsigned long long * p = (unsigned long long *) & array[length];
  unsigned long long v8, v7;
  do
  {
    v8 = *p;
    v7 = v8 - 0x0101010101010101;
    v7 = (v7 & ~v8) & 0x8080808080808080;
    p++;
  }
  while ( !v7 );
  length = (char*)p - array-8;
  while ( array[length] ) { length++; }
  return length;
}

The algorithm has one weak point. It does not always report exactly which byte is zero, just that there is a zero byte somewhere. Hence the final loop where we work out exactly which byte is zero.

It is a trivial extension to use this to search for a byte of any value. If we XOR the input vector with a vector of bytes containing the target value, then we get a zero byte where the target value occurs, and a non-zero byte everywhere else.

It is also easy to extend the code to search for other zero bit patterns. For example, if we want to find zero nibbles (ie 4 bit values), then we can change the constants to be 0x1111111111111111 and 0x8888888888888888.

5 comments:

CharlotteNovember 19, 2021 at 9:30 AM
Your style is unique compared to other people from whom I have read material. Thanks a lot for posting when I get a chance, guess I'll bookmark this site. Is a visa required for Turkey. Yes, you may need a visa for Turkey before entering Turkey. The Turkey evisa process is fast and efficient. you can apply for a Turkey visa online.
ReplyDelete
Replies
AnonymousMay 25, 2022 at 7:58 PM
smm panel
Smm panel
iş ilanları
instagram takipçi satın al
hirdavatciburada.com
Https://www.beyazesyateknikservisi.com.tr/
servis
tiktok jeton hilesi
ReplyDelete
Replies
SophiaMay 27, 2022 at 4:37 AM
I'm glad to be visiting your blog again after months of not doing so. It's good to see this article that I've been waiting so long for. Travelers who are planning to visit Turkey. It is important for them to be aware of the Turkish visa requirements to avoid an inconvenience during the visa application process.
ReplyDelete
Replies
AnonymousJune 3, 2022 at 12:59 PM
üsküdar lg klima servisi
kartal samsung klima servisi
maltepe arçelik klima servisi
kadıköy arçelik klima servisi
ümraniye arçelik klima servisi
beykoz samsung klima servisi
üsküdar samsung klima servisi
beykoz mitsubishi klima servisi
üsküdar mitsubishi klima servisi
ReplyDelete
Replies
AnonymousJune 27, 2022 at 10:38 AM
yurtdışı kargo
lisans satın al
en son çıkan perde modelleri
özel ambulans
nft nasıl alınır
en son çıkan perde modelleri
uc satın al
minecraft premium
ReplyDelete
Replies

Add comment

Darryl Gove's Blog

Friday, January 30, 2015

Finding zeros in an array

5 comments:

Multicore Application Programming

Multicore Application Programming resources

Solaris Application Programming

Solaris Application Programming resources

The Developer's Edge

The Developer's Edge resources

OpenSPARC Internals

OpenSPARC Internals resources

Archive

Other blogs

Darryl Gove's Blog

Friday, January 30, 2015

Finding zeros in an array

5 comments:

Multicore Application Programming

Multicore Application Programming resources

Solaris Application Programming

Solaris Application Programming resources

The Developer's Edge

The Developer's Edge resources

OpenSPARC Internals

OpenSPARC Internals resources

Archive

Other blogs

Subscribe