URL:
http://liujoycec.github.io/

JOYCE LIU

Software developer, math geek, and foreign language enthusiast living in Seattle


© 2015. All rights reserved.


USING SYMMETRY TO OPTIMIZE AN N-QUEENS COUNTING ALGORITHM

20 Sep 2015

November 7, 2015 Update: I have now created an interactive visualization tool to
walk you through the execution of my algorithm, for visual learners and for
anyone who just likes cool stuff. Check it out! If you haven't yet read this
post to learn about my awesome N-Queens counting algorithm, I hope you enjoy it!

Several weeks ago, I was introduced to the N-Queens counting problem, and I got
to solve it using bitwise operations in JavaScript. While it was fun to solve it
on my own, I also wanted to see what efficient solutions were already out there
so I could learn how to improve my own algorithm. I stumbled upon this blog by
Greg Trowbridge, which presents a JavaScript variation of the N-Queens counting
algorithm found in a paper by Martin Richards (N-Queens is discussed on pages
2-4 of the paper). This algorithm is highly efficient and has some really cool
optimizations that mine didn't have, but I noticed that it didn't take advantage
of any symmetry optimizations. Thus began my quest to modify an already
awesomely efficient algorithm in order to cut its running time in half...

This next section gives a brief introduction to the N-Queens puzzle and how
bitwise operations can be used to represent a chessboard. If you are already
familiar with these, feel free to skip to the section after it.


INTRODUCTION TO N-QUEENS AND BITWISE OPERATIONS IN JAVASCRIPT

The N-Queens problem is a puzzle in which you are given an N-by-N chessboard,
and you must place exactly N queens on it in such a way that no queen can attack
another (remember that the queen can attack any piece that is in the same row,
column, or diagonal). For a large N, just finding one solution can be quite
daunting for a human but is a fairly quick task for a computer. The real
challenge is counting all of the distinct solutions that exist for a particular
N. Unlike the number of ways to place N non-attacking rooks on an N-by-N
chessboard, which is simply N! (N factorial), there isn't any known mathematical
formula to quickly calculate the number of N-Queens solutions. So the only way
to count the solutions is to actually find every one of them. For very large N,
this exhausts the computer's resources before the program finishes. A standard
8x8 chessboard has 92 distinct solutions, and when N=17, there are 95,815,104
distinct solutions! According to this blogpost by Paul Sokolik, as of June 2015
the largest N for which all the distinct solutions have been enumerated is N=26.
The computation took about 9 months using 11 super-computers! This is why the
puzzle has become so famous in the computer science community: every
optimization in the hardware and the algorithm counts.
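To make the contrast with the rook count concrete, here is a minimal brute-force sketch. It is not the algorithm discussed in this post, and the name countByPermutations is made up for illustration. It walks every one-queen-per-row-and-column placement (the N! rook placements) and keeps only those in which no two queens share a diagonal.

```javascript
// Brute-force sketch (illustrative only): every permutation of columns is a
// valid N-rooks placement; it is an N-Queens solution only if no two queens
// share a diagonal.
function countByPermutations(n) {
  const cols = [];                       // cols[row] = column of that row's queen
  const used = new Array(n).fill(false); // columns already taken
  let rooks = 0;                         // counts all N! rook placements
  let queens = 0;                        // counts those that are also queen-safe

  function diagonalsClear() {
    for (let r1 = 0; r1 < n; r1++) {
      for (let r2 = r1 + 1; r2 < n; r2++) {
        // Same diagonal when the column distance equals the row distance.
        if (Math.abs(cols[r1] - cols[r2]) === r2 - r1) return false;
      }
    }
    return true;
  }

  function place(row) {
    if (row === n) {
      rooks++;
      if (diagonalsClear()) queens++;
      return;
    }
    for (let c = 0; c < n; c++) {
      if (used[c]) continue;
      used[c] = true;
      cols[row] = c;
      place(row + 1);
      used[c] = false;
    }
  }

  place(0);
  return { rooks: rooks, queens: queens };
}

console.log(countByPermutations(8)); // { rooks: 40320, queens: 92 }
```

Only 92 of the 40,320 rook placements survive the diagonal check, and the search had to visit all of them to find that out. Checking each board only after it is complete is exactly the kind of waste that backtracking and bitmasks avoid.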



For those who have not seen the N-Queens problem solved using bitwise
operations, here's a brief summary of how the chessboard is represented. Bitwise
operators are very fast because they directly operate on individual bits, and a
sequence of bits can visually represent a row on a chessboard, making bitwise
operations ideal for the N-Queens problem. Let's use a 4x4 chessboard as an
example. Each row in the chessboard is represented by a single binary number,
which is just a sequence of bits. If a queen is placed in the leftmost square of
a row, the row is represented by the number 8, which in binary is 1000, because
the 1 is in the leftmost spot in this sequence. If a queen is in the 3rd square
from the left, then the row is represented by 2, or 0010. The numbers other than
1, 2, 4, and 8 can be used to represent occupation in more than one square, such
as 9 -> 1001, or 15 -> 1111. This is useful for marking squares which would
cause conflict (squares which are in the same column or diagonal as another
queen in a different row). For example, if our conflict sequence is 5 (0101), it
would indicate that the only open squares left that won't conflict with other
queens already on the board are the 1st and 3rd squares from the left, so those
two squares will be the only ones where we try to place the next queen. When you
get to a row where every square is in conflict (1111), then you know you've gone
down the wrong path which will not lead to a valid solution, so you then
backtrack up a row to place the previous queen somewhere else. If the previous
queen can't be placed anywhere else, then you must keep backtracking up further
to change other queens' positions until you find a solution where all the queens
are happy :)
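The 4x4 examples above can be reproduced in a few lines. This is only a sketch; the helper names toBits, squareFromLeft, and isDeadEnd are invented here and do not appear in either algorithm below.

```javascript
// 4x4 board: the leftmost square is the highest bit (8 = 1000).
const N = 4;
const full = (1 << N) - 1; // 1111: every square of a row

// Hypothetical helper: show a row mask as an n-character bit string.
function toBits(mask, n) {
  return mask.toString(2).padStart(n, '0');
}

// Hypothetical helper: mask for the k-th square from the left (1-based).
function squareFromLeft(k, n) {
  return 1 << (n - k);
}

// Hypothetical helper: true when every square of the row is in conflict.
function isDeadEnd(conflictMask, n) {
  const all = (1 << n) - 1;
  return (conflictMask & all) === all;
}

console.log(toBits(squareFromLeft(1, N), N)); // 1000 (the number 8)
console.log(toBits(squareFromLeft(3, N), N)); // 0010 (the number 2)

// Conflicts combine with OR: here the 2nd and 4th squares are attacked.
const conflict = squareFromLeft(2, N) | squareFromLeft(4, N);
console.log(conflict, toBits(conflict, N)); // 5 0101

// Open squares are the zero bits of the conflict mask, limited to N bits.
const open = ~conflict & full;
console.log(toBits(open, N)); // 1010: the 1st and 3rd squares from the left

console.log(isDeadEnd(conflict, N)); // false
console.log(isDeadEnd(0b1111, N));   // true: time to backtrack
```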

For reference, here is a list of the bitwise operators in JavaScript.


A GREAT BITWISE SOLUTION IN JAVASCRIPT...

I highly recommend reading the above mentioned blogpost by Greg Trowbridge, as
the algorithm presented in it is pretty nifty, and the blogpost does a great job
of thoroughly explaining how it works. The next section will assume you
understand how this algorithm works, so it's a good idea to read the blogpost if
you don't.

For reference, this is the solution presented in the blogpost:


countNQueensSolutions = function(n) {
  //Keeps track of the # of valid solutions
  var count = 0;

  //Helps identify valid solutions
  var done = Math.pow(2,n) - 1;

  //Checks all possible board configurations
  var innerRecurse = function(ld, col, rd) {

    //All columns are occupied,
    //so the solution must be complete
    if (col === done) {
      count++;
      return;
    }

    //Gets a bit sequence with "1"s
    //wherever there is an open "slot"
    var poss = ~(ld | rd | col);

    //Loops as long as there is a valid
    //place to put another queen.
    while ( poss & done ) {
      var bit = poss & -poss;
      poss -= bit;
      innerRecurse((ld|bit)>>1, col|bit, (rd|bit)<<1);
    }
  };

  innerRecurse(0,0,0);

  return count;
};


In comparing my original solution to this one, I found a really cool
optimization here that my first solution didn't have, in the line that says "var
bit = poss & -poss". In the binary representation of the variable poss, the
zeroes denote spaces in conflict, and the ones are the remaining possibilities.
(poss is the bitwise complement of the conflict sequence, so it is the opposite
of what I described in the section above.) With a single bitwise operation, bit
gives us the position of the first available square from the right, without
having to iterate over each square to check if it's open. For example, if poss
is 01010000, then bit is 00010000, and there is no need to waste time and
computing resources on iterating over the first four spots to check if they are
open (or rather, iterating over the powers of two, which represent the spots).
This saves a lot of time when you get to the lower rows, where most of the
squares have some conflict with queens in the rows above.
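The trick works because JavaScript's bitwise operators treat numbers as 32-bit two's-complement integers: -poss equals ~poss + 1, which matches poss only at its lowest set bit. The sketch below isolates that step; the helper name openSquares is made up for illustration.

```javascript
// Hypothetical helper: list the open squares of `poss` in the order the
// while loop visits them (rightmost open square first).
function openSquares(poss) {
  const bits = [];
  while (poss) {
    const bit = poss & -poss; // isolates the lowest set bit
    bits.push(bit);
    poss ^= bit;              // clear that bit and move on
  }
  return bits;
}

const examplePoss = 0b01010000;

console.log((examplePoss & -examplePoss).toString(2));
// 10000

console.log(openSquares(examplePoss).map(b => b.toString(2)));
// [ '10000', '1000000' ]
```

The loop body runs once per open square, not once per square on the board, which is where the savings on crowded rows come from.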

However, I noticed that this solution did not take advantage of one
optimization: symmetry. So I decided to rewrite it to include this optimization.


...AND HOW I IMPROVED UPON IT USING SYMMETRY

When we have a valid N-Queens solution, the mirror image of it will obviously
still be a valid solution. What's more, this actually counts as a distinct
solution from the first one, even though all we did was flip the board over! No
solution is its own mirror image when N > 1 (a self-mirrored board would need
every queen in the middle column), so the solutions pair up, and we can just
find half of them and multiply the count by 2. When N is even, we can just
filter out one half of the first
row, knowing that the solutions we miss out on will have their mirror images
found when we explore the other half of the row. The case when N is odd is a
tiny bit trickier, since we can't divide the odd number of squares in the first
row by 2. For all solutions where the queen in the first row is not in the
middle square, we can still find half of those solutions and multiply by 2. But
it turns out we can do the same thing when the first row has its middle square
occupied. When there is a queen in the middle square of the first row, then
there can't be a queen in the middle square of the second row because then it
would be in the same column as the first queen. Now there are an even number of
squares in the second row that are still available! We can just exclude half of
the remaining squares in the second row so that we find exactly half of the
solutions in which the first queen is in the middle. Add that to half of the
solutions where the first queen is not in the middle, and we get exactly half of
all solutions, which we then multiply by 2. Voila!
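One way to see the symmetry is to group the solutions by where the first-row queen sits. The sketch below uses a plain backtracking search, not the bitwise algorithm, purely for readability; the function name countsByFirstColumn is an invention of this example.

```javascript
// Sketch: count solutions grouped by the column of the first-row queen.
function countsByFirstColumn(n) {
  const counts = new Array(n).fill(0);
  const cols = []; // cols[row] = column of the queen in that row

  // A square is safe if no earlier queen shares its column or a diagonal.
  function safe(row, c) {
    for (let r = 0; r < row; r++) {
      if (cols[r] === c || Math.abs(cols[r] - c) === row - r) return false;
    }
    return true;
  }

  function place(row) {
    if (row === n) {
      counts[cols[0]]++;
      return;
    }
    for (let c = 0; c < n; c++) {
      if (!safe(row, c)) continue;
      cols[row] = c;
      place(row + 1);
    }
  }

  place(0);
  return counts;
}

const byColumn = countsByFirstColumn(8);
console.log(byColumn.join(' ')); // 4 8 16 18 18 16 8 4

// Searching only one half of the first row and doubling gives the total.
const half = byColumn.slice(0, 4).reduce((a, b) => a + b, 0);
console.log(half * 2); // 92
```

The counts read the same forwards and backwards, which is the mirror-image pairing in numeric form. For odd N the middle entry has no partner column, which is why that case needs the second-row filter described above.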

To illustrate:

We will exclude the right half of the first row. For odd N, this means up to,
but not including, the middle square. This filter will prevent us from finding
solutions such as this one:



But that's ok, because we will find its mirror image, and multiply the count by
2:



However, this would cause us to double-count solutions where the queen in the
first row is in the center square. The following solution and its mirror image
would both be counted. And if we chose to exclude the middle square in the first
row, then neither would be counted. We want exactly one of these to be counted.



So instead, we will add a conditional filter to exclude the right half of the
second row, and this filter will only be applied when the queen in the first row
is in the middle square. We will not have to worry about the middle square in
the second row because it is in conflict with the first queen, anyway. This way,
we count exactly half of the solutions for which the first queen is in the
middle, and we can multiply that count by 2.



Here is my revised version of the algorithm that was shown above:


modifiedCountNQueensSolutions = function(n) {
  //Symmetry will not work for N=1 and N=0 because
  //the one solution's mirror image is itself
  if (n === 0 || n === 1) return 1;

  //Keeps track of the # of valid solutions
  var count = 0;

  //Helps identify valid solutions
  //Equivalent to Math.pow(2,n) - 1
  var done = (1 << n) - 1;

  //Determines the positions in the first row
  //that will be excluded from our search
  //Also applies to the second row when N is
  //odd and the first queen is in the middle
  //Equivalent to Math.pow(2, Math.floor(n/2)) - 1
  var excl = (1 << ((n/2)^0)) - 1;

  //Checks all possible board configurations
  //Added two parameters: ex1 will be used on
  //the current row, ex2 is next in line
  var innerRecurse = function(ld, col, rd, ex1, ex2) {

    //All columns are occupied,
    //so the solution must be complete
    if (col === done) {
      count++;
      return;
    }

    //Gets a bit sequence with "1"s
    //wherever there is an open "slot"
    //ex1 filters out right half of row
    var poss = ~(ld | rd | col | ex1) & done;

    //Loops as long as there is a valid
    //place to put another queen.
    while (poss) {
      var bit = poss & -poss;
      poss = poss^bit;

      //ex2 will become the next row's ex1
      //All rows after that will have ex1 = 0
      innerRecurse((ld|bit)>>1, col|bit, (rd|bit)<<1, ex2, 0);

      //After we are past the middle square in the
      //first row, disable filter for second row
      ex2 = 0;
    }
  };

  //Second row filter active only for odd N
  innerRecurse(0, 0, 0, excl, n%2 ? excl : 0);

  //Multiply count by 2
  return count<<1;
};


The major change that was made to the algorithm is the addition of the
parameters ex1 and ex2 to the innerRecurse function. These are the exclusion
filters for the current and next row of the chessboard. On our first call to
innerRecurse, ex1 is set to equal excl which is defined above to represent the
squares in the right half of the row (up to but not including the middle square
for odd N), so our first row in the chessboard will have the exclusion filter.
ex1 is then applied to the list of other conflict sequences (ld, col, rd) to
eliminate unavailable spots in poss.

ex2 is not immediately applied, but the value stored at ex2 is then passed in as
the next ex1 when we call innerRecurse again within itself, while 0 is passed in
as the next ex2 so all future calls will have ex1 equal to 0. This ensures that
the third row and beyond will not have the exclusion filter. On our first call
of innerRecurse, ex2 is set to 0 if N is even and set to excl if N is odd. This
is because we only ever want to filter on the second row if N is odd.
Furthermore, the filter should only apply to the second row when we have a queen
in the middle square of the first row. Thanks to the filter in the first row,
the first square we try in the first row for odd N is the middle square, because
bit always picks the rightmost open square. Before we move the queen in the
first row to the next square, we set ex2 = 0 right before the end of the while
loop. From then on, the filter no longer applies to the second row, since our
queen in the first row has moved past the middle square.
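To see what the filters look like as bit patterns, the sketch below recomputes excl and done with the same expressions as the code above and prints the masks for a few board sizes. The helper names firstRowMasks and secondRowAfterMiddle are made up for this illustration.

```javascript
// Hypothetical helper: the first-row filter and the squares it leaves open.
function firstRowMasks(n) {
  const done = (1 << n) - 1;
  const excl = (1 << ((n / 2) ^ 0)) - 1; // ^0 truncates n/2 to an integer
  const fmt = m => m.toString(2).padStart(n, '0');
  return { excl: fmt(excl), open: fmt(~excl & done) };
}

// Hypothetical helper: for odd n, the second-row candidates after the first
// queen is placed in the middle square, without and with the ex1 filter.
function secondRowAfterMiddle(n) {
  const done = (1 << n) - 1;
  const excl = (1 << ((n / 2) ^ 0)) - 1;
  const mid = 1 << ((n / 2) ^ 0);        // middle square of an odd-width row
  const ld = mid >> 1, col = mid, rd = mid << 1;
  const fmt = m => m.toString(2).padStart(n, '0');
  return {
    unfiltered: fmt(~(ld | rd | col) & done),
    filtered: fmt(~(ld | rd | col | excl) & done)
  };
}

console.log(firstRowMasks(4)); // { excl: '0011', open: '1100' }
console.log(firstRowMasks(5)); // { excl: '00011', open: '11100' }

console.log(secondRowAfterMiddle(5));
// { unfiltered: '10001', filtered: '10000' }
```

For N=5 with the first queen in the middle, the second row has two mirror-image candidates (10001), and the filter keeps only one of them (10000), so exactly half of the middle-square solutions get counted before the final doubling.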


THE RESULT

This solution only has three more lines of code than the first one shown above
(it seems longer because of the additional comments), and to my surprise, it
actually ran in less than half the time of the first solution. I had expected
it to take a little longer than half the time, because I figured the few extra
steps it takes to optimize on symmetry would cause each recursive call to be
slightly slower. I made a few other micro-optimizations which I didn't think
would make much of a difference, but I guess they were enough to more than make
up for any time lost on performing the extra operations for the symmetry
optimization. The micro-optimization that I believe to have made the most
difference is eliminating the bitwise operation inside the while loop condition
and, instead, performing that operation before the loop so it is not
unnecessarily repeated.
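The two loop styles can be compared side by side on a single row. This sketch only demonstrates that they visit the same squares; it makes no claim about how much time the change saves. The function names are invented for the comparison.

```javascript
// One row of a 4x4 board with conflicts on the 2nd and 4th squares.
const boardSize = 4;
const doneMask = (1 << boardSize) - 1; // 1111
const rowConflicts = 0b0101;

// Original style: poss keeps its high bits, so every loop test needs "& done".
function visitMaskingEachTime(conflictMask, done) {
  const visited = [];
  let poss = ~conflictMask;            // e.g. ~5 === -6
  while (poss & done) {
    const bit = poss & -poss;
    poss -= bit;
    visited.push(bit);
  }
  return visited;
}

// Revised style: mask once up front, then the loop test is just "poss".
function visitMaskingOnce(conflictMask, done) {
  const visited = [];
  let poss = ~conflictMask & done;     // e.g. -6 & 15 === 10 (1010)
  while (poss) {
    const bit = poss & -poss;
    poss ^= bit;
    visited.push(bit);
  }
  return visited;
}

console.log(visitMaskingEachTime(rowConflicts, doneMask)); // [ 2, 8 ]
console.log(visitMaskingOnce(rowConflicts, doneMask));     // [ 2, 8 ]
```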

Here is a time comparison between the two algorithms (in milliseconds). For each
N, I ran each algorithm 5-7 times in the Chrome browser console on my laptop,
and I chose the median time for each:
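For anyone who wants to repeat the measurement, here is one possible harness. It is an assumption about how such timings could be taken, not a record of the exact method used for the numbers in this post; median and medianTimeMs are hypothetical helper names. Date.now() only has millisecond resolution, so sub-millisecond results would need a finer timer such as performance.now() where it is available.

```javascript
// Hypothetical helper: median of an array of numbers.
function median(values) {
  const sorted = values.slice().sort((a, b) => a - b);
  const mid = Math.floor(sorted.length / 2);
  return sorted.length % 2
    ? sorted[mid]
    : (sorted[mid - 1] + sorted[mid]) / 2;
}

// Hypothetical helper: run fn(arg) `runs` times, return the median time in ms.
function medianTimeMs(fn, arg, runs) {
  const times = [];
  for (let i = 0; i < runs; i++) {
    const start = Date.now();
    fn(arg);
    times.push(Date.now() - start);
  }
  return median(times);
}

console.log(median([24, 23, 26, 24, 25])); // 24
```

With both counting functions defined in the console, a call such as medianTimeMs(countNQueensSolutions, 12, 5) would give one figure comparable to a cell in the table; results will differ by machine and browser.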

N     Unmodified Algorithm    Modified Algorithm
9     Half a ms               < Half a ms
10    1 ms                    Half a ms
11    6 ms                    2 ms
12    24 ms                   9 ms
13    114 ms                  46 ms
14    667 ms                  255 ms
15    4,077 ms                1,570 ms
16    27,152 ms               10,447 ms

For each N in the table, the modified algorithm, which takes advantage of
symmetry and a few micro-optimizations, finishes in between a third and a half
of the time that the unmodified algorithm takes. I originally intended to cut
the time in half. Mission over-accomplished!

Thanks for reading, and please feel free to leave comments or questions!
