The query function you’ll write, select
, is loosely modeled on the SELECT
statement from Structured Query Language (SQL). It’ll take five keyword parameters: :from
, :columns
, :where
, :distinct
, and :order-by
. The :from
argument is the table
object you want to query. The :columns
argument specifies which columns should be included in the result. The value should be a list of column names, a single column name, or a **T**
, the default, meaning return all columns. The :where
argument, if provided, should be a function that accepts a row and returns true if it should be included in the results. In a moment, you’ll write two functions, matching
and in
, that return functions appropriate for use as arguments. The :order-by
argument, if supplied, should be a list of column names; the results will be sorted by the named columns. As with the :columns
argument, you can specify a single column using just the name, which is equivalent to a one-item list containing the same name. Finally, the :distinct
argument is a boolean that says whether to eliminate duplicate rows from the results. The default value for :distinct
is **NIL**
.
Here are some examples of using select
:
Of course, the really interesting part of select
is how you implement the functions extractor
, row-equality-tester
, and row-comparator
.
As you can tell by how they’re used, each of these functions must return a function. For instance, project-columns
uses the value returned by extractor
as the function argument to **MAP**
. Since the purpose of is to return a set of rows with only certain column values, you can infer that extractor
returns a function that takes a row as an argument and returns a new row containing only the columns specified in the schema it’s passed. Here’s how you can implement it:
The functions row-equality-tester
and row-comparator
are implemented in a similar way. To decide whether two rows are equivalent, you need to apply the appropriate equality predicate for each column to the appropriate column values. Recall from Chapter 22 that the **LOOP**
clause always
will return **NIL**
as soon as a pair of values fails their test or will cause the **LOOP**
to return **T**
.
Ordering two rows is a bit more complex. In Lisp, comparator functions return true if their first argument should be sorted ahead of the second and **NIL**
otherwise. Thus, a **NIL**
can mean that the second argument should be sorted ahead of the first or that they’re equivalent. You want your row comparators to behave the same way: return **T**
if the first row should be sorted ahead of the second and **NIL**
otherwise.
But if the column comparator returns **NIL**
, then you need to determine whether that’s because the second value should sort ahead of the first value or because they’re equivalent. So you should call the comparator again with the arguments reversed. If the comparator returns true this time, it means the second column value sorts ahead of the first and thus the second row ahead of the first row, so you can return immediately. Otherwise, the column values are equivalent, and you need to move onto the next column. If you get through all the columns without one row’s value ever winning the comparison, then the rows are equivalent, and you return **NIL**
. A function that implements this algorithm looks like this: