Merge branch 'master' of github.com:taddeus/licenseplates

ad483f62 · Richard Torenvliet · f2cbff11 · f96d6652 · ad483f62
Commit ad483f62 authored Dec 21, 2011 by Richard Torenvliet
Show whitespace changes
Inline Side-by-side

Showing with 40 additions and 38 deletions

docs/report.tex docs/report.tex +40 -38

No files found.
--- a/docs/report.tex
+++ b/docs/report.tex
@@ -162,20 +162,19 @@ The outcome of this operations will be a binary pattern. Note that the
 mathematical expression has the same effect as the bit shifting operation that
 we defined earlier.
-\item Given this pattern, the next step is to divide the pattern in cells. The
+\item Given this pattern for each pixel, the next step is to divide the image
-amount of cells depends on the quality of the result, so trial and error is in
+into cells.
-order. Starting with dividing the pattern in to cells of size 16.
 \item Compute a histogram for each cell.
 \begin{figure}[H]
    \center
    \includegraphics[scale=0.7]{cells.png}
-    \caption{Divide in cells(Pietik\"ainen et all (2011))}
+    \caption{Divide into cells (Pietik\"ainen et all (2011))}
 \end{figure}
-\item Consider every histogram as a vector element and concatenate these. The
+\item Consider every histogram a vector element and concatenate all histograms.
-result is a feature vector of the image.
+The concatenation is the feature vector of the image.
 \item Feed these vectors to a support vector machine. The SVM will ``learn''
 which vectors to associate with a character.
@@ -329,25 +328,28 @@ For the classification, we use a standard Python Support Vector Machine,
 \texttt{libsvm}. This is an often used SVM, and should allow us to simply feed
 data from the LBP and Feature Vector steps into the SVM and receive results.
-Using a SVM has two steps. First, the SVM has to be trained, and then it can be
-used to classify data. The training step takes a lot of time, but luckily
-\texttt{libsvm} offers us an opportunity to save a trained SVM. This means that
+Usage a SVM can be divided in two steps. First, the SVM has to be trained
-the SVM only has to be changed once.
+before it can be used to classify data. The training step takes a lot of time,
+but luckily \texttt{libsvm} offers us an opportunity to save a trained SVM.
-We have decided to only include a character in the system if the SVM can be
+This means that the SVM only has to be created once, and can be saved for later
-trained with at least 70 examples. This is done automatically, by splitting the
+usage.
-data set in a learning set and a test set, where the first 70 examples of a
-character are added to the learning set, and all the following examples are
+We have decided only to include a character in the system if the SVM can be
-added to the test set. Therefore, if there are not enough examples, all
+trained with 70 examples. This is done automatically, by splitting the data set
-available examples end up in the learning set, and non of these characters end
+in a learning set and a test set, where the first 70 occurrences of a character
-up in the test set, thus they do not decrease our score. However, if this
+are added to the learning set, and all the following are added to the test set.
-character later does get offered to the system, the training is as good as
+Therefore, if there are not enough examples, all available occurrences end up
-possible, since it is trained with all available characters.
+in the learning set, and non of these characters end up in the test set. Thus,
+they do not decrease our score. If such a character would be offered to the
+system (which it will not be in out own test program), the SVM will recognize
+it as good as possible because all occurrences are in the learning set.
 \subsection{Supporting Scripts}
-In order to work with the code, we wrote a number of scripts. Each of these
+To be able to use the code efficiently, we wrote a number of scripts. This
-scripts is named here and a description is given on what the script does.
+section describes the purpose and usage of each script.
 \subsection*{\texttt{create\_characters.py}}
@@ -378,18 +380,18 @@ scripts is named here and a description is given on what the script does.
 Now that we have a functioning system, we need to tune it to work properly for
 license plates. This means we need to find the parameters. Throughout the
 program we have a number of parameters for which no standard choice is
-available. These parameters are:\\
+available. These parameters are:
-\\
 \begin{tabular}{l|l}
-	Parameter 			& Description\\
+	Parameter 			& Description \\
 	\hline
-	$\sigma$  			& The size of the Gaussian blur.\\
+	$\sigma$  			& The size of the Gaussian blur. \\
 	\emph{cell size}	& The size of a cell for which a histogram of LBP's
-	                      will be generated.\\
+	                      will be generated. \\
-	\emph{Neighbourhood}& The neighbourhood to use for creating the LBP.\\
+	\emph{Neighbourhood}& The neighbourhood to use for creating the LBP. \\
-	$\gamma$			& Parameter for the Radial kernel used in the SVM.\\
+	$\gamma$			& Parameter for the Radial kernel used in the SVM. \\
 	$c$					& The soft margin of the SVM. Allows how much training
-						  errors are accepted.\\
+						  errors are accepted. \\
 \end{tabular}
 For each of these parameters, we will describe how we searched for a good
@@ -446,7 +448,7 @@ reached with the following neighbourhood, which we will call the
 \subsection{Parameters $\gamma$ \& $c$}
 The parameters $\gamma$ and $c$ are used for the SVM. $c$ is a standard
-parameter for each type of SVM, called the 'soft margin'. This indicates how
+parameter for each type of SVM, called the `soft margin'. This indicates how
 exact each element in the learning set should be taken. A large soft margin
 means that an element in the learning set that accidentally has a completely
 different feature vector than expected, due to noise for example, is not taken
@@ -463,7 +465,7 @@ the highest score is then used as our parameters, and the entire SVM will be
 trained using those parameters.
 The results of this grid-search are shown in the following table. The values
-in the table are rounded percentages, for easy displaying.
+in the table are rounded percentages, for better readability.
 \begin{tabular}{|r|r r r r r r r r r r|}
 \hline
@@ -493,10 +495,10 @@ $2^{13}$ &       90 &       92 &       92 &       92 &       92 &
 $2^{15}$ &       92 &       92 &       92 &       92 &       92 &
       92 &       93 &       93 &       86 &       45\\
 \hline
-\end{tabular}
+\end{tabular} \\
-We found that the best values for these parameters are $c = 32$ and
+The grid-search shows that the best values for these parameters are $c = 2^5 =
-$\gamma = 0.125$.
+32$ and $\gamma = 2^{-3} = 0.125$.
 \section{Results}
@@ -535,9 +537,9 @@ there.
 The speed of a classification turned out to be reasonably good. We time between
 the moment a character has been 'cut out' of the image, so we have a exact
 image of a character, to the moment where the SVM tells us what character it
-is. This time is on average $65$ ms. That means that this
+is. This time is on average $65ms$. That means that this technique (tested on
-technique (tested on an AMD Phenom II X4 955 CPU running at 3.2 GHz)
+an AMD Phenom II X4 955 CPU running at 3.2 GHz) can identify 15 characters per
-can identify 15 characters per second.
+second.
 This is not spectacular considering the amount of calculating power this CPU
 can offer, but it is still fairly reasonable. Of course, this program is