14 лет назад · f82f3ffa4a
--- a/docs/report.tex
+++ b/docs/report.tex
@@ -352,63 +352,71 @@ it as good as possible because all occurrences are in the learning set.
 
															 To be able to use the code efficiently, we wrote a number of scripts. This
														
 
															 section describes the purpose and usage of each script. For each script it is
														
 
															-essential that you use the correct folder and subfolder naming scheme. The scheme
														
 
															-is as follows:
														
 
															+essential that you use the correct folder and subfolder naming scheme. The
														
 
															+scheme is as follows:
														
 
															 \begin{enumerate}
														
 
															-    \item A main folder called `images' placed in the current directory as the
														
 
															-    src folder.
														
 
															+    \item A main folder called `images' placed in the root directory.
														
 
															     \item In the images folder there have to be three folders.  Images, Infos
														
 
															-    and LearningSet.
														
 
															-    \item The Images and Infos folder contain subfolders which are numbered
														
 
															+    characters
														
 
															+    \item The Images and Infos folder contain subdirectories which are numbered
														
 
															     ($0001$ to possibly $9999$).
														
 
															-    \item In each of the subfolders the data (i.e the images or xml files) can
														
 
															-    be placed.  And have to be named $00991_XXXXX.ext$, where XXXXX can be
														
 
															+    \item In each of the subdirectories the data (i.e the images or xml files)
														
 
															+    can be placed. And have to be named $00991_XXXXX.ext$, where XXXXX can be
														
 
															     $00000 to 99999$.
														
 
															-    \item For-loops in the script currently only go up to 9 subfolders, with a
														
 
															-    maximum of containing 100 images or xml files. These numbers have to be
														
 
															-    adjusted if the scripts are being used, but with a bigger dataset.
														
 
															+    \item For-loops in the script currently only go up to 9 subdirectories,
														
 
															+    with a maximum of containing 100 images or xml files. These numbers have to
														
 
															+    be adjusted if the scripts are being used, but with a bigger dataset.
														
 
															 \end{enumerate}
														
 
															-It is of course possible to use your own naming scheme. A search for the
														
 
															-$filename$ variable will most likely find the occurences where the naming
														
 
															-scheme is implemented.
														
 
															-
														
 
															-
														
 
															 \subsection*{\texttt{create\_characters.py}}
														
 
															-
														
 
															+Generates a file containing character objects with their feature vectors. Also,
														
 
															+the learning set and test set files are created for the given combination of
														
 
															+NEIGHBOURS and BLUR\_SCALE.
														
 
															 \subsection*{\texttt{create\_classifier.py}}
														
 
															-
														
 
															+Generates a file containing a classifier object for the given combination of
														
 
															+NEIGHBOURS and BLUR\_SCALE. The script uses functions from
														
 
															+\texttt{create\_characters.py} to ensure that the required character files
														
 
															+exist first. Therefore, \texttt{create\_characters.py} does not need to
														
 
															+executed manually first.
														
 
															 \subsection*{\texttt{find\_svm\_params.py}}
														
 
															+Performs a grid-search to find the optimal value for \texttt{c} and
														
 
															+\texttt{gamma}, for the given combination of NEIGHBOURS and BLUR\_SCALE. The
														
 
															+optimal classifier is saved in
														
 
															+\emph{data/classifier\_\{BLUR\_SCALE\}\_\{NEIGBOURS\}.dat}, and the accuracy
														
 
															+scores are saved in in
														
 
															+\emph{results/results\_\{BLUR\_SCALE\}\_\{NEIGBOURS\}.txt}.
														
 
															+
														
 
															+Like \texttt{create\_classifier.py}, the script ensures that the required
														
 
															+character object files exist first.
														
 
															+
														
 
															+\subsection*{\texttt{run\_classifier.py}}
														
 
															+Runs the classifier that has been saved in
														
 
															+\emph{data/classifier\_\{BLUR\_SCALE\}\_\{NEIGBOURS\}.dat}. If the classifier
														
 
															+file does not exist yet, a C and GAMMA can be specified so that it is created.
														
 
															+Therefore, it is not necessary to run \texttt{create\_classifier.py} first.
														
 
															 \subsection*{\texttt{generate\_learning\_set.py}}
														
 
															 Usage of this script could be minimal, since you only need to extract the
														
 
															-letters carefully and succesfully once. Then other scripts in this list can use
														
 
															-the extracted images. Most likely the other scripts will use caching to speed
														
 
															-up the system to. But in short, the script will create images of a single
														
 
															+letters carefully and successfully once. Then other scripts in this list can
														
 
															+use the extracted images. Most likely the other scripts will use caching to
														
 
															+speed up the system to. But in short, the script will create images of a single
														
 
															 character based on a given dataset of license plate images and corresponding
														
 
															-xml files. If the xml files give correct locations of the characters they can
														
 
															-be extracted. The workhorse of this script is $plate =
														
 
															-xml_to_LicensePlate(filename, save_character=1)$. Where
														
 
															+XML files. If the XML files give correct locations of the characters they can
														
 
															+be extracted. The workhorse of this script is \texttt{plate =
														
 
															+xml\_to\_LicensePlate(filename, save\_character=1)}. Where
														
 
															 \texttt{save\_character} is an optional variable. If set it will save the image
														
 
															-in the LearningSet folder and pick the correct subfolder based on the character
														
 
															-value. So if the XML says a character is an 'A' it will be placed in the 'A'
														
 
															-folder. These folders will be created automatically if they do not exist yet.
														
 
															-
														
 
															-\subsection*{\texttt{load\_learning\_set.py}}
														
 
															-
														
 
															-
														
 
															-
														
 
															-\subsection*{\texttt{run\_classifier.py}}
														
 
															-
														
 
															-
														
 
															+in the characters folder and pick the correct subdirectory based on the
														
 
															+character value. So if the XML says a character is an 'A' it will be placed in
														
 
															+the `A' folder. These folders will be created automatically if they do not
														
 
															+exist yet.
														
 
															 \section{Finding parameters}