14 years ago · 0bba6a5d64
--- a/docs/report.tex
+++ b/docs/report.tex
@@ -549,19 +549,6 @@ expectations. \\
 
															 Note: Both tests were executed using an AMD Phenom II X4 955 CPU processor,
														
 
															 running at 3.2 GHz.
														
 
															-This is not spectacular considering the amount of calculating power this CPU
														
 
															-can offer, but it is still fairly reasonable. Of course, this program is
														
 
															-written in Python, and is therefore not nearly as optimized as would be
														
 
															-possible when written in a low-level language.
														
 
															-
														
 
															-Another performance gain is by using one of the other two neighbourhoods.
														
 
															-Since these have 8 points instead of 12 points, this increases performance
														
 
															-drastically, but at the cost of accuracy. With the (8,5)-neighbourhood
														
 
															-we only need 81ms seconds to identify a character. However, the accuracy
														
 
															-drops to $89\%$. When using the (8,3)-neighbourhood, the speedwise performance
														
 
															-remains the same, but accuracy drops even further, so that neighbourhood
														
 
															-is not advisable to use.
														
 
															-
														
 
															 \section{Discussion}
														
 
															 There are a few points open for improvement. These are the following.
														
@@ -580,44 +567,56 @@ The expectation is that using a larger diameter pattern, but with the same
 
															 amount of points is worth trying. The theory behind that is that when using a
														
 
															 gaussian blur to reduce noise, the edges are blurred as well. By
														
 
															-\subsection{Context Information}
														
 
															-
														
 
															-We don't do assumption when a letter is recognized. For instance Dutch licence
														
 
															-plates exist of three blocks, two digits or two characters. Or for the new
														
 
															-licence plates there are three blocks, two digits followed by three characters,
														
 
															-followed by one or two digits. The assumption we can do is when there is have a
														
 
															-case when one digit is most likely to follow by a second digit and not a
														
 
															-character. Maybe these assumption can help in future research to achieve a
														
 
															-higher accuracy rate.
														
 
															-
														
 
															-\subsection{Speed up}
														
 
															-
														
 
															-A possibility to improve the performance speedwise would be to separate the
														
 
															-creation of the Gaussian kernel and the convolution. This way, the kernel can
														
 
															-be cached, which is a big improvement. At this moment, we calculate this kernel
														
 
															-every time a blur is applied to a character. This was done so we could use a
														
 
															-standard Python function, but we realised too late that there is performance
														
 
															-loss due to this.
														
 
															-
														
 
															-Another performance loss was introduced by checking for each pixel if it is
														
 
															-in the image. This induces one function call and four conditional checks
														
 
															-per pixel, which costs performance. A faster method would be to first set a
														
 
															-border of black pixels around the image, so the inImage function is now done
														
 
															-implicitly because it simply finds a black pixel if it falls outside the
														
 
															-original image borders.
														
 
															+\subsection{Context information}
														
 
															+
														
 
															+Unlike existing commercial license plate recognition software, our
														
 
															+implementation makes no use of context information. For instance, Dutch early
														
 
															+license plates consist of three blocks, one of two digits and two of two
														
 
															+letters. More recent Dutch plates also consist of three blocks, two digits
														
 
															+followed by three characters, followed by one or two digits. \\
														
 
															+This information could be used in an extension of our code to increase
														
 
															+accuracy.
														
 
															+
														
 
															+\subsection{Potential speedup}
														
 
															+
														
 
															+One way of gaining time-wise performance is making a smart choice of local
														
 
															+binary pattern. For instance, the (8,3)-neighbourhood has a good performance,
														
 
															+but low accuracy. The (12,8)-neighbourhood yields a high accuracy, but has a
														
 
															+relatively poor performance. As an in-between solution, the (8,5)-neighbourhood
														
 
															+can be used. This has the same time-wise performance as (8,3), but a higher
														
 
															+accuracy. The challenge is to find a combination of (number of points,
														
 
															+neighbourhood size) that suits both accuracy and runtime demands.
														
 
															+
														
 
															+Another possibility to improve the performance speed-wise would be to separate
														
 
															+the creation of the Gaussian kernel and the convolution. This way, the kernel
														
 
															+will not have to be created for each feature vector. This seems to be a trivial
														
 
															+optimization, but due to lack of time we have not been able to implement it.
														
 
															+
														
 
															+Using Python profiling, we learned that a significant percentage of the
														
 
															+execution time is spent in the functions that create the LBP of a pixel. These
														
 
															+functions currently call the \texttt{LocalBinaryPatternizer.is\_pixel\_darker}
														
 
															+function for each comparison, which is expensive in terms of efficiency. The
														
 
															+functions also call \texttt{inImage}, which (obviously) checks if a pixel is
														
 
															+inside the image. This can be avoided by adding a border around the image with
														
 
															+the width of half the neighbourhood size minus one (for example, $\frac{5 -
														
 
															+1}{2} = 2$ pixels in a $5x5$ neighbourhood). When creating the feature vector,
														
 
															+this border should not be iterated over.
														
 
															 \section{Conclusion}
														
 
															-In the end it turns out that using Local Binary Patterns is a promising
														
 
															-technique for License Plate Recognition. It seems to be relatively indifferent
														
 
															-for the amount of dirt on license plates and different fonts on these plates.
														
 
															+It turns out that using Local Binary Patterns is a promising technique for
														
 
															+license plate recognition. It seems to be relatively indifferent of the amount
														
 
															+of dirt on license plates, which means that it is robust. \\
														
 
															+Also, different fonts are recognized quite well, which means that it is well
														
 
															+suited for international use (at country borders, for example).
														
 
															-The performance speed wise is fairly good, when using a fast machine. However,
														
 
															-this is written in Python, which means it is not as efficient as it could be
														
 
															-when using a low-level languages.
														
 
															+Time-wise performance turns out to be better than one would expect from a large
														
 
															+Python program. This gives high hopes for performance in any future
														
 
															+implementation written in a C-like language.
														
 
															-We believe that with further experimentation and development, LBP's can
														
 
															-absolutely be used as a good license plate recognition method.
														
 
															+Given both of the statements above, we believe that with further
														
 
															+experimentation and development, LBP's is absolutely a valid method to be used
														
 
															+in license plate recognition.
														
 
															 \section{Reflection}
														
@@ -630,8 +629,8 @@ were and whether we were able to find a proper solution for them.
 
															 \subsubsection*{Dataset}
														
 
															 We did experience a number of problems with the provided dataset. A number of
														
 
															-these are problems to be expected in a real world problem, but which make
														
 
															-development harder. Others are more elemental problems.
														
 
															+these are problems to be expected in the real world, but which make development
														
 
															+harder. Others are more elemental problems.
														
 
															 The first problem was that the dataset contains a lot of license plates which
														
 
															 are problematic to read, due to excessive amounts of dirt on them. Of course,