Commit Graph

10 Commits

Author SHA1 Message Date
Strahinja Val Markovic
ee2bfe8952 Fix for query capital chars in word-boundary (wb) ratio condition 2012-06-24 18:13:35 -07:00
Strahinja Val Markovic
d18b89bceb Fixed another bug in the word boundary condition
The problem was that we should have been using a longest common subsequence
algorithm for the "number of word boundary character matches" calculation. Our
old approach would fail for the following case:

Query: "caafoo"

Candidate1 : "acaaCaaFooGxx"
Candidate2 : "aCaafoog"

Candidate1 needs to win. This is now also a test case.
2012-06-24 17:58:20 -07:00
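
The longest-common-subsequence idea in d18b89bceb can be sketched roughly as follows. This is a minimal illustration, not the code from the commit: the function names are hypothetical, and the definition of a word-boundary character (the first character, any upper-case letter, or a character that follows punctuation) is an assumption made for the example.

```cpp
#include <algorithm>
#include <cassert>
#include <cctype>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical helper: collects the word-boundary characters of `text`,
// lower-cased. Assumed definition of a boundary: the first character,
// any upper-case letter, or a character that follows punctuation.
std::string WordBoundaryChars(const std::string &text) {
  std::string result;
  for (std::size_t i = 0; i < text.size(); ++i) {
    unsigned char c = static_cast<unsigned char>(text[i]);
    bool is_punct = std::ispunct(c) != 0;
    bool is_first = (i == 0) && !is_punct;
    bool is_upper = std::isupper(c) != 0;
    bool after_punct =
        i > 0 && !is_punct &&
        std::ispunct(static_cast<unsigned char>(text[i - 1])) != 0;
    if (is_first || is_upper || after_punct)
      result.push_back(static_cast<char>(std::tolower(c)));
  }
  return result;
}

// Classic dynamic-programming LCS length, keeping only two rows.
int LongestCommonSubsequenceLength(const std::string &a,
                                   const std::string &b) {
  std::vector<int> previous(b.size() + 1, 0);
  std::vector<int> current(b.size() + 1, 0);
  for (std::size_t i = 1; i <= a.size(); ++i) {
    for (std::size_t j = 1; j <= b.size(); ++j) {
      if (a[i - 1] == b[j - 1])
        current[j] = previous[j - 1] + 1;
      else
        current[j] = std::max(previous[j], current[j - 1]);
    }
    std::swap(previous, current);  // `previous` now holds the newest row
  }
  return previous[b.size()];
}

// Hypothetical scoring function: number of word-boundary characters of
// `candidate` that the query matches, in order, ignoring case.
int NumWordBoundaryCharMatches(const std::string &query,
                               const std::string &candidate) {
  std::string lowered_query;
  for (std::size_t i = 0; i < query.size(); ++i)
    lowered_query.push_back(static_cast<char>(
        std::tolower(static_cast<unsigned char>(query[i]))));
  return LongestCommonSubsequenceLength(lowered_query,
                                        WordBoundaryChars(candidate));
}
```

Under these assumptions, "acaaCaaFooGxx" has the boundary characters "acfg" and "aCaafoog" has "ac". The query "caafoo" shares a subsequence of length 2 ("cf") with the former and only length 1 with the latter, so Candidate1 ranks higher, as the commit message requires.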
Strahinja Val Markovic
9b875ca7f3 New sorting rule: char match index sum
The point is that we want to prefer candidates that have the query characters
"earlier" in their text, e.g. "xxabcxxx" over "xxxxxabc" for "abc" query.
2012-06-24 15:35:00 -07:00
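
One way to picture the sorting rule from 9b875ca7f3 is the sketch below. The function name is hypothetical, and the greedy leftmost, case-insensitive matching is an assumption for the example, not necessarily what the commit does. A lower sum means the query characters appear earlier in the candidate.

```cpp
#include <cassert>
#include <cctype>
#include <cstddef>
#include <string>

// Hypothetical helper: sums the candidate indices at which the query
// characters are matched, using greedy leftmost, case-insensitive
// matching. Returns -1 if the query is not a subsequence of the candidate.
int CharMatchIndexSum(const std::string &query,
                      const std::string &candidate) {
  int sum = 0;
  std::size_t pos = 0;
  for (std::size_t i = 0; i < query.size(); ++i) {
    int wanted = std::tolower(static_cast<unsigned char>(query[i]));
    // Advance to the next candidate character equal to the query character.
    while (pos < candidate.size() &&
           std::tolower(static_cast<unsigned char>(candidate[pos])) != wanted)
      ++pos;
    if (pos == candidate.size())
      return -1;  // query character not found in the remaining text
    sum += static_cast<int>(pos);
    ++pos;
  }
  return sum;
}
```

For the query "abc", "xxabcxxx" matches at indices 2, 3 and 4 (sum 9), while "xxxxxabc" matches at 5, 6 and 7 (sum 18), so sorting by ascending sum prefers the first candidate.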
Strahinja Val Markovic
adb31abef2 Fixing a bug with word-boundary ratio calculation 2012-06-24 15:11:58 -07:00
Strahinja Val Markovic
5d6e839118 Newly inserted identifiers are added immediately 2012-05-12 15:20:03 -07:00
Strahinja Val Markovic
a8e9b61f3f Going back to 2-4 threads 2012-05-11 10:09:52 -07:00
Strahinja Val Markovic
6e1c7f8429 Optimized the use of threads somewhat 2012-05-10 22:35:08 -07:00
Strahinja Val Markovic
ba76e3f6e6 List instead of vector for storing candidates
This should be faster because appending to a list is always O(1), but only
amortized O(1) when appending to a vector.
2012-05-10 21:56:19 -07:00
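
The "amortized" part of the reasoning in ba76e3f6e6 can be made visible with a small sketch. It counts how often a std::vector changes capacity (and therefore reallocates) while elements are appended; a std::list allocates one node per push_back and never relocates existing elements. The function name is hypothetical, and the exact number of reallocations depends on the standard library implementation.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper: appends `n` integers to a std::vector and counts
// how many times its capacity changes, i.e. how many reallocations occur.
// If `reserve_first` is true, capacity for `n` elements is requested
// up front.
std::size_t CountVectorReallocations(std::size_t n, bool reserve_first) {
  std::vector<int> values;
  if (reserve_first)
    values.reserve(n);
  std::size_t reallocations = 0;
  std::size_t last_capacity = values.capacity();
  for (std::size_t i = 0; i < n; ++i) {
    values.push_back(static_cast<int>(i));
    if (values.capacity() != last_capacity) {
      ++reallocations;
      last_capacity = values.capacity();
    }
  }
  return reallocations;
}
```

Without reserve(), a few of the appends trigger a reallocation that copies or moves every existing element, which is why the cost is O(1) only on average. Calling reserve() first removes those reallocations when the final size is known in advance.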
Strahinja Val Markovic
680a7ce979 Adding the SYSTEM flag for boost and python headers 2012-05-10 09:57:07 -07:00
Strahinja Val Markovic
0c0ecc24c4 Using custom collection of Boost libs 2012-05-09 21:45:30 -07:00