Editor: This is the program intended for search files on the local hard drive of the computer. StopKa scan the content of selected folders and indexing all text information, which the program able to Extract - (now it support next formats: txt (all plain text format extensions [example: cpp, h]), html, doc, rtf, xls, ppt, pdf, djvu; in plans: ps, mht, chm). There is possibility to add support of your format (plugin mechanism).
Features:
high indexing speed (look at Review), momentary search, ranging the search result accordingly user demands, flexible indexing options, simple graphical user interface, imbeddable classification possibilities, free search engine.
Indexer tuning
Initially StopKa should indexing all selected folders on local hard drive, then you could search. Beside text from the file it is possible to include into the index some more information for example some information about the file - such as file name, date of creation/modification, CRC32, Md5sum, swich on/off indexing of some fields for some files formats.
Classificator tuning
In order to program automatically categorize some document to some class of documents, it is necessary accomplish series of simple Steps. Most difficult thing here is to prepare the corpus of files (amount of files separated into classes). Currently the corpus is a set of files in folders (name of the folder is a name of the class).
Using the search engine
You can freely use the resulted index file for your needs in your programs. For this there is a dll library and interface to use it. The result of the functions call is a ranging list of objects.