Home > Windows Server Tips > > Desktop search engine uses open source tools
Windows Server Tips:
EMAIL THIS
 TIPS & NEWSLETTERS TOPICS 


Desktop search engine uses open source tools


Serdar Yegulalp, Contributor
03.28.2006
Rating: -2.50- (out of 5)


Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   


As this site's resident desktop management expert, I've written about a few desktop search engines, such as the Google Desktop search application, as well as my current favorite, the Microsoft Outlook plug-in Lookout.

Relatively new to the game is DocSearcher 3.88, a desktop search and indexing tool written entirely in Java. DocSearcher 3.88 uses several open source programming tools -- the PDF Box, Lucene and POI Apache APIs -- to search and index many of the most common types of documents. The tool supports searching HTML, Word, Excel, RTF, PDF, OpenOffice/StarOffice and plain-text documents, and it performs searches within the body or supported metadata (document name, author, etc.) for documents that support it.

The program can also spider a remote Web site and create a locally searchable directory of that site.

DocSearcher requires no installation, only the presence of the Java Runtime Environment, so it can run directly from any directory. After you launch the program's .JAR file, you'll want to create an index for a given document directory. If you already have an index produced by an instance of DocSearcher, you can import it rather than regenerate it from scratch. Indexes can be updated on demand


Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   


RELATED RESOURCES
2020software.com, trial software downloads for accounting software, ERP software, CRM software and business software systems
Search Bitpipe.com for the latest white papers and business webcasts
Whatis.com, the online computer dictionary


, when the program launches or after the index has aged a certain number of days. The program's search interface produces a report that can be saved as an HTML file.

The program can also create a self-contained index that you can place on a CD-ROM or DVD along with the application itself, meaning you can essentially package a set of documents on a disk with its own search engine. The program will send a notification in e-mail whenever a given index is updated. It also supports third-party document handlers, so people can write their own search handlers and implement them as needed.

The program still has a few limitations. For one, it cannot sort search results. Another is that it does not yet search Outlook .PST files (although this capability may be added down the road). The tool's author has stated plans to add support for generic XML files, Microsoft Project documents and metadata from common multimedia files (such as MP3 ID3 tags) as well.


Serdar Yegulalp is editor of the Windows Power Users Newsletter. Check it out for the latest advice and musings on the world of Windows network administrators -- and please share your thoughts as well!

More information from SearchWinSystems.com

Rate this Tip
To rate tips, you must be a member of SearchWindowsServer.com.
Register now to start rating these tips. Log in if you are already a member.




DISCLAIMER: Our Tips Exchange is a forum for you to share technical advice and expertise with your peers and to learn from other enterprise IT professionals. TechTarget provides the infrastructure to facilitate this sharing of information. However, we cannot guarantee the accuracy or validity of the material submitted. You agree that your use of the Ask The Expert services and your reliance on any questions, answers, information or other materials received through this Web site is at your own risk.



Server Room Design - Planning, Cooling, Maintenance
HomeTopicsBlogsITKnowledge ExchangeTipsAsk the ExpertsMultimediaWhite PapersIT Downloads
About Us  |  Contact Us  |  For Advertisers  |  For Business Partners  |  Site Index  |  RSS
SEARCH 
TechTarget provides technology professionals with the information they need to perform their jobs - from developing strategy, to making cost-effective purchase decisions and managing their organizations' technology projects - with its network of technology-specific websites, events and online magazines.

TechTarget Corporate Web Site  |  Media Kits  |  Site Map




All Rights Reserved, Copyright 2004 - 2009, TechTarget | Read our Privacy Policy
  TechTarget - The IT Media ROI Experts