In-Q-Tel executive interview, KMWorld "What's nice about Bohica Associates' BACEngine ; an Innovative Data Search Engine, is that with the directed search engine and all of its underlying analytics, it can really deliver a lot of very useful information in ways that a normal search engine could never produce. Bohica Associates' BACEngine represents a leap ahead."
Bohica Associates Corporation tm BACEngine tm
Innovative Data Search Engine Full-text indexing and search of content through XML standards December 2005 The evolution of search technologies, XML,(Extensible Markup Language) is an openstandard, and the world wide web have created a combination of standards that enable dynamic construction of web content through light-weight queries and results. Bohica Associates Corporation has developed a search product, called the BACEngine that leverages these standards and exposes a simple interface to support the index and search of a variety of content through industry standard tools. The basic flow of technology into the BACEngine, Data Search Engine is depicted below.
BACEngine:


IT Consulting / Data Integration

The calling application constructs an XML string to encapsulate the query.
This string contains the following:
Search index to query (e.g. my.db)
Starting hit offset advance for the return set (e.g. 0)
Ending hit offset to return (e.g. 20)
The text query (with our without field searches)
The desired sort (ascending, descending, or unique instances only)
The calling application will either use a socket connection or web service to initiate a search session and pass a query to the engine. The BACEngine is a multi-threaded application that will support multiple simultaneous queries and will return the result in an XML stream that contains the indexed fields for the hits plus the normalized score associated with each hit. In the .NET framework, this XML stream can be bound to a dataset and then exposed in a control in the web page. A web service interface is available for SOAP protocols if desired and is commonly used in apache web servers.
A Boolean thesaurus is available to expand or limit query terms. For example, a thesaurus entry of: agriculture=farming or crop, will expand the term agriculture to be the Boolean union of agriculture, farming, and crop. This allows the search engine to tune the vocabulary of each search index to the application it supports.
Indexing is performed through utilities that can provide incremental or full indexing of the underlying application data while the system is live. Often, the reindex is performed on a schedule and is automated through the windows at command or the UNIX cron command. BAC has timed the indexing of 200,000 SQL Server records on a windows machine to be completed in a few seconds.
BAC also uses this approach to drive our voice interface solutions and services. Any data application that currently has a web interface may be exposed through a speech recognition engine and a toll-free telephone number to allow for transaction processing through a telephone interface. In this instance, the XML is translated to VoiceXML so that is commercial text-to-speech engines may perform speech synthesizing. These engines are capable of producing web GET and POST commands and allow the BAC engine to be the gateway from a speech engine to a data repository with a small amount of custom web page development to format the XML queries. Online catalogue and credit card processing modules have been developed and are available to be delivered as a quick start service package.
The BACEngine has been deployed on CDROM and DVD mediums to enable the indexing, compression, and distribution of content in a stand-alone web application, which is customizable using the open source .NET framework from www.go-mono.com.
BAC tests and distributes a version of mono on the required platforms with a platform specific launcher application for each platform requested by the client. This solution is tested for security and vulnerability using Nessus and other leading hacker tool suites.
![]()

Additionally, Bohica Associates Corporation solves data sharing problems that have plagued our governmental clients for decades. The U.S. Department of Homeland Security (DHS) and the U.S. Department of Justice (DOJ) announced a partnership in February 2005 to develop a single standard foundation for information exchange. The official version of the National Information Exchange Model (NIEM) is scheduled for release by June 1, 2006. NIEM is an enhanced version of the DOJ-sponsored Global Justice XML Data Model (GJXDM) that was released for official use in 2004. The initiative is intended to enable the entire justice and public safety community to share information at the local, state, tribal and national levels. NIEM is an interagency initiative to provide the foundation and building blocks for national-level interoperable information sharing and data exchange. XML (Extensible Markup Language) is an open standard that will support NIEMs ability to permit agencies to exchange data, regardless of computer system or platform.
Bohica Associates Corporations search product, called the BACEngine, leverages these standards and exposes a simple interface to support the index and search of a variety of content through industry standard tools.
Demonstrations of all these technologies are available at the BAC web site,
and free consultations are available upon request.
"Bridging the gap between technology and real world results"
Bohica Associates Corporation
IT Consulting / Data Integration