Software Name:
Arch Search Engine
Version: 1.9.2
Category: Miscellaneous
Date Listed/Updated: 2022-11- 07:35:
File size: 23338 KB
OS: Win2000,WinXP,Win7 x32,Win7 x64,Windows 8,Windows 10,WinServer,WinOther,WinVista,WinVista x64
License: freeware Price($): 0
Author/Publisher name: Arkadi Kosmynin
View Full Screenshot
Description: Arch is an open source extension of Apache Nutch (a popular, highly scalable general purpose search engine) for intranet search. Not happy with your corporate search engine? Not surprising, very few people are. To the best of our knowledge, there are no intranet engines that work as well as the Google s global Web search does. There is a fundamental reason for this: the algorithms used by Google on the global Web (or similar) do not work nearly as well on intranets for the lack of statistical data. Arch (finally!) solves this problem. It uses a novel method to deliver high precision search results that works great. Don t believe it? Blind test evaluation tools are included. You can deploy Arch and compare its performance to your current search engine and or Google (on the public part of your site) using a blind test methodology.
In addition to the excellent search quality, Arch has many features critical for corporate environments:
- Document level security. Users can find only documents that they are authorized to see.
- Inexpensive index updates. Arch is able to keep indexes up to date and avoid regular complete site recrawling.
- 24 7 availabilty. There is always a working index available, even if a crawl fails.
- Support for simultaneous indexing and search of multiple web sites, with ability to search and administer any site separately, if needed. Dynamic adding and removal of web sites is easy.
- An automatically generated site directory.
- Low cost support once deployed.
- Dual interface (PHP and Java) for easy deployment and customization.
- Faceted search out of the box .
- An extensive and extensible set of parsers for parsing a variety of file formats: HTML, PHP, PDF, MS Office, Open Office, etc.
- A modular, plugin-based architecture that can be easily customized and extended.
- The source code is included.
- High performance and scalability. Arch can run on computer clusters to index very large data sets.
We have affiliation with number one software market place Share-IT\. Contact us for better pricing/customised coupon code
Use Avangate Coupoun code 548AAC3EB7 to get maximum discount. Please contact through skype: softrevu or send mail to submit@softrevu.com for better pricing
EULA
Tags: Intranet search corporate search search engine intranet search engine corporate search engine N
Is this software spam? Report Spam
Software removal request by publisher Removal Request
Software Review: Arch Search Engine Review
For publisher: Request Software Confirmation
Press release from the publisher:We compare Google Search Appliance with the enterprise search engine Arch (http: www.atnf.csiro.au computing software arch ) that we developed at CSIRO and license under a CSIRO Open Source software licence, and argue that Arch can be a good replacement for GSA. This comparison covers the essential criteria that influence the cost of the solution vs its usefulness, i.e. value.
- Scalability: both Arch and GSA can work on clusters of computers and offer unlimited scale. The difference is in the price you pay. Arch is free, GSA is $32K - just for one node.
- Cost of deployment and maintenance: both are easy to deploy and maintain, and offer almost a ?turnkey? solution in simple cases. We discussed this topic in article ?Enterprise Search Engine in 15 Minutes?? (http: www.atnf.csiro.au computing software arch ArchWebArticle3.pdf)
- Query power: GSA supports wildcard searches, spelling correction and ordering on a set of document attributes. Arch offers full power of Apache Solr and very powerful Lucene query syntax.
- Supported types of indexed documents: both GSA and Arch offer a set of parsers that cover all common document formats.
- Supported types of document sources: Both GSA and Arch are able to index non-web data, such as the contents of relational databases. Apache ManifoldCF is a connector framework providing Solr connectors that let Arch index data residing in enterprise data repositories, such as FileNet P8, Documentum, LiveLink, Meridio, Windows Shares, SharePoint, relational databases and others.
- Index completeness: With web log processing enabled, Arch is able to provide a more complete index than GSA by finding isolated web pages that normal web crawling algorithms, including those used by GSA, will not find.
- Security: both products support document level access control. Arch also supports an unlimited number of secure search gateways that can serve pre-filtered queries to narrow search for security or relevance reasons.
- Flexibility: both products have clearly defined APIs and extension points, but, being an open source software package, Arch is more modifiable, extendable, and therefore more flexible, able to accommodate virtually any custom requirements.
- Relevance of results: arguably, this is the most important criterion that makes a difference between success and failure of the search engine. See Corporate Search: Can We Just Get Google? (http: www.atnf.csiro.au computing software arch ArchWebArticle.pdf). On a set of 47 test queries and over 100K documents, Arch overperformed GSA by about 10% on average.
Arch and GSA are comparable by the criteria above. However, being open source and thus more flexible, Arch may provide a solution in some cases where GSA can t. As Arch is free, flexible, and provides at least comparable to GSA performance in relevance, the most important quality criterion for a search engine, it clearly represents much better value for money for most use cases.
Visit the Press Release for more details
Software from the publisher:Arch Search Engine,