Application
Vault- PhpDig
PhpDig is a web spider and search engine written in PHP, using a MySQL
database and flat file support. PhpDig builds a glossary with words
found in indexed pages. On a search query, it displays a result page
containing the search keys, ranked by occurrence.
- HTTP spidering
PhpDig follows HREF links as shown by any web browser to find the
pages to index.
- Full-text indexing
PhpDig indexes all words of a document, but you can avoid common words
by defining such words in a text file.
- Indexed file types
PhpDig indexes HTML and text files by itself. PhpDig could index PDF,
MS-Word, MS-Excel, and MS-PowerPoint files if you install external
binaries on the server for this purpose.
- Other features
PhpDig tries to read a robots.txt file at the server web root, and
considers META robots tags too. The last-modified header value is
stored in the database to avoid redundant indexing. Also, the META
revisit-after tag is considered.
- Display templates
PhpDig comes with a template system that lets the search page fit
into the look of an existing site.
Source: http://www.phpdig.net/
|