Abstract
Release made in advance of radical frontier changes. Added a bandwidth throttle, an operator 'diary', settable robots expiration, crawler cookie pre-population, and the ability to change certain options mid-crawl. Many UI improvements, including UI display of critical exceptions, UI descriptions of job-order options, and improved reporting. Optimizations. Updated the httpclient lib to the 2.0 release and the jmx libs to 1.2.1. Lots of bug fixes.
Table 14. Changes
ID | Type | Summary |
---|---|---|
861861 | Add | Redirects(/refreshes) from seeds should == new seeds |
899223 | Add | Special seed-success report should be offered |
891986 | Add | Bandwidth throttle function, setting. |
877275 | Add | integrated operator 'diary' needed |
891983 | Add | IP, Robots expirations should be settable |
910152 | Add | Recovery of old jobs on WUI (re)start |
781171 | Add | parsing css |
912986 | Add | log views should give an idea of file size (where possible) |
912989 | Add | Alerts should have 'select all' button... |
856593 | Add | [load][save][turn on/off] cookies |
912201 | Add | Add levels to alerts |
896665 | Add | Split processor chains. |
896754 | Add | Show total of disregards |
903095 | Add | Show increments of megabytes in ui |
896794 | Add | serious errors (eg outofmemory) should show up in UI |
900520 | Add | Short description of ComplexTypes in user interface. |
899982 | Add | Should be possible to alter filters while crawling. |
896672 | Add | Display progress (doc/sec) with more precision |
896677 | Add | Highlight the success or failures of each seed |
896760 | Add | Prominent notification when seeds have problems |
896801 | Add | java regexps (in log view) need help text |
896778 | Add | Log viewing enhancements: |
896795 | Add | frontier, thread report improvements |
876516 | Add | default launch should nohup, save stdout/stderr |
896763 | Fix | 127.0.0.1 in job report |
896767 | Fix | Frontier retry-delay should include units (eg -seconds) |
898994 | Fix | Revisiting admin URIs if not logged in should prompt login |
899019 | Fix | Deadlock in Andy's 2nd Crawl |
767225 | Fix | Better bad-config handling |
815357 | Fix | mysterious pause facing network (DNS) problem |
896747 | Fix | ExtractorJS's report overstates its discovered URIs |
896667 | Fix | Web UI does not display correctly in IE |
896780 | Fix | console clarity/safety |
896655 | Fix | Does not respect per settings added after crawl was started. |
856555 | Fix | 'empty' records in compressed arc files |