SolrWayback bundle 4.2.3
The SolrWayback distribution is an out-of-the-box solution for exploring archived webpages in ARC/WARC format.
Runs under Windows, Linux, and macOS.
All components now run under Java 11 (Java 8 is still supported).
Download: https://github.com/netarchivesuite/solrwayback/releases/download/4.2.3/solrwayback_package_4.2.3.zip
The Solr server included in this bundle release has been patched against 'log4shell', so no further patching against 'log4shell' is required.
The standalone warc-indexer has also been patched against 'log4shell'.
No more live leaks.
From version 4.2.1, SolrWayback comes with a built-in service worker (JavaScript worker) that redirects or blocks all live leaks. This works in modern browsers.
Playback still works in legacy browsers using URL rewrites, but can leak to the live web unless an HTTP proxy or sandbox is used.
How to upgrade from a previous version:
For older versions, replace solrwayback.war in the Tomcat folder with the latest version.
Compare the properties in solrwayback.properties and solrwaybackweb.properties with yours and add any new properties that are missing. (No new properties have been added since 4.2.1.)
Patch Solr against 'log4shell', see README.md : https://github.com/netarchivesuite/solrwayback/blob/master/README.md
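The property comparison in the upgrade steps above can be done by hand, but a small script makes it harder to overlook a key. The following is a minimal sketch in Python, not part of SolrWayback: the function names are hypothetical, and the parser assumes simple `key=value` or `key:value` lines and ignores escapes and line continuations.

```python
from pathlib import Path


def load_property_keys(path):
    """Return the set of keys defined in a Java-style .properties file."""
    keys = set()
    for raw in Path(path).read_text(encoding="utf-8").splitlines():
        line = raw.strip()
        # Skip blank lines and comment lines ('#' or '!').
        if not line or line[0] in "#!":
            continue
        # The key ends at the first '=' or ':' (simplifying assumption).
        for i, ch in enumerate(line):
            if ch in "=:":
                keys.add(line[:i].strip())
                break
        else:
            # A line with no separator is a key with an empty value.
            keys.add(line)
    return keys


def missing_properties(bundle_file, local_file):
    """Keys present in the bundle's file but absent from the local copy."""
    bundle_keys = load_property_keys(bundle_file)
    local_keys = load_property_keys(local_file)
    return sorted(bundle_keys - local_keys)
```

Calling `missing_properties(bundle_copy, your_copy)` once for solrwayback.properties and once for solrwaybackweb.properties returns the keys that would need to be added to your files; an empty list means nothing is missing.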
Changes since 4.2.1:
4.2.3
Fixed in-player video player for some MP4 videos that were classified by Tika as 'application/mp4'.
Fixed log4shell vulnerability in SolrWayback bundle (Solr and warc-indexer).
4.2.2
Support for WARC record type 'resource'. This also required a fix in the warc-indexer, and 'resourcetype' was added to config3.xml (in the indexing folder).
Improved playback for Twitter API harvest (https://github.com/netarchivesuite/so-me). (This also includes changes in solrconfig.xml.)
Implemented new WARC file resolver. If WARC files are moved after they have been indexed, you can add a text file listing their new locations. Whenever a WARC file needs to be loaded and it is on the list, that location is used instead of the one indexed into Solr.
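The lookup performed by the WARC file resolver can be illustrated with a short sketch. This is not SolrWayback's implementation: the function names are hypothetical, and the sketch assumes the text file holds one full path per line and that files are matched by file name, neither of which is stated in these notes.

```python
import os


def load_relocations(list_path):
    """Map WARC file name -> new full path, read from a text file.

    Assumed format (not confirmed by the release notes): one path per line.
    """
    relocations = {}
    with open(list_path, encoding="utf-8") as fh:
        for raw in fh:
            path = raw.strip()
            if path:  # ignore blank lines
                relocations[os.path.basename(path)] = path
    return relocations


def resolve_warc_path(indexed_path, relocations):
    """Return the listed location if the file is on the list,
    otherwise fall back to the path indexed into Solr."""
    name = os.path.basename(indexed_path)
    return relocations.get(name, indexed_path)
```

With this design the index never has to be rewritten when files change location; only the text file is updated, and unlisted files keep resolving to their indexed path.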
Installation guide for SolrWayback bundle:
https://github.com/netarchivesuite/solrwayback/blob/master/README.md
(see the installation section)