How to install heritrix3

Use svn, checkout the project from the sourceforget.net on https: / / archive-crawler.svn.sourceforge.net/svnroot/archive-crawler/trunk/heritrix3

Especially if you're customizing Heritrix (as seems to be the case from
setting up a dev environment), you should be basing your work off of
Heritrix 3.0.0/heritrix3 trunk (aka 'H3').

H3 is the main focus of our development going forward, and its
Spring-based configuration offers easier opportunities for incremental
extension.

It's also best to work from an SVN checkout, as the working source tree
has Eclipse project-support files (. project,. classpath) as used by the
Heritrix core team.

So my suggestions would be:

- Discard any prior projects

- Make sure your Eclipse install includes SVN and Maven support

- Create a new project, SVN-> "Checkout projects from SVN", using URL

https: / / archive-crawler.svn.sourceforge.net/svnroot/archive-crawler/trunk/heritrix3

- Attempt one Maven2 install build from that checkout, to trigger
population of your local M2_REPO with all necessary 3rd-party libraries

- If Eclipse seems not to recognize paths it should, try one or all of:
- 'Refresh' menupick on project
- Restarting Eclipse
- Toggling the 'build automatically' or 'clean ...' options

These Ubuntu-centric notes from my colleague Steve may be helpful,
though they are still explicitly only regarding H1/H2:

https: / / webarchive.jira.com / wiki / display / ~ siznax / Heritrix + in + Eclipse

If anyone can verify / update these prior guides to work with H3, bringing
a developer from ground state to a working Eclipse H3 dev project,
that'd be greatly appreciated.

猜你喜欢

转载自sharehua.iteye.com/blog/1744428
今日推荐