Difference between revisions of "OAI manual Set up the harvest"

From Archives Portal Europe Wiki
Jump to: navigation, search
(select the set that you want to harvest)
(select the set that you want to harvest)
Line 33: Line 33:
 
<br clear=all>
 
<br clear=all>
  
In this example you have 4 datasets, let's choose 1: naa1, the dataset containing all Nationaal Archief's finding aids in the category 1.x.xx, meaning: finding aids of governmental archives before 1795.
+
In this example you have 4 datasets, let's choose 1: naa1, the dataset containing all Nationaal Archief's finding aids in the category 1.x.xx, meaning: finding aids of governmental archives from before the year 1795.
  
 
== select the FROM and TO dates ==
 
== select the FROM and TO dates ==

Revision as of 21:27, 18 July 2018

To set up the harvest, you just have to follow the instructions displayed on the screen. How does the tool function? It sends the requests to the repository by using the normal [OAI-PMH syntax] (beginning with the first request: the verb Identify) and proposes the choices between the different possibilities offered by the repository as soon as it receives the answers.



indicate the address of your repository

The first question the tool will ask you is the url of the OAI-PMH server. This url or web address must include the prefix: http or https, for example: http://www.gahetna.nl/archievenoverzicht/oai-pmh

OAI Harvester manual, figure 4


The tool then asks you to indicate whether your are using a proxy server. In some network environments access to the internet is secured via a proxy server. If that is the case, then enter the url or web address of the proxy server (ask the administrator of your network environment about this). In case you don't use a proxy server, for example in case you use the tool at home, then you can skip answering this question by pressing the enter key.

OAI Harvester manual, figure 5


The harvester begins its dialogue with the repository by sending the request verbs and providing the according answers: list of metadata, list of sets, etc.

select the type of metadata that you want to harvest

The tool lists the types of metadata found in the repository and asks you to select one of them:

OAI Harvester manual, figure 6


In this example, data are provided in three different types of metadata: oai_dc (Dublin Core/XML), oai_ead (a short basic version of an EAD/XML finding aid), and oai_ead_full (the complete full version of an EAD/XML finding aid). Let's choose 3: oai_ead_full.

select the set that you want to harvest

Then the tool lists the datasets found in the repository with the chosen metadata format and gives them an arbitrary number to allow you to choose one. Please note that you can harvest only one dataset at a time, so if you want to harvest everything or more than one dataset, then you have go through this whole process per dataset.

OAI Harvester manual, figure 7


In this example you have 4 datasets, let's choose 1: naa1, the dataset containing all Nationaal Archief's finding aids in the category 1.x.xx, meaning: finding aids of governmental archives from before the year 1795.

select the FROM and TO dates

select the harvest method

select the type of records that you want to save

start the harvest

retrieve the files