configuring greenstone's self-archive tool - the depositor

19
Incremental building on Greenstone

Upload: diego-spano

Post on 18-Dec-2014

476 views

Category:

Technology


1 download

DESCRIPTION

How to configure the depositor, a web form to do self-archive documents into Greenstone collections.

TRANSCRIPT

Page 1: Configuring Greenstone's self-archive tool - The Depositor

Incremental buildingon Greenstone

Page 2: Configuring Greenstone's self-archive tool - The Depositor

Diego J. Spano 2011 - Página | 2 |

Import / BuildImport / Build

Page 3: Configuring Greenstone's self-archive tool - The Depositor

Diego J. Spano 2011 - Página | 3 |

Documents

ImportMetadata extraction

Convertion to GA Format

GA + source docs

Build

Collect.cfg(plugins)

GS Collection: indexes and browsing structures…

Collect.cfg(indexes,

classifiers)

Search ResultsCollect.cfg + macros

Page 4: Configuring Greenstone's self-archive tool - The Depositor

Diego J. Spano 2011 - Página | 4 |

$GSDLHOME

collect

demo

import archives building index etc perllib

Source documents

import.plbuildcol.pl

rm –r index/*mv building/* index

Collection is served from here

Collect.cfg

The building workflow

Page 5: Configuring Greenstone's self-archive tool - The Depositor

Diego J. Spano 2011 - Página | 5 |

demo

import archives building index etc perllib

Doc1.pdfDoc2.pdfDoc3.docDoc4.jpg

HASH0105.dirHASH017d.dirHASH63e6.dirHASHaad6.dir

(empty)

assocdtxsttstxTextbuild.cfg

collect.cfg

The building workflow

Page 6: Configuring Greenstone's self-archive tool - The Depositor

Diego J. Spano 2011 - Página | 6 |

DepositorDepositor

Page 7: Configuring Greenstone's self-archive tool - The Depositor

Diego J. Spano 2011 - Página | 7 |

The Depositor

Edit /greenstone/etc/main.cfg and add:

depositor enabled

Page 8: Configuring Greenstone's self-archive tool - The Depositor

Diego J. Spano 2011 - Página | 8 |

The Depositor

You can choose any collection for upload the document but…

You need a user/password created by the GS administrator

Page 9: Configuring Greenstone's self-archive tool - The Depositor

Diego J. Spano 2011 - Página | 9 |

The Depositor

You can choose any collection for upload the document but…

You need a user/password created by the GS administrator

Page 10: Configuring Greenstone's self-archive tool - The Depositor

Diego J. Spano 2011 - Página | 10 |

The Depositor

Now we have access to the Form…

Page 11: Configuring Greenstone's self-archive tool - The Depositor

Diego J. Spano 2011 - Página | 11 |

The Depositor

We select the file to upload…

And Confirm before uploading the item

Page 12: Configuring Greenstone's self-archive tool - The Depositor

Diego J. Spano 2011 - Página | 12 |

The Depositor

There are two options for the last step. We can choose one of them just editing the macro file “deposit.dm”:

_laststep_ {bild}

or

_laststep_ {depositonly}

Page 13: Configuring Greenstone's self-archive tool - The Depositor

Diego J. Spano 2011 - Página | 13 |

The Depositor

If we select _laststep_ {bild}

then Depositor will build the collection

every time a document is uploaded.

Page 14: Configuring Greenstone's self-archive tool - The Depositor

Diego J. Spano 2011 - Página | 14 |

The Depositor

If we select _laststep_{depositonly} then Depositor will only save the document in

import folder.We have to build the

collection when we want to update the index.

Page 15: Configuring Greenstone's self-archive tool - The Depositor

Diego J. Spano 2011 - Página | 15 |

The Depositor

How can we build the collection to get the uploaded documents indexed???

We can use GLI

Or

We can use Command Line

Page 16: Configuring Greenstone's self-archive tool - The Depositor

Diego J. Spano 2011 - Página | 16 |

The DepositorIn GLI CREATE panel Minimal Rebuild option

Page 17: Configuring Greenstone's self-archive tool - The Depositor

Diego J. Spano 2011 - Página | 17 |

C:\> cd "C:\greenstone"

C:\greenstone> setup.bat

The first time we will make a full build

C:\greenstone> perl -S import.pl coll_name

C:\greenstone> perl -S buildcol.pl coll_name

Next time we run only incremental process

C:\greenstone> perl -S incremental-rebuild.pl coll_name

The DepositorOr… open a terminal window and …

Page 18: Configuring Greenstone's self-archive tool - The Depositor

Diego J. Spano 2011 - Página | 18 |

Depositor will only work with Apache web server. No way with local library server.

Incremental build needs Lucene as indexer (no MG or MGPP)

Before doing an incremental indexing we have to run a full import/build process

Incremental build through command line can be scheduled in cron (Linux) or scheduled task (Windows)

The DepositorThings to be careful about

Page 19: Configuring Greenstone's self-archive tool - The Depositor

Diego J. Spano 2011 - Página | 19 |

THANKS !!!

Diego [email protected]

www.prodigioconsultores.com