Horová, Chvála: Netextové objekty jako součást databáze VŠKP Brno : Systémy pro zpřístupňování eVŠKP 2009 Non-text objects as a component part of the ETDs database of AMU Iva Horová Radim Chvála
Horová, Chvála: Netextové objekty jako součást databáze VŠKP Brno : Systémy pro zpřístupňování eVŠKP Procedure of creation of documents at AMU 2.Building of the repository 3.Modifications of the repository 4.Links of the repository to the environment 5.Practical demonstration 6.And what to do next? Non-text objects as a component part of the ETDs database of AMU
3 Brno : Systémy pro zpřístupňování eVŠKP The initial state at AMU similar to other places Production of text as well as non-text materials Bachelor projects Master‘s theses Dissertation theses Seminar papers Year-end projects Semester projects And other works (teaching materials)
4 Brno : Systémy pro zpřístupňování eVŠKP 2009 Common situation at other universities Text part (obligatory) Various appendices Title Supervisor Opponents Annotation Grading …… Comparison of the situation at AMU with other universties Graduation from the studies – ONE ETD
5 Brno : Systémy pro zpřístupňování eVŠKP 2009 Situation at AMU Text part (obligatory) Graduation – „qualification performance“ i.e. MORE THAN ONE work Title Supervisor Opponents Annotation Grading ……. „Qualification performance“ 1 Various appendices Another title Another supervisor Another opponent Another annotation Different grading Other performers. and so on. „ Qualification performance“ 2 Various appendices Another title Another supervisor Another opponent Another annotation Different grading Other performers. and so on. 1. Výchozí situace na AMU Comparison of the situation at AMU with other universties Various appendices
6 Brno : Systémy pro zpřístupňování eVŠKP 2009 EXAMPLES: theoretical work + script of a play (text) theoretical work + film theoretical work + files of photographs theoretical work + roles in a theatre plays theoretical work + interpretation performance theoretical work + pedagogical output theoretical work + stage design documentation different technical quality bulk amounts of data Specifics of the final works at AMU Comparison of the situation at AMU with other universties
7 Brno : Systémy pro zpřístupňování eVŠKP 2009 KOS: basic types of ETDs: Theoretical, i.e. text-type „main“ work – type A Play, script (text- type, but not the „main“ one) – type B Film, video – type C Interpretation performance – type D Composition – type E For each type: separate form created a SEPARATE metadata record 2. Budování repositáře: Comparison of the situation at AMU with other universties Working classification of ETDs atAMU
8 Brno : Systémy pro zpřístupňování eVŠKP 2009 Vybudovat pro AMU institucionální repositář s některými archivními funkcemi. ASSIGNMNET: The aim is to create a tool to be used for quick search of the documents and easy assessment of their attractiveness and availability. To build an institutional repository providing access to the works having some archiving functions.
9 Brno : Systémy pro zpřístupňování eVŠKP 2009 Internal and external legislation Selection of types of files to be made accessible Selection of SW for the repository and its modification Workflow 2. Building of the repository
10 Brno : Systémy pro zpřístupňování eVŠKP 2009 External legislation Act No. 111/1998, § 47b – the amendment prescribes obligatory publishing of ETDs: AMU Rector’s Decree No. 2/ On publishing final works at AMU AMU Rector’s Decree No. 3/2006 – Methodology of processing, storage and making the ETDs accessible AMU Rector’s Decree No. 4/2006 – Directive on creation and formal layout of ETDs Aspect of copyright Descriptive metadata – standard MS-EVSKP (electronic theses) Standards of the bibliographic description (library) 2.1 Building of the repository – legislation
11 Brno : Systémy pro zpřístupňování eVŠKP 2009 Internal legislation Aspect of copyright : AMU enters into licence agreements with authors There are various degrees defined. The author provides rights for a specific work The rights are provided at the moment of submitting the work to the information system (KOS) The author has the right the reject access to his work – then the work will be only archived Specific rights provided are also displayed in the repository 2.1 Building of the repository – legislation
12 Brno : Systémy pro zpřístupňování eVŠKP 2009 Other aspects – internal regulations of AMU - Library catalogue is the entry point for users - The system must offer: exports to the library catalogue as well links from the catalogue to the repository search of information about related documents comfort for the „non-standard“ users - Te text work is „superior“ over other works although it is not decisive for the qualification - Hierarchy of records (mother, daughter) - Completion of the metadata and the bibliographic description 2.1 Building of the repository – legislation
13 Brno : Systémy pro zpřístupňování eVŠKP 2009 In 2008 the DSpace system adopted Advantages: Not costly (open source) Easy installation and administration, modifications, localisation Support of standards (XML, DC, METS...) Support of interoperability – OAI-PHM server It supports free as well as secured access (LDAP, …) Efficient search mechanism, as well as full text AMU is not the only university participating, there are also many other universities (web, meeting of VŠB TUO,...) Perzistent identifier - Handle 2.2 Selection of suitable SW
14 Brno : Systémy pro zpřístupňování eVŠKP 2009 Workflow metadata Starting point for the collection – Studies information system – KOS Export of the metadata, creation of a record in Dspace Assignment of the persistent identifier Handle Export to Tinlib Completion of the subject description in Tinlib (subject categories, key words, …) - librarians Adding (import) of the subject description to Dspace Making it accessible for harvesting (currently for theses.cz - MU) 2.3 Workflow
15 Brno : Systémy pro zpřístupňování eVŠKP 2009 Formats of digitalised documents Text, static image and combined documents Sound documents Video records PDF/A-1a mp3 flv, 720 x 576px D1-PAL, 1500 kbps Full versions of the non-text works will be available at the departments The selection of the formats changes – i.e. Decree of the Government of the Czech Republic No. 1338, of 3 November 2008 „VIEW“ FORMATS: 2.3 Workflow – selection of formats
16 Brno : Systémy pro zpřístupňování eVŠKP 2009 Workflow of full texts (in cooperation with the Czech Technical University in Prague) Conversion of full texts to the defined formats: texts, static images – PDF/A-1a - (standardisation necessary for full text search) tool: print2pdf – S602 Audio – mp3 – it is not a problem Video – FLV – a problem in general, but AMU tries to consider whether to use it, FAMU does not want to accept it “YouTube” - we follow the trends Upload to Dspace – currently manually Making the works accessible in accordance with the licence agreement in Dspace Full versions are not provided outside the AMU 2.3 Workflow
17 Brno : Systémy pro zpřístupňování eVŠKP 2009 Structure of the metadata Links between the existing records Extraction for the full text search (pdf) Other modifications (layout,...) 3. Modification of the Dspace repository:
18 Brno : Systémy pro zpřístupňování eVŠKP NameSpace: Dublin Core from the basic installation 2. NameSpace: AMU – elements missing to MS-EVSKP: a.Author ID b.Author‘s date of birth c.Code of the department d.Name of the department e.ID of studies to which the work belongs f.Type of work (forms A, B, C) Metadata file can be disseminated during operation 3.1 Modification of Dspace – structure of metadata DCAMU MS eVŠKP
19 Brno : Systémy pro zpřístupňování eVŠKP Modification of DSpace – structure of metadata – additional component parts
20 Brno : Systémy pro zpřístupňování eVŠKP To create a virtual object – “final part of the studies”, a fictious record, URI and to link related objects to it - To use the relations “superior” / “subordinate” work “has a part / is a part of“ 3.2 Modification of Dspace – links among related records There are several possibilities :
21 Brno : Systémy pro zpřístupňování eVŠKP To create a virtual object – “final part of the studies”, a fictious record, URI and to link related objects to it - To use the relations “superior” / “subordinate” work “has a part / is a part of” 3.2 Modification of Dspace – links among related records There are several possibilities :
22 Brno : Systémy pro zpřístupňování eVŠKP 2009 Text part (A) – SUPERIOR RECORD – dc.relation.hasPart – „Has a part“ Other types (B, C) – SUBORDINATE RECORDS - dc.relation.isPartOf - „Is a part of“ 3.2 Modification of Dspace – links among related records The component dc.relation hasPart / isPartOf used - attributes
23 Brno : Systémy pro zpřístupňování eVŠKP Modification of Dspace – links among related records Text work (A) – superior record
24 Brno : Systémy pro zpřístupňování eVŠKP Modification of Dspace – links among related records Other works (B, C) – subordinate record
25 Brno : Systémy pro zpřístupňování eVŠKP Modification of Dspace – links among related records Other works(B, C) – subordinate record Dspace – browse:
26 Brno : Systémy pro zpřístupňování eVŠKP Modification DSpace– extraction of a text for full text search
27 Brno : Systémy pro zpřístupňování eVŠKP Modification DSpace– extraction of a text for full text search Mediafilter: pdfBox pdfToText
28 Brno : Systémy pro zpřístupňování eVŠKP Modification DSpace– other modifications, English version
29 Brno : Systémy pro zpřístupňování eVŠKP Connection of the repository to its environment Interoperability – OAI-PMH Modifications for the Tinlib library system Making the metadata accessible for other harvesters Cooperation with other systems
30 Brno : Systémy pro zpřístupňování eVŠKP Interoperability - OAI PMH Dspace has its own OAI server (support of the OAI- PMH protocol) which secures displaying of the metadata retrieved in Dublin core Java plugin was modified to ensure processing of the metadata added (MS eVSKP) The modification is in the permanent part of the code, it will not be affected by any other upgrades Harvesting (curently) for „theses.cz“ (MU)
31 Brno : Systémy pro zpřístupňování eVŠKP 2009 Based on the value of the element worktype A Text work (A) – SUPERIOR RECORD Monograph Non-text work (B, C, …) – SUBORDINAL RECORD Article 4.2 Modifications for Tinlib XML file retrieved by export from Dspace is converted by means of XML /XLST technology (+processor SAXON) to an import file for Tinlib
32 Brno : Systémy pro zpřístupňování eVŠKP Making the metadata accessible for other repositores (Charles University) Dspace contains a module to display the metadata in METS/MODS format, containering of related records
33 Brno : Systémy pro zpřístupňování eVŠKP Making the metadata accessible for other repositores (Charles University) Dspace contains a module to display the metadata in the METS/MODS format, containering of related records
34 Brno : Systémy pro zpřístupňování eVŠKP Making the metadata accessible for other repositores (Charles University) A test with University Computer Centre of Charles University - DigiTool
35 Brno : Systémy pro zpřístupňování eVŠKP 2009 And now the practical part …
36 Brno : Systémy pro zpřístupňování eVŠKP 2009 Study Information System KOS Assignment of work – department Details about the Work - student Repository of AMU -> DSpace Library system librarians -> Tinlib Library system readers -> Tinweb Manual processing National registry of ETDs „theses“ MU Brno -> the public Full text - student PDF/A file UPLOAD OF RECORDS ABOUT ETDs Harvest OAI PMH
37 Brno : Systémy pro zpřístupňování eVŠKP 2009 Study Information System KOS Assignment of work – department Details about the Work - student Repository of AMU -> DSpace Library system librarians -> Tinlib Library system readers -> Tinweb Manual processing National registry of ETDs „theses“ MU Brno -> the public Full text - student PDF/A file UPLOAD OF RECORDS ABOUT ETDs Harvest OAI PMH
38 Brno : Systémy pro zpřístupňování eVŠKP 2009 Study Information System KOS Assignment of work – department Details about the Work - student Repository of AMU -> DSpace Library system librarians -> Tinlib Library system readers -> Tinweb Manual processing National registry of ETDs „theses“ MU Brno -> the public Full text - student PDF/A file UPLOAD OF RECORDS ABOUT ETDs Harvest OAI PMH
39 Brno : Systémy pro zpřístupňování eVŠKP 2009 SEARCH RECORDS Repository AMU DSpace Library system Tinweb Everything from AMU National registry ETDs- THESES Everything from the universities in the Czech Republic User Full version– text/view
40 Brno : Systémy pro zpřístupňování eVŠKP 2009 SEARCH RECORDS Repository AMU DSpace Library system Tinweb Everything from AMU National registry ETDs- THESES Everything from the universities in the Czech Republic User Full version– text/view
41 Brno : Systémy pro zpřístupňování eVŠKP 2009 Examples on lineon line…
42 Brno : Systémy pro zpřístupňování eVŠKP 2009 Next… In cooperation with the ETDs Working Group and the Dspace community : -Terminology -Archiving – the technical part -The Relations to be incorporated into the metadata standard MS-EVSKP Dspace community: -Acces rights – structure -Displaying hierarchy of records And what to do next :
43 Brno : Systémy pro zpřístupňování eVŠKP 2009 For NON-TEXTS – FULL VERSIONS?: Creative artistic activity Work of art Practical part and so on. For the WHOLE: Qualification performance Assignment for the final work (ETDs) Will these records be of interests for theses.cz ? Terminology Issues to be discussed
Horová, Chvála: Netextové objekty jako součást databáze VŠKP Brno : Systémy pro zpřístupňování eVŠKP 2009 Thank you for your attention Any question?
45 Brno : Systémy pro zpřístupňování eVŠKP 2009
46 Brno : Systémy pro zpřístupňování eVŠKP 2009
47 Brno : Systémy pro zpřístupňování eVŠKP 2009
48 Brno : Systémy pro zpřístupňování eVŠKP 2009
49 Brno : Systémy pro zpřístupňování eVŠKP 2009
50 Brno : Systémy pro zpřístupňování eVŠKP 2009
51 Brno : Systémy pro zpřístupňování eVŠKP 2009
52 Brno : Systémy pro zpřístupňování eVŠKP 2009
53 Brno : Systémy pro zpřístupňování eVŠKP 2009