BDBComp: Building a Digital Library for the Brazilian Computer Science Community*
Alberto H. F. Laender1 Marcos André Gonçalves2 Pablo A. Roberto1
1Department of Computer Science Federal University of Minas Gerais 31270-901 - Belo Horizonte - MG Brazil {laender,pabloa}@dcc.ufmg.br
ABSTRACT
This paper reports initial efforts towards building BDBComp, a digital library for the Brazilian computer science community. BDBComp is based on a number of standards (e.g., OAI, Dublin Core, SQL) as well as on new technologies (e.g., Web data extraction tools), which allowed fast and easy prototyping. The paper focuses on architectural issues and specific challenges faced during the construction of this digital library as well as on proposed solutions.
Categories and Subject Descriptors
D.3.7 [Information Systems]: Information Storage and Retrieval – Digital Libraries General Terms
Design, Economics. Keywords
Computing Digital Libraries, OAI, DL Modeling, National DLs.
1. INTRODUCTION
The last two decades has witnessed the consolidation of the Brazilian computer science (CS) community as the largest and most active one in Latin America. According to a recent census conducted by the Ministry of Education (www.inep.gov.br), the number of undergraduate programs in computer science and computer engineering in Brazil has grown from less than 20 in the early 80’s to more than 360 in 2002. The number of graduate programs also has grown at approximately the same rate and today, considering only those whose main core is computer science, there are 29 programs in the country. As a result, there has been a considerable increase in the number of theses and dissertations concluded in these programs as well in the number of papers published in international conference proceedings and journals. In addition, the Brazilian Computing Society - SBC (www.sbc.org.br) promotes and organizes about 30 events every year, most of which have official proceedings that collect a substantial part of the community’s scientific production.
Therefore, there exists a strong need for mechanisms for archiving, preserving, indexing, and disseminating the wealth of scientific knowledge produced by the Brazilian CS community. This paper reports initial efforts towards this goal, by describing the design and building of the Brazilian Digital Library of
*This work is partially supported by the I3DL Project (MCT/CNPq/ProTeM-CC grant 680154/01-9).
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. JCDL’04, June 7–11, 2004, Tucson, Arizona, USA. Copyright 2004 ACM 1-58113-832-6/04/0006…$5.00.
23 Proceedings of the 2004 Joint ACM/IEEE Conference on Digital Libraries (JCDL’04) 1-58113-832-6/04 $ 20.00 © 2004 ACM
2 Department of Computer Science
Virginia Tech Blacksburg, VA 24061USA mgoncalv@vt.edu
Computing - BDBComp (www.lbd.dcc.ufmg.br/bdbcomp/). Our focus is on architectural issues and specific challenges faced during the construction of this DL as well as on proposed solutions. BDBComp has been designed to be OAI compliant and adopts Dublin Core (DC) as its metadata standard.
2. THE BDBCOMP ARCHITECTURE
The BDBComp architecture comprises three major layers (Figure
1). The user interfaces serve as a “glue” that binds all provided services together. These are diverse interfaces specially tailored to the needs of different communities of users, among them: general users (e.g., educators, apprentices, researchers), contributors, and administrators. The services we expect to provide for general users are those usually available in any DL, such as searching and browsing, as well as more advanced ones such as filtering, recommendation, automatic linking, etc. Currently, BDBComp provides only searching, browsing, and limited linking facilities, similar to the services provided by DBLP [4].
Users Contributors A dm inistrators Interfaces Reviewers
Searching Filtering Linking S elf-A rch iv in g
Browsing
O A I P rotoco l Web-DL
W eb Sites
BDBComp (m etadata)
A d m inistratio n O AI P r otocol
Other R epositories (eg, CITIDEL, DBLP)
Figure 1: The BDBComp Architecture
Services
Repositories
In addition to general purpose services, BDBComp will provide a self-archiving service, for allowing contributors to submit metadata to the main repository, including facilities to import such data for complete conference proceedings and books in a batch mode. Reviewers will play an important role in this service since they will be responsible for approving the metadata submissions. There will also be special purpose administration services. Finally, at the bottom level we find the main repository that stores the metadata describing available resources. In addition to the self-archiving service, we envisage two other ways to collect metadata for the repository: (1) by extracting them from existing Web sites, for instance, by using tools such as the Web- DL environment [1], and (2) by harvesting other OAI complaint repositories. The former deals with a large number of sources of legacy data (e.g., conference and institutional Web sites) already existing in the Web, while the latter supplements the BDBComp information, for example, by including data from works of Brazilian authors published in international conferences and journals (e.g., harvested from DLs such as CITIDEL
BDBComp: อาคารห้องสมุดดิจิทัลบราซิลคอมพิวเตอร์วิทยาศาสตร์ชุมชน *Alberto H. F. Laender1 Marcos André Gonçalves2 ปาโบล A. Roberto11Department ของบราซิลคอมพิวเตอร์วิทยาศาสตร์กลางมหาวิทยาลัยของ Minas Gerais 31270-901 - Belo Horizonte - MG {laender,pabloa}@dcc.ufmg.brบทคัดย่อกระดาษนี้รายงานความพยายามเริ่มต้นไปทางอาคาร BDBComp ห้องสมุดดิจิตอลสำหรับชุมชนวิทยาศาสตร์คอมพิวเตอร์ที่บราซิล BDBComp อยู่ บนหมายเลขของมาตรฐาน (เช่น OAI ลินคอร์ SQL) พร้อมเทคโนโลยีใหม่ (เช่น เว็บข้อมูลแยกเครื่องมือ), ซึ่งได้ต้นแบบที่ง่าย และรวดเร็ว กระดาษเน้นในเรื่องสถาปัตยกรรมและความท้าทายเฉพาะที่ประสบในระหว่างการก่อสร้าง ของไลบรารีนี้ดิจิตอล พร้อมโซลูชั่นที่นำเสนอประเภทและตัวบอกเรื่อง[ระบบสารสนเทศ] D.3.7: เรียก – ข้อตกลงทั่วไปของห้องสมุดดิจิทัลและจัดเก็บข้อมูลออกแบบ เศรษฐศาสตร์ คำสำคัญคอมพิวเตอร์ห้องสมุดดิจิตอล OAI โมเดล DL, DLs ชาติ1. บทนำThe last two decades has witnessed the consolidation of the Brazilian computer science (CS) community as the largest and most active one in Latin America. According to a recent census conducted by the Ministry of Education (www.inep.gov.br), the number of undergraduate programs in computer science and computer engineering in Brazil has grown from less than 20 in the early 80’s to more than 360 in 2002. The number of graduate programs also has grown at approximately the same rate and today, considering only those whose main core is computer science, there are 29 programs in the country. As a result, there has been a considerable increase in the number of theses and dissertations concluded in these programs as well in the number of papers published in international conference proceedings and journals. In addition, the Brazilian Computing Society - SBC (www.sbc.org.br) promotes and organizes about 30 events every year, most of which have official proceedings that collect a substantial part of the community’s scientific production.Therefore, there exists a strong need for mechanisms for archiving, preserving, indexing, and disseminating the wealth of scientific knowledge produced by the Brazilian CS community. This paper reports initial efforts towards this goal, by describing the design and building of the Brazilian Digital Library of*This work is partially supported by the I3DL Project (MCT/CNPq/ProTeM-CC grant 680154/01-9).Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. JCDL’04, June 7–11, 2004, Tucson, Arizona, USA. Copyright 2004 ACM 1-58113-832-6/04/0006…$5.00.23 Proceedings of the 2004 Joint ACM/IEEE Conference on Digital Libraries (JCDL’04) 1-58113-832-6/04 $ 20.00 © 2004 ACM2 Department of Computer ScienceVirginia Tech Blacksburg, VA 24061USA mgoncalv@vt.eduComputing - BDBComp (www.lbd.dcc.ufmg.br/bdbcomp/). Our focus is on architectural issues and specific challenges faced during the construction of this DL as well as on proposed solutions. BDBComp has been designed to be OAI compliant and adopts Dublin Core (DC) as its metadata standard.2. THE BDBCOMP ARCHITECTUREThe BDBComp architecture comprises three major layers (Figure1). The user interfaces serve as a “glue” that binds all provided services together. These are diverse interfaces specially tailored to the needs of different communities of users, among them: general users (e.g., educators, apprentices, researchers), contributors, and administrators. The services we expect to provide for general users are those usually available in any DL, such as searching and browsing, as well as more advanced ones such as filtering, recommendation, automatic linking, etc. Currently, BDBComp provides only searching, browsing, and limited linking facilities, similar to the services provided by DBLP [4].Users Contributors A dm inistrators Interfaces ReviewersSearching Filtering Linking S elf-A rch iv in gBrowsingO A I P rotoco l Web-DLW eb SitesBDBComp (m etadata)A d m inistratio n O AI P r otocolOther R epositories (eg, CITIDEL, DBLP)Figure 1: The BDBComp ArchitectureServicesRepositoriesIn addition to general purpose services, BDBComp will provide a self-archiving service, for allowing contributors to submit metadata to the main repository, including facilities to import such data for complete conference proceedings and books in a batch mode. Reviewers will play an important role in this service since they will be responsible for approving the metadata submissions. There will also be special purpose administration services. Finally, at the bottom level we find the main repository that stores the metadata describing available resources. In addition to the self-archiving service, we envisage two other ways to collect metadata for the repository: (1) by extracting them from existing Web sites, for instance, by using tools such as the Web- DL environment [1], and (2) by harvesting other OAI complaint repositories. The former deals with a large number of sources of legacy data (e.g., conference and institutional Web sites) already existing in the Web, while the latter supplements the BDBComp information, for example, by including data from works of Brazilian authors published in international conferences and journals (e.g., harvested from DLs such as CITIDEL
การแปล กรุณารอสักครู่..
