Currently the amount of web data ha

Currently the amount of web data has

increased excessively. Its metadata is widely used in

order to fully exploit web information resources. The

Semantic Web is a Web of data that the World Wide

Web Consortium has the vision to provide a common

framework that allows data to be shared and reused

across applications and enterprises. Thus, there is

the need for the definition of the relations among data
that allows a better and automatic interchange of data.

Resource Description Framework (RDF), which is one

of the fundamental building blocks of the Semantic

Web, gives a formation definition for the interchange

of data. It is a standard for describing web resources.

RDF data is in the form subject-predicate-
object which is called triples. The subject describes

the resource while the predicate is the relation or

property between the subject and the object.

For example, one way to represent the notion
“The woman has the sweets” in RDF is as the triple:

a subject denoting “the woman”, a predicate denoting

“has”, and an object denoting “the sweets.”

Many types of storage engines are designed

and evaluate for triples. One of those types is a

triple store which is a purpose-built database for the

storage and retrieval of triples. Queries on these

triples are in SPARQL, which is a language designed

specifically to query RDF databases. The efficiency

of RDF data analysis depends on the performance of

RDF storage and query engine.

Traditional RDF database systems query

data from native RDF stores or from relational

database systems. The motivation for such native

RDF-specific stores is that the relational model is not

particularly suitable towards storage and retrieval of

RDF data because RDF is a graph data model.

However, relational database systems are equipped

with mature optimization techniques for storing and

querying data.

NoSQL database is another type of database

that is not relational database and not use SQL to

query the data. NoSQL database has the data model

that can divide into four types which are document

database using JSON data format, key-value

database, column store database, and graph

database. NoSQL database has different

characteristics from relational databases, such as

schema-free and replication support. The motivation

for this approach includes the simplicity of design

and the horizontal scaling for supporting big data.

Recently, NoSQL databases have been more

successful than traditional relational database

systems for the ability inprocessing big data on the

cloud effectively [1]. In NoSQL databases,to gain

performance, ACID (Atomic, Consistency, Isolation,

and Duration), which is a set of properties that

guarantee that database transactions are processed

reliably, is sacrificed [2]. However, the advocates of
NoSQL databases argue that they should rather enforce

the triple of requirements including consistency (C),

availability (A) and partitioning tolerance (P), shortly

CAP [1].

of the question how to process web data quickly.

Thus, we propose a method to exploit a NoSQL

database, specifically MongoDB, to store and query

RDF. MongoDB is chosen because it is one of widely

used NoSQL databases. The system first invokes

NoSQL API to retrieve MongoDB data in JSON format.

Then, the JSON parser module converts JSON

data to RDF data. We evaluate our design and

implementation by using the Berlin SPARQL

Benchmark, which is one of the most widely accepted

benchmarks for comparing the performance of three

RDF storage systems which include Apache Jena

TDB (native RDF store), MySQL (relational database),

and MongoDB (NoSQL database).

data management research. Bizer and Schultz [3]

proposed the Berlin SPARQL Benchmark (BSBM) for

comparing the performance of native RDF stores

(Sesame, Virtuoso, Jena TDB, and Jena SDB),

SPARQL-to-SQL rewriters (D2R Server and Virtuoso

RDF Views), and relational database management

systems (MySQL and Virtuoso RDBMS). The rewriting

approach outperformed native RDF storage with the

increasing dataset. The other important result was

that relational database management systems were

faster than the SPARQL-to-SQL rewriters. The authors

of this related paper explained that RDF stores might

not have a mature optimization technique as SQL

query engines had. Our paper uses the BSBM

benchmark to evaluate RDF storage systems but we

also propose the approach to use and evaluate

NoSQL database as a RDF data query processing

system.
There has been some work on querying RDF

data from NoSQL databases [4-6]. Cudre-Mauroux

et al. [4] made the first attempt at characterizing and

comparing NoSQL stores and native RDF stores for

RDF processing. They used the Berlin SPARQL

Benchmark and the DBpedia SPARQL Benchmark to

evaluate and compare a native RDF store (4store)

with four NoSQL databases which included Jena+H-
Base, Hive+HBase, CumulusRDF, and Counchbase.

All experiments were performed on the Amazon EC2

Elastic Compute Cloud infrastructure. Based on the

experimental results, NoSQL systems, such as

Jena+HBase, processed simple SPARQL queries

more efficiently than native RDF stores, such as 4store.

On the other hand, for more complex SPARQL queries

requiring several many joins and filters, NoSQL

systems took longer time than 4store. Although both

this related work and our work compare NoSQL

systems and native RDF systems, but our paper also

evaluates the performance of a relational base

database system as well.

Angles and Gtierrez studied the RDF model

from a database perspective and compared it with

other database models [5]. However, they did not

implement and evaluate a graph database for storying

and querying RDF data like we do. Lately, Bendar et

al. [6] performed the comparison of RDF databases,

NoSQL databases, and relational databases for the

Semantic Web applications with their own developed

benchmark. However, they did not provide the

analysis the types of queries for which each database

was suitable.

Sequeda and Miranker [7] chose to execute

SPARQL queries on RDF representation of the legacy

relational data by implementing the system called

Ultrawrap. Ultrawrap encoded a logical representation

of the database as an RDF graph using SQL views

and a translation of SPARQL queries to SQL queries.

To improve query execution time, detection of
unsatisfiable conditions and self-join elimination could

be applied to the SQL from the translations of

SPARQL queries.

Alexaki et al. [8] presented the ICS-FORTH

RDFSuite, a suite of tools for RDF validation, store,and

querying. They proposed the design of a persistent

RDF store (RSSDB) for loading resource descriptions

in an Object Relational Database Management

System (ORDBMS) by using RDF schema knowledge.

They also presented RQL as a declarative language

for querying both RDF descriptions and schemas.

However, they did not compare their proposed system

with other database systems and did not use a

standard benchmark like BSBM.

Several researchers have attempted to

design and develop RDF storage and query engine

using relational DBMSs [9-11]. Harris et al. [9]

proposed 3store as a RDF storage and query engine

and extended it to support SPARQL query interface

[10]. However, 3store had not been evaluated and

compared with other systems [9-10]. Jena1 [11] and

Jena2 [12] are popular Semantic Web programmers’

toolkits that have been downloaded for several

thousand times. Jena1 is an open-source project,

implemented in Java, and available for download for

free. Its core is the capability in manipulating RDF

graphs. Jena2 was extended to support multiple and

flexible presentations of RDF graphs and to provide

a simple minimal list view of the RDF graph to the

application programmers.

There are several works about scalable RDF

engines for storing, indexing, and querying [13-16].

The main focus of Jena2 was to improve the

performance and scalability due to these problems:

too many joins, single statement table, reification

storage bloat, and query optimization [13]. To address

these issues, the Jena2 schema design supported a

denormalized schema for storing resource URIs and

simple literal values directly in the statement table. In
addition, to improve performance through locality and

caching, Jena2 also supported the use of multiple

statement tables.

Sesame [14] was one of the first architectures

which its aim was for efficient storing and querying a

large amount of RDF data. However, there were some

unsupported operations, such as aggregates [15].

Also, implementing triple store directly in PostgresSQL

was faster than that of Sesame’s interfaces and

SeRQL [15]. Abadi et al. [15] proposed the approach

of vertically partitioning the RDF data. The results

showed that vertical partitioning achieved similar

performance to the property table technique

proposed to reduce the number of self-joins.

The RDF-3X (RDF Triple eXpress) [16],

designed and implemented from scratch specifically

for the management and querying of RDF data,

outperformed the previously best alternative [15] by

one or two orders of magnitude.

The contributions of this paper are as

following:1) applying MongoDB to store and query

RDF data; 2) using the standard Berlin SPARQL

benchmark to compare all three kinds of database

systems: native RDF store, relational database, and

NoSQL database. The analysis of the comparison can

be a guideline for choosing an appropriate database

system for different kinds of applications. For example,

relational databases are suitable for applications with

complex queries while NoSQL databases should be

used for applications with simple queries.

Currently the amount of web data has

increased excessively. Its metadata is widely used in

order to fully exploit web information resources. The

Semantic Web is a Web of data that the World Wide

Web Consortium has the vision to provide a common

framework that allows data to be shared and reused

across applications and enterprises. Thus, there is

the need for the definition of the relations among data
that allows a better and automatic interchange of data.

Resource Description Framework (RDF), which is one

of the fundamental building blocks of the Semantic

Web, gives a formation definition for the interchange

of data. It is a standard for describing web resources.

RDF data is in the form subject-predicate-
object which is called triples. The subject describes

the resource while the predicate is the relation or

property between the subject and the object.

For example, one way to represent the notion
“The woman has the sweets” in RDF is as the triple:

a subject denoting “the woman”, a predicate denoting

“has”, and an object denoting “the sweets.”

Many types of storage engines are designed

and evaluate for triples. One of those types is a

triple store which is a purpose-built database for the

storage and retrieval of triples. Queries on these

triples are in SPARQL, which is a language designed

specifically to query RDF databases. The efficiency

of RDF data analysis depends on the performance of

RDF storage and query engine.

Traditional RDF database systems query

data from native RDF stores or from relational

database systems. The motivation for such native

RDF-specific stores is that the relational model is not

particularly suitable towards storage and retrieval of

RDF data because RDF is a graph data model.

However, relational database systems are equipped

with mature optimization techniques for storing and

querying data.

NoSQL database is another type of database

that is not relational database and not use SQL to

query the data. NoSQL database has the data model

that can divide into four types which are document

database using JSON data format, key-value

database, column store database, and graph

database. NoSQL database has different

characteristics from relational databases, such as

schema-free and replication support. The motivation

for this approach includes the simplicity of design

and the horizontal scaling for supporting big data.

Recently, NoSQL databases have been more

successful than traditional relational database

systems for the ability inprocessing big data on the

cloud effectively [1]. In NoSQL databases,to gain

performance, ACID (Atomic, Consistency, Isolation,

and Duration), which is a set of properties that

guarantee that database transactions are processed

reliably, is sacrificed [2]. However, the advocates of
NoSQL databases argue that they should rather enforce

the triple of requirements including consistency (C),

availability (A) and partitioning tolerance (P), shortly

CAP [1].

of the question how to process web data quickly.

Thus, we propose a method to exploit a NoSQL

database, specifically MongoDB, to store and query

RDF. MongoDB is chosen because it is one of widely

used NoSQL databases. The system first invokes

NoSQL API to retrieve MongoDB data in JSON format.

Then, the JSON parser module converts JSON

data to RDF data. We evaluate our design and

implementation by using the Berlin SPARQL

Benchmark, which is one of the most widely accepted

benchmarks for comparing the performance of three

RDF storage systems which include Apache Jena

TDB (native RDF store), MySQL (relational database),

and MongoDB (NoSQL database).

data management research. Bizer and Schultz [3]

proposed the Berlin SPARQL Benchmark (BSBM) for

comparing the performance of native RDF stores

(Sesame, Virtuoso, Jena TDB, and Jena SDB),

SPARQL-to-SQL rewriters (D2R Server and Virtuoso

RDF Views), and relational database management

systems (MySQL and Virtuoso RDBMS). The rewriting

approach outperformed native RDF storage with the

increasing dataset. The other important result was

that relational database management systems were

faster than the SPARQL-to-SQL rewriters. The authors

of this related paper explained that RDF stores might

not have a mature optimization technique as SQL

query engines had. Our paper uses the BSBM

benchmark to evaluate RDF storage systems but we

also propose the approach to use and evaluate

NoSQL database as a RDF data query processing

system.
There has been some work on querying RDF

data from NoSQL databases [4-6]. Cudre-Mauroux

et al. [4] made the first attempt at characterizing and

comparing NoSQL stores and native RDF stores for

RDF processing. They used the Berlin SPARQL

Benchmark and the DBpedia SPARQL Benchmark to

evaluate and compare a native RDF store (4store)

with four NoSQL databases which included Jena+H-
Base, Hive+HBase, CumulusRDF, and Counchbase.

All experiments were performed on the Amazon EC2

Elastic Compute Cloud infrastructure. Based on the

experimental results, NoSQL systems, such as

Jena+HBase, processed simple SPARQL queries

more efficiently than native RDF stores, such as 4store.

On the other hand, for more complex SPARQL queries

requiring several many joins and filters, NoSQL

systems took longer time than 4store. Although both

this related work and our work compare NoSQL

systems and native RDF systems, but our paper also

evaluates the performance of a relational base

database system as well.

Angles and Gtierrez studied the RDF model

from a database perspective and compared it with

other database models [5]. However, they did not

implement and evaluate a graph database for storying

and querying RDF data like we do. Lately, Bendar et

al. [6] performed the comparison of RDF databases,

NoSQL databases, and relational databases for the

Semantic Web applications with their own developed

benchmark. However, they did not provide the

analysis the types of queries for which each database

was suitable.

Sequeda and Miranker [7] chose to execute

SPARQL queries on RDF representation of the legacy

relational data by implementing the system called

Ultrawrap. Ultrawrap encoded a logical representation

of the database as an RDF graph using SQL views

and a translation of SPARQL queries to SQL queries.

To improve query execution time, detection of
unsatisfiable conditions and self-join elimination could

be applied to the SQL from the translations of

SPARQL queries.

Alexaki et al. [8] presented the ICS-FORTH

RDFSuite, a suite of tools for RDF validation, store,and

querying. They proposed the design of a persistent

RDF store (RSSDB) for loading resource descriptions

in an Object Relational Database Management

System (ORDBMS) by using RDF schema knowledge.

They also presented RQL as a declarative language

for querying both RDF descriptions and schemas.

However, they did not compare their proposed system

with other database systems and did not use a

standard benchmark like BSBM.

Several researchers have attempted to

design and develop RDF storage and query engine

using relational DBMSs [9-11]. Harris et al. [9]

proposed 3store as a RDF storage and query engine

and extended it to support SPARQL query interface

[10]. However, 3store had not been evaluated and

compared with other systems [9-10]. Jena1 [11] and

Jena2 [12] are popular Semantic Web programmers’

toolkits that have been downloaded for several

thousand times. Jena1 is an open-source project,

implemented in Java, and available for download for

free. Its core is the capability in manipulating RDF

graphs. Jena2 was extended to support multiple and

flexible presentations of RDF graphs and to provide

a simple minimal list view of the RDF graph to the

application programmers.

There are several works about scalable RDF

engines for storing, indexing, and querying [13-16].

The main focus of Jena2 was to improve the

performance and scalability due to these problems:

too many joins, single statement table, reification

storage bloat, and query optimization [13]. To address

these issues, the Jena2 schema design supported a

denormalized schema for storing resource URIs and

simple literal values directly in the statement table. In
addition, to improve performance through locality and

caching, Jena2 also supported the use of multiple

statement tables.

Sesame [14] was one of the first architectures

which its aim was for efficient storing and querying a

large amount of RDF data. However, there were some

unsupported operations, such as aggregates [15].

Also, implementing triple store directly in PostgresSQL

was faster than that of Sesame’s interfaces and

SeRQL [15]. Abadi et al. [15] proposed the approach

of vertically partitioning the RDF data. The results

showed that vertical partitioning achieved similar

performance to the property table technique

proposed to reduce the number of self-joins.

The RDF-3X (RDF Triple eXpress) [16],

designed and implemented from scratch specifically

for the management and querying of RDF data,

outperformed the previously best alternative [15] by

one or two orders of magnitude.

The contributions of this paper are as

following:1) applying MongoDB to store and query

RDF data; 2) using the standard Berlin SPARQL

benchmark to compare all three kinds of database

systems: native RDF store, relational database, and

NoSQL database. The analysis of the comparison can

be a guideline for choosing an appropriate database

system for different kinds of applications. For example,

relational databases are suitable for applications with

complex queries while NoSQL databases should be

used for applications with simple queries.

0/5000

จาก: -

เป็น: -

ผลลัพธ์ (ไทย) 1: [สำเนา]

คัดลอก!

Currently the amount of web data has increased excessively. Its metadata is widely used in order to fully exploit web information resources. The Semantic Web is a Web of data that the World Wide Web Consortium has the vision to provide a common framework that allows data to be shared and reused across applications and enterprises. Thus, there is the need for the definition of the relations among datathat allows a better and automatic interchange of data. Resource Description Framework (RDF), which is one of the fundamental building blocks of the Semantic Web, gives a formation definition for the interchange of data. It is a standard for describing web resources.RDF data is in the form subject-predicate-object which is called triples. The subject describes the resource while the predicate is the relation or property between the subject and the object. For example, one way to represent the notion“The woman has the sweets” in RDF is as the triple: a subject denoting “the woman”, a predicate denoting “has”, and an object denoting “the sweets.”Many types of storage engines are designed and evaluate for triples. One of those types is a triple store which is a purpose-built database for the storage and retrieval of triples. Queries on these triples are in SPARQL, which is a language designed specifically to query RDF databases. The efficiency of RDF data analysis depends on the performance of RDF storage and query engine. Traditional RDF database systems query data from native RDF stores or from relational database systems. The motivation for such native RDF-specific stores is that the relational model is not particularly suitable towards storage and retrieval of RDF data because RDF is a graph data model. However, relational database systems are equipped with mature optimization techniques for storing and querying data.NoSQL database is another type of database that is not relational database and not use SQL to query the data. NoSQL database has the data model that can divide into four types which are document database using JSON data format, key-value database, column store database, and graph database. NoSQL database has different characteristics from relational databases, such as schema-free and replication support. The motivation for this approach includes the simplicity of design and the horizontal scaling for supporting big data.Recently, NoSQL databases have been more successful than traditional relational database systems for the ability inprocessing big data on the cloud effectively [1]. In NoSQL databases,to gain performance, ACID (Atomic, Consistency, Isolation, and Duration), which is a set of properties that guarantee that database transactions are processed reliably, is sacrificed [2]. However, the advocates ofNoSQL databases argue that they should rather enforcethe triple of requirements including consistency (C), availability (A) and partitioning tolerance (P), shortly CAP [1].of the question how to process web data quickly. Thus, we propose a method to exploit a NoSQL database, specifically MongoDB, to store and query RDF. MongoDB is chosen because it is one of widely used NoSQL databases. The system first invokes NoSQL API to retrieve MongoDB data in JSON format. Then, the JSON parser module converts JSON data to RDF data. We evaluate our design and implementation by using the Berlin SPARQL Benchmark, which is one of the most widely accepted benchmarks for comparing the performance of three RDF storage systems which include Apache Jena TDB (native RDF store), MySQL (relational database), and MongoDB (NoSQL database).data management research. Bizer and Schultz [3] proposed the Berlin SPARQL Benchmark (BSBM) for comparing the performance of native RDF stores (Sesame, Virtuoso, Jena TDB, and Jena SDB), SPARQL-to-SQL rewriters (D2R Server and Virtuoso RDF Views), and relational database management systems (MySQL and Virtuoso RDBMS). The rewriting approach outperformed native RDF storage with the increasing dataset. The other important result was that relational database management systems were faster than the SPARQL-to-SQL rewriters. The authors of this related paper explained that RDF stores might not have a mature optimization technique as SQL query engines had. Our paper uses the BSBM benchmark to evaluate RDF storage systems but we also propose the approach to use and evaluate NoSQL database as a RDF data query processing system.There has been some work on querying RDF data from NoSQL databases [4-6]. Cudre-Mauroux et al. [4] made the first attempt at characterizing and comparing NoSQL stores and native RDF stores for RDF processing. They used the Berlin SPARQL Benchmark and the DBpedia SPARQL Benchmark to evaluate and compare a native RDF store (4store) with four NoSQL databases which included Jena+H-Base, Hive+HBase, CumulusRDF, and Counchbase. All experiments were performed on the Amazon EC2 Elastic Compute Cloud infrastructure. Based on the experimental results, NoSQL systems, such as Jena+HBase, processed simple SPARQL queries more efficiently than native RDF stores, such as 4store. On the other hand, for more complex SPARQL queries requiring several many joins and filters, NoSQL systems took longer time than 4store. Although both this related work and our work compare NoSQL systems and native RDF systems, but our paper also evaluates the performance of a relational base database system as well. Angles and Gtierrez studied the RDF model from a database perspective and compared it with other database models [5]. However, they did not implement and evaluate a graph database for storying and querying RDF data like we do. Lately, Bendar et al. [6] performed the comparison of RDF databases, NoSQL databases, and relational databases for the Semantic Web applications with their own developed benchmark. However, they did not provide the analysis the types of queries for which each database was suitable.Sequeda and Miranker [7] chose to execute SPARQL queries on RDF representation of the legacy relational data by implementing the system called Ultrawrap. Ultrawrap encoded a logical representation of the database as an RDF graph using SQL views and a translation of SPARQL queries to SQL queries. To improve query execution time, detection ofunsatisfiable conditions and self-join elimination couldbe applied to the SQL from the translations of SPARQL queries. Alexaki et al. [8] presented the ICS-FORTH RDFSuite, a suite of tools for RDF validation, store,and querying. They proposed the design of a persistent RDF store (RSSDB) for loading resource descriptions in an Object Relational Database Management System (ORDBMS) by using RDF schema knowledge. They also presented RQL as a declarative language for querying both RDF descriptions and schemas. However, they did not compare their proposed system with other database systems and did not use a standard benchmark like BSBM. Several researchers have attempted to design and develop RDF storage and query engine using relational DBMSs [9-11]. Harris et al. [9] proposed 3store as a RDF storage and query engine
and extended it to support SPARQL query interface

[10]. However, 3store had not been evaluated and

compared with other systems [9-10]. Jena1 [11] and

Jena2 [12] are popular Semantic Web programmers’

toolkits that have been downloaded for several

thousand times. Jena1 is an open-source project,

implemented in Java, and available for download for

free. Its core is the capability in manipulating RDF

graphs. Jena2 was extended to support multiple and

flexible presentations of RDF graphs and to provide

a simple minimal list view of the RDF graph to the

application programmers.

There are several works about scalable RDF

engines for storing, indexing, and querying [13-16].

The main focus of Jena2 was to improve the

performance and scalability due to these problems:

too many joins, single statement table, reification

storage bloat, and query optimization [13]. To address

these issues, the Jena2 schema design supported a

denormalized schema for storing resource URIs and

simple literal values directly in the statement table. In
addition, to improve performance through locality and

caching, Jena2 also supported the use of multiple

statement tables.

Sesame [14] was one of the first architectures

which its aim was for efficient storing and querying a

large amount of RDF data. However, there were some

unsupported operations, such as aggregates [15].

Also, implementing triple store directly in PostgresSQL

was faster than that of Sesame’s interfaces and

SeRQL [15]. Abadi et al. [15] proposed the approach

of vertically partitioning the RDF data. The results

showed that vertical partitioning achieved similar

performance to the property table technique

proposed to reduce the number of self-joins.

The RDF-3X (RDF Triple eXpress) [16],

designed and implemented from scratch specifically

for the management and querying of RDF data,

outperformed the previously best alternative [15] by

one or two orders of magnitude.

The contributions of this paper are as

following:1) applying MongoDB to store and query

RDF data; 2) using the standard Berlin SPARQL

benchmark to compare all three kinds of database

systems: native RDF store, relational database, and

NoSQL database. The analysis of the comparison can

be a guideline for choosing an appropriate database

system for different kinds of applications. For example,

relational databases are suitable for applications with

complex queries while NoSQL databases should be

used for applications with simple queries.

การแปล กรุณารอสักครู่..

ผลลัพธ์ (ไทย) 2:[สำเนา]

คัดลอก!

ปัจจุบันปริมาณของข้อมูลเว็บได้เพิ่มขึ้นมากเกินไป เมตาดาต้าของมันถูกนำมาใช้กันอย่างแพร่หลายในการสั่งซื้อเพื่อใช้ประโยชน์อย่างเต็มที่แหล่งข้อมูลบนเว็บ เว็บความหมายเป็นเว็บของข้อมูลที่เวิลด์ไวด์เว็บสมาคมมีวิสัยทัศน์ที่จะให้ร่วมกันกรอบที่ช่วยให้ข้อมูลที่จะใช้ร่วมกันและนำกลับมาใช้ในการใช้งานและสถานประกอบการ ดังนั้นจึงมีความจำเป็นสำหรับความหมายของความสัมพันธ์ระหว่างข้อมูลที่ช่วยให้การแลกเปลี่ยนที่ดีขึ้นและอัตโนมัติของข้อมูล. ทรัพยากรอธิบายหลักการ (RDF) ซึ่งเป็นหนึ่งของหน่วยการสร้างพื้นฐานของความหมายของเว็บให้คำนิยามการก่อสำหรับการแลกเปลี่ยนข้อมูล มันเป็นมาตรฐานสำหรับการอธิบายแหล่งข้อมูลบนเว็บ. ข้อมูล RDF อยู่ในเรื่อง predicate- รูปแบบวัตถุซึ่งเป็นที่เรียกว่าอเนกประสงค์ เรื่องนี้อธิบายทรัพยากรในขณะที่คำกริยาที่มีความสัมพันธ์หรือสถานที่ให้บริการระหว่างวัตถุและวัตถุ. ยกตัวอย่างเช่นวิธีหนึ่งที่จะเป็นตัวแทนของความคิด"ผู้หญิงที่มีขนม" ใน RDF เป็นสาม: เป็นเรื่องที่แสดงถึง "ผู้หญิงคนนั้น" เป็นคำกริยาที่แสดงถึง"มี" และวัตถุที่แสดงถึง "ขนม." หลายประเภทของเครื่องมือการเก็บรักษาได้รับการออกแบบและประเมินผลสำหรับอเนกประสงค์ หนึ่งในประเภทที่เป็นร้านค้าที่สามซึ่งเป็นฐานข้อมูลที่สร้างขึ้นเพื่อวัตถุประสงค์สำหรับการจัดเก็บและการดึงของอเนกประสงค์ คำสั่งเหล่านี้อเนกประสงค์อยู่ใน SPARQL ซึ่งเป็นภาษาที่ได้รับการออกแบบมาโดยเฉพาะเพื่อค้นหาฐานข้อมูลRDF ประสิทธิภาพของการวิเคราะห์ข้อมูล RDF ขึ้นอยู่กับประสิทธิภาพการทำงานของเก็บขยะเชื้อเพลิงและเครื่องยนต์แบบสอบถาม. ฐานข้อมูล RDF แผนระบบการสอบถามข้อมูลจากร้านค้าRDF พื้นเมืองสัมพันธ์หรือจากระบบฐานข้อมูล แรงจูงใจสำหรับการดังกล่าวพื้นเมืองร้านค้าRDF-เฉพาะคือว่ารูปแบบความสัมพันธ์ที่ไม่เหมาะสมโดยเฉพาะอย่างยิ่งต่อการจัดเก็บและการดึงข้อมูลRDF เพราะ RDF เป็นรูปแบบกราฟข้อมูล. แต่ระบบฐานข้อมูลเชิงสัมพันธ์มีการติดตั้งกับเทคนิคการเพิ่มประสิทธิภาพผู้ใหญ่สำหรับการจัดเก็บและการสอบถามข้อมูลฐานข้อมูล NoSQL อีกประเภทหนึ่งของฐานข้อมูลที่ไม่ได้เป็นฐานข้อมูลเชิงสัมพันธ์และไม่ใช้SQL เพื่อสอบถามข้อมูล ฐานข้อมูล NoSQL มีรูปแบบข้อมูลที่สามารถแบ่งออกเป็นสี่ประเภทที่มีเอกสารฐานข้อมูลโดยใช้รูปแบบข้อมูลJSON, ค่าคีย์ฐานข้อมูลฐานข้อมูลที่เก็บคอลัมน์และกราฟฐานข้อมูล ฐานข้อมูล NoSQL ที่แตกต่างกันมีลักษณะจากฐานข้อมูลเชิงสัมพันธ์เช่นคีมาฟรีและการสนับสนุนการจำลองแบบ แรงจูงใจสำหรับวิธีการนี้รวมถึงความเรียบง่ายของการออกแบบและการปรับขนาดในแนวนอนเพื่อรองรับข้อมูลขนาดใหญ่. เมื่อเร็ว ๆ นี้ฐานข้อมูล NoSQL ได้รับมากขึ้นประสบความสำเร็จกว่าฐานข้อมูลเชิงสัมพันธ์แบบดั้งเดิมระบบสำหรับความสามารถในการinprocessing ข้อมูลขนาดใหญ่ในระบบคลาวด์ได้อย่างมีประสิทธิภาพ[1] ในฐานข้อมูล NoSQL เพื่อให้ได้รับประสิทธิภาพการทำงานของกรด(อะตอมสอดคล้องแยกและระยะเวลา) ซึ่งเป็นชุดของคุณสมบัติที่รับประกันได้ว่าการทำธุรกรรมฐานข้อมูลมีการประมวลผลได้อย่างน่าเชื่อถือเสียสละ[2] แต่ผู้สนับสนุนของฐานข้อมูล NoSQL ยืนยันว่าพวกเขาค่อนข้างจะบังคับใช้สามของความต้องการรวมถึงความสอดคล้อง(C) ความพร้อม (A) และความอดทนแบ่งพาร์ทิชัน (P) ไม่นานCAP [1]. ของคำถามวิธีการประมวลผลข้อมูลได้อย่างรวดเร็วเว็บดังนั้นเราจึงนำเสนอวิธีการที่จะใช้ประโยชน์จาก NoSQL ฐานข้อมูลเฉพาะ MongoDB, การจัดเก็บและสอบถามRDF MongoDB ถูกเลือกเพราะมันเป็นหนึ่งในการยอมรับอย่างกว้างขวางใช้ฐานข้อมูลNoSQL ระบบแรกเรียกNoSQL API เพื่อดึงข้อมูล MongoDB ในรูปแบบ JSON. จากนั้นโมดูล parser JSON JSON แปลงข้อมูลไปยังข้อมูลRDF เราจะประเมินการออกแบบของเราและการดำเนินงานโดยใช้เบอร์ลิน SPARQL เกณฑ์มาตรฐานซึ่งเป็นหนึ่งในที่สุดได้รับการยอมรับมาตรฐานสำหรับการเปรียบเทียบผลการดำเนินงานของทั้งสามระบบจัดเก็บข้อมูลRDF ซึ่งรวมถึง Apache เจTDB (ร้าน RDF พื้นเมือง), MySQL (ฐานข้อมูลเชิงสัมพันธ์) และ MongoDB (ฐานข้อมูล NoSQL). การวิจัยการจัดการข้อมูล Bizer และชูลท์ซ [3] ที่เสนอเบอร์ลิน SPARQL เกณฑ์มาตรฐาน (BSBM) สำหรับการเปรียบเทียบผลการดำเนินงานของร้านค้าRDF พื้นเมือง(งา, อัจฉริยะ, เจ TDB และเจ SDB) SPARQL เพื่อ SQL Rewriters (D2R Server และอัจฉริยะชมRDF) และการจัดการฐานข้อมูลเชิงสัมพันธ์ระบบ(MySQL และอัจฉริยะ RDBMS) เขียนวิธีการจัดเก็บข้อมูลประสิทธิภาพสูงกว่า RDF พื้นเมืองที่มีชุดข้อมูลที่เพิ่มขึ้น ผลที่สำคัญอื่น ๆ คือการที่ระบบการจัดการฐานข้อมูลเชิงสัมพันธ์ได้เร็วกว่าSPARQL เพื่อ SQL Rewriters ผู้เขียนกระดาษที่เกี่ยวข้องกับเรื่องนี้อธิบายว่าร้านค้า RDF อาจจะไม่ได้มีเทคนิคการเพิ่มประสิทธิภาพSQL ผู้ใหญ่เป็นเครื่องมือแบบสอบถามมี กระดาษของเราใช้ BSBM มาตรฐานในการประเมินระบบจัดเก็บข้อมูล RDF แต่เรายังนำเสนอวิธีการที่จะใช้และประเมินผลฐานข้อมูลNoSQL เป็นประมวลผลการค้นหาข้อมูล RDF ระบบ. มีการทำงานบางอย่างเกี่ยวกับการสอบถาม RDF ข้อมูลจากฐานข้อมูล NoSQL [4-6] Cudre-Mauroux et al, [4] ทำให้ความพยายามครั้งแรกที่พัฒนาการและเปรียบเทียบร้านค้าและร้านค้าNoSQL RDF พื้นเมืองสำหรับการประมวลผลRDF พวกเขาใช้เบอร์ลิน SPARQL เกณฑ์มาตรฐานและเกณฑ์มาตรฐาน DBpedia SPARQL เพื่อประเมินและเปรียบเทียบร้านRDF พื้นเมือง (4store) กับสี่ฐานข้อมูล NoSQL ซึ่งรวมถึงเจ + H- ฐาน Hive + HBase, CumulusRDF และ Counchbase. การทดลองทั้งหมดได้รับการดำเนินการเกี่ยวกับอเมซอน EC2 Elastic Compute Cloud โครงสร้างพื้นฐาน ขึ้นอยู่กับผลการทดลองระบบ NoSQL เช่นเจ+ HBase ประมวลผลง่ายแบบสอบถาม SPARQL มีประสิทธิภาพมากขึ้นกว่าร้านค้า RDF พื้นเมืองเช่น 4store. ในทางกลับกันสำหรับคำสั่ง SPARQL ที่ซับซ้อนมากขึ้นที่ต้องใช้หลายหลายร่วมและตัวกรอง, NoSQL ระบบเอา เวลานานกว่า 4store แม้ว่าทั้งสองงานนี้ที่เกี่ยวข้องและการทำงานของเราเปรียบเทียบ NoSQL ระบบและระบบ RDF พื้นเมือง แต่กระดาษของเรายังประเมินประสิทธิภาพการทำงานของฐานสัมพันธ์ระบบฐานข้อมูลเช่นกัน. มุมและ Gtierrez ศึกษารูปแบบ RDF จากมุมมองของฐานข้อมูลและเมื่อเทียบกับรูปแบบฐานข้อมูลอื่น ๆ [5] แต่พวกเขาไม่ได้ดำเนินการและประเมินผลฐานข้อมูลกราฟสำหรับ storying และสอบถามข้อมูล RDF เหมือนที่เราทำ เมื่อเร็ว ๆ นี้ Bendar et al, [6] ดำเนินการเปรียบเทียบฐานข้อมูล RDF ที่ฐานข้อมูลNoSQL และฐานข้อมูลเชิงสัมพันธ์สำหรับการใช้งานเว็บที่มีความหมายของตัวเองที่พัฒนามาตรฐาน แต่พวกเขาไม่ได้ให้การวิเคราะห์ชนิดของแบบสอบถามที่แต่ละฐานข้อมูลมีความเหมาะสม. Sequeda และ Miranker [7] เลือกที่จะดำเนินการคำสั่งSPARQL กับการเป็นตัวแทนของ RDF มรดกข้อมูลเชิงสัมพันธ์โดยการใช้ระบบที่เรียกว่าUltrawrap Ultrawrap เข้ารหัสการแสดงตรรกะของฐานข้อมูลเป็นกราฟRDF โดยใช้มุมมอง SQL และการแปลของแบบสอบถาม SPARQL แบบสอบถาม SQL ได้. เพื่อปรับปรุงเวลาดำเนินการแบบสอบถามการตรวจสอบเงื่อนไข unsatisfiable และการกำจัดตัวเองเข้าร่วมสามารถนำมาประยุกต์ใช้กับSQL จากการแปลของคำสั่ง SPARQL. Alexaki et al, [8] นำเสนอ ICS-FORTH RDFSuite, ชุดเครื่องมือสำหรับการตรวจสอบ RDF การจัดเก็บและการสอบถาม พวกเขานำเสนอการออกแบบของถาวรร้าน RDF (RSSDB) สำหรับการโหลดรายละเอียดทรัพยากรในการจัดการฐานข้อมูลเชิงสัมพันธ์วัตถุระบบ(ORDBMS) โดยใช้ความรู้สคี RDF. พวกเขายังนำเสนอ RQL เป็นภาษาที่เปิดเผยสำหรับการสอบถามทั้งรายละเอียดRDF และแผนผัง. อย่างไรก็ตาม พวกเขาไม่ได้เปรียบเทียบระบบที่นำเสนอของพวกเขาที่มีระบบฐานข้อมูลอื่นๆ และไม่ได้ใช้มาตรฐานมาตรฐานเช่นBSBM. นักวิจัยหลายคนได้พยายามที่จะออกแบบและพัฒนาเก็บขยะเชื้อเพลิงและเครื่องยนต์แบบสอบถามโดยใช้DBMSs สัมพันธ์ [9-11] แฮร์ริสและอัล [9] เสนอ 3store เป็นที่เก็บ RDF และเครื่องมือค้นหาและขยายการให้การสนับสนุนอินเตอร์เฟซแบบสอบถามSPARQL [10] อย่างไรก็ตาม 3store ไม่ได้รับการประเมินและเมื่อเทียบกับระบบอื่นๆ [9-10] Jena1 [11] และJena2 [12] เป็นที่นิยมการเขียนโปรแกรมเว็บความหมาย 'ชุดเครื่องมือที่ได้รับการดาวน์โหลดได้หลายพันครั้ง Jena1 เป็นโครงการที่มาเปิดดำเนินการในJava และพร้อมสำหรับการดาวน์โหลดฟรี หลักของมันคือความสามารถในการจัดการใน RDF กราฟ Jena2 ได้รับการขยายเพื่อรองรับหลายรูปแบบและการนำเสนอผลงานที่มีความยืดหยุ่นของกราฟRDF และเพื่อให้มุมมองรายการที่น้อยที่สุดที่เรียบง่ายของรูปแบบของกราฟRDF ไปเขียนโปรแกรมประยุกต์. มีหลายผลงานที่เกี่ยวกับการปรับขนาดได้ RDF เป็นเครื่องมือสำหรับการจัดเก็บการจัดทำดัชนีและการสอบถาม[13-16] จุดสนใจหลักของ Jena2 คือการปรับปรุงประสิทธิภาพและความยืดหยุ่นเนื่องจากปัญหาเหล่านี้มากเกินไปร่วมโต๊ะคำเดียว, ทำให้เป็นจริงการขยายตัวการจัดเก็บและเพิ่มประสิทธิภาพการค้นหา [13] เพื่อแก้ไขปัญหาเหล่านี้การออกแบบสคี Jena2 สนับสนุนสคีdenormalized สำหรับการจัดเก็บ URIs ทรัพยากรและค่าที่แท้จริงง่ายโดยตรงในตารางคำสั่ง ในนอกจากนี้ในการปรับปรุงประสิทธิภาพการทำงานผ่านท้องที่และแคชJena2 ยังสนับสนุนการใช้งานของหลายตารางคำสั่ง. งา [14] เป็นหนึ่งในสถาปัตยกรรมแรกที่จุดมุ่งหมายคือการจัดเก็บที่มีประสิทธิภาพและสอบถามข้อมูลจำนวนมากRDF อย่างไรก็ตามมีบางการดำเนินงานที่ไม่สนับสนุนเช่นมวล [15]. นอกจากนี้การดำเนินการจัดเก็บสามโดยตรงใน PostgresSQL ได้เร็วกว่าที่ของอินเตอร์เฟซงาและSeRQL [15] Abadi et al, [15] ที่นำเสนอวิธีการของแนวตั้งแบ่งข้อมูลRDF ผลการศึกษาพบว่าการแบ่งแนวตั้งที่ประสบความสำเร็จที่คล้ายกันประสิทธิภาพเทคนิคตารางทรัพย์สินที่เสนอให้ลดจำนวนของตัวเองร่วม. RDF-3X (RDF Triple ด่วน) [16], ออกแบบและดำเนินการตั้งแต่เริ่มต้นโดยเฉพาะการจัดการและการสอบถามของ RDF ข้อมูลเฮงเลือกที่ดีที่สุดก่อนหน้านี้[15] โดยหนึ่งหรือสองคำสั่งของขนาด. ผลงานของการวิจัยนี้มีดังต่อไปนี้: 1) ใช้ MongoDB การจัดเก็บและสอบถามข้อมูลRDF; 2) โดยใช้มาตรฐานเบอร์ลิน SPARQL มาตรฐานเพื่อเปรียบเทียบทั้งสามชนิดของฐานข้อมูลระบบเก็บ RDF พื้นเมืองฐานข้อมูลเชิงสัมพันธ์และฐานข้อมูลNoSQL การวิเคราะห์เปรียบเทียบที่สามารถเป็นแนวทางในการเลือกฐานข้อมูลที่เหมาะสมของระบบสำหรับชนิดที่แตกต่างของการใช้งาน ยกตัวอย่างเช่นฐานข้อมูลเชิงสัมพันธ์มีความเหมาะสมสำหรับการใช้งานกับคำสั่งที่ซับซ้อนในขณะที่ฐานข้อมูลNoSQL ควรจะใช้สำหรับการใช้งานที่มีคำสั่งง่ายๆ

การแปล กรุณารอสักครู่..

ผลลัพธ์ (ไทย) 3:[สำเนา]

คัดลอก!

ขณะนี้ปริมาณของข้อมูลเว็บได้

เพิ่มมากเกินไป ของเมตาดาต้าที่ใช้กันอย่างแพร่หลายใน

เพื่อให้เต็มใช้ประโยชน์จากเว็บข้อมูลทรัพยากร

ความหมายของเว็บ เป็นเว็บของข้อมูลในโลกกว้าง

เว็บสมาคมมีวิสัยทัศน์ที่จะให้เหมือนกัน

กรอบที่ช่วยให้ข้อมูลที่จะใช้ร่วมกัน และใช้

ระหว่างโปรแกรมและองค์กร จึงมี

ต้องการความหมายของความสัมพันธ์ระหว่างข้อมูล
ที่ช่วยให้ดีขึ้น และการแลกเปลี่ยนอัตโนมัติของข้อมูล กรอบคำอธิบายทรัพยากร

( RDF ) ซึ่งเป็นหนึ่ง

ของพื้นฐานการสร้างบล็อกของความหมาย

เว็บ ให้สร้างคำนิยามสำหรับการแลกเปลี่ยน

ของข้อมูล มันเป็นมาตรฐานที่อธิบายถึงทรัพยากรเว็บ RDF ข้อมูล

ในรูปของกริยา -
วัตถุซึ่งเรียกว่าอเนกประสงค์ . เรื่องอธิบาย

ทรัพยากรในขณะที่ภาคแสดงคือ ความสัมพันธ์หรือ

คุณสมบัติระหว่างวัตถุกับวัตถุ

เช่น วิธีหนึ่งที่จะแสดงความคิด
" ผู้หญิงมีขนม " ใน RDF เป็นสามเท่า :

เรื่องถึง " ผู้หญิง " เป็นกริยา denoting

" มี " และวัตถุแสดงถึง " ขนม

"หลายประเภทของเครื่องมือกระเป๋าออกแบบ

และประเมินอเนกประสงค์ . หนึ่งในประเภทที่เป็น

สามร้าน ซึ่งมีวัตถุประสงค์ในการสร้างฐานข้อมูลสำหรับ

กระเป๋า และเรียกของอเนกประสงค์ . สอบถามในนี้

อเนกประสงค์อยู่ใน sparql ซึ่งเป็นภาษาที่ออกแบบมา

โดยเฉพาะแบบสอบถามฐานข้อมูล RDF . ประสิทธิภาพของการวิเคราะห์ข้อมูล RDF

ขึ้นอยู่กับการปฏิบัติ

ข้อมูลจัดเก็บและค้นหาเครื่องยนต์

การแปล กรุณารอสักครู่..

ภาษาอื่น ๆ

การสนับสนุนเครื่องมือแปลภาษา: กรีก, กันนาดา, กาลิเชียน, คลิงออน, คอร์สิกา, คาซัค, คาตาลัน, คินยารวันดา, คีร์กิซ, คุชราต, จอร์เจีย, จีน, จีนดั้งเดิม, ชวา, ชิเชวา, ซามัว, ซีบัวโน, ซุนดา, ซูลู, ญี่ปุ่น, ดัตช์, ตรวจหาภาษา, ตุรกี, ทมิฬ, ทาจิก, ทาทาร์, นอร์เวย์, บอสเนีย, บัลแกเรีย, บาสก์, ปัญจาป, ฝรั่งเศส, พาชตู, ฟริเชียน, ฟินแลนด์, ฟิลิปปินส์, ภาษาอินโดนีเซี, มองโกเลีย, มัลทีส, มาซีโดเนีย, มาราฐี, มาลากาซี, มาลายาลัม, มาเลย์, ม้ง, ยิดดิช, ยูเครน, รัสเซีย, ละติน, ลักเซมเบิร์ก, ลัตเวีย, ลาว, ลิทัวเนีย, สวาฮิลี, สวีเดน, สิงหล, สินธี, สเปน, สโลวัก, สโลวีเนีย, อังกฤษ, อัมฮาริก, อาร์เซอร์ไบจัน, อาร์เมเนีย, อาหรับ, อิกโบ, อิตาลี, อุยกูร์, อุสเบกิสถาน, อูรดู, ฮังการี, ฮัวซา, ฮาวาย, ฮินดี, ฮีบรู, เกลิกสกอต, เกาหลี, เขมร, เคิร์ด, เช็ก, เซอร์เบียน, เซโซโท, เดนมาร์ก, เตลูกู, เติร์กเมน, เนปาล, เบงกอล, เบลารุส, เปอร์เซีย, เมารี, เมียนมา (พม่า), เยอรมัน, เวลส์, เวียดนาม, เอสเปอแรนโต, เอสโทเนีย, เฮติครีโอล, แอฟริกา, แอลเบเนีย, โคซา, โครเอเชีย, โชนา, โซมาลี, โปรตุเกส, โปแลนด์, โยรูบา, โรมาเนีย, โอเดีย (โอริยา), ไทย, ไอซ์แลนด์, ไอร์แลนด์, การแปลภาษา.