Python HDTDocument.search_triples_bytesの例

プログラミング言語: Python

名前空間/パッケージ名: hdt

クラス/型: HDTDocument

メソッド/関数: search_triples_bytes

hotexamples.comのコード掲載数: 2

Python HDTDocument.search_triples_bytes - 2件のコード例が見つかりました。すべてオープンソースプロジェクトから抽出されたPythonのhdt.HDTDocument.search_triples_bytesの実例で、最も評価が高いものを厳選しています。コード例の評価を行っていただくことで、より質の高いコード例が表示されるようになります。

よく使われるメソッド

表示非表示

search_triples(30)

HDTDocument(30)

configure_hops(9)

convert_term(9)

compute_hops(7)

remove(7)

search_triples_ids(5)

convert_id(4)

search_triples_bytes(2)

__init__(1)

filter_types(1)

get(1)

search_join(1)

string_to_global_id(1)

コード例 #1

ファイルを表示

def get_rdf_reader(file_path, format='nt'):
    """Get an iterator over RDF triples from a file"""
    iterator = None
    nb_triples = 0
    # load using rdflib
    if format == 'ttl':
        g = Graph()
        g.parse(file_path, format=format)
        nb_triples = len(g)
        iterator = map(__n3_to_str, g.triples((None, None, None)))
    elif format == 'nt':
        print('Counting triples using the wc command...')
        total = wccount(file_path)
        print('The file contains {} triples.'.format(total))
        f = open(file_path, 'r')
        iter = yield_triples(f)
        return iter, total, f

    elif format == 'hdt':
        # load HDTDocument without additional indexes (not needed since we do a ?s ?p ?o)
        doc = HDTDocument(file_path, True, True)
        iterator, nb_triples = doc.search_triples_bytes("", "", "")
    return iterator, nb_triples

コード例 #2

ファイルを表示

ファイル: obtain_type.py プロジェクト: shuaiwangvu/Logical_Inconsistency_LOD

import random
from tarjan import tarjan
from collections import Counter



PATH_LOD = "/scratch/wbeek/data/LOD-a-lot/data.hdt"
hdt_file = HDTDocument(PATH_LOD)

subclass = "http://www.w3.org/2000/01/rdf-schema#subClassOf"
rdfsClass = "http://www.w3.org/2000/01/rdf-schema#Class"
owlClass = "http://www.w3.org/2002/07/owl#Class"
eqClass = "http://www.w3.org/2002/07/owl#equivalentClass"
type = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"

(triples, cardi1) = hdt_file.search_triples_bytes("", type, "")
print ('there are in total ', cardi1, ' triples')

count = 0
ct = Counter()
for (_,_, t) in triples:
	count += 1
	if count %1000000 == 0:
		print (count , ', processed. That makes ',count / cardi1)
	try:
		t = t.decode('UTF-8')
	except UnicodeDecodeError as err:
		t = str(t, errors='ignore')
	ct[t] += 1

print (ct.most_common(100))