Python Indexer.get_uuid_name Examples

Programming language: Python

Namespace/Package: indexer.smart_indexer

Class/Type: Indexer

Method/Function: get_uuid_name

Examples at hotexamples.com: 2

Python Indexer.get_uuid_name - 2 examples found. These are the top-rated real-world examples of indexer.smart_indexer.Indexer.get_uuid_name extracted from open source projects. You can rate the examples to help improve their quality.

Frequently used methods

Indexer(12)

generate_morphline_config(6)

guess_field_types(5)

guess_format(5)

get_kept_field_list(3)

get_unique_field(3)

is_unique_generated(3)

run_morphline(3)

get_uuid_name(2)

Example #1

File: api3.py Project: walter-woodall/hue

    import json

    from indexer.smart_indexer import Indexer

    # _convert_format, CollectionManagerController and JsonResponse are defined
    # or imported elsewhere in api3.py.


    def index_file(request):
        # Decode the file description posted by the client.
        file_format = json.loads(request.POST.get('fileFormat', '{}'))
        _convert_format(file_format["format"], inverse=True)
        collection_name = file_format["name"]

        indexer = Indexer(request.user, request.fs)
        # Pick a field name to hold each record's uuid.
        unique_field = indexer.get_uuid_name(file_format)
        schema_fields = [{"name": unique_field, "type": "string"}] + \
            indexer.get_kept_field_list(file_format['columns'])

        morphline = indexer.generate_morphline_config(collection_name, file_format, unique_field)

        # Create the collection only if it does not exist yet.
        collection_manager = CollectionManagerController(request.user)
        if not collection_manager.collection_exists(collection_name):
            collection_manager.create_collection(collection_name, schema_fields, unique_key_field=unique_field)

        # Index the file and report the job id to the caller.
        job_id = indexer.run_morphline(collection_name, morphline, file_format["path"])

        return JsonResponse({"jobId": job_id})
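Example #1 reads four keys from the decoded fileFormat payload: "format", "name", "columns" and "path". The following is a minimal sketch of checking for those keys before indexing. The helper name parse_file_format, the REQUIRED_KEYS tuple, and the inner structure of the sample payload are assumptions made for illustration; they are not part of the Hue code.

```python
import json

# Keys that index_file reads from the decoded payload in Example #1.
REQUIRED_KEYS = ("format", "name", "columns", "path")


def parse_file_format(raw):
    """Decode a fileFormat JSON string and list the expected keys it lacks.

    Hypothetical helper, not part of Hue.
    """
    file_format = json.loads(raw or "{}")
    missing = [key for key in REQUIRED_KEYS if key not in file_format]
    return file_format, missing


# Illustrative payload only: the shapes of "format" and "columns" are assumed.
example = json.dumps({
    "name": "test_collection",
    "path": "/tmp/test.csv",
    "format": {"type": "csv"},
    "columns": [{"name": "id", "type": "string"}],
})

file_format, missing = parse_file_format(example)
empty_format, empty_missing = parse_file_format("{}")
```

With the default '{}' payload that index_file falls back to, every key is reported missing, which is the case where the original function would raise a KeyError on file_format["format"].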

Example #2

File: tests_indexer.py Project: walter-woodall/hue

    def test_end_to_end(self):
        fs = cluster.get_hdfs()
        collection_name = "test_collection"
        indexer = Indexer("test", fs)
        input_loc = "/tmp/test.csv"

        # upload the test file to hdfs
        fs.create(input_loc, data=IndexerTest.simpleCSVString, overwrite=True)

        # open a filestream for the file on hdfs
        stream = fs.open(input_loc)

        # guess the format of the file
        file_type_format = indexer.guess_format({'file': {"stream": stream, "name": "test.csv"}})

        field_types = indexer.guess_field_types({"file": {"stream": stream, "name": "test.csv"}, "format": file_type_format})

        format_ = field_types.copy()
        format_['format'] = file_type_format

        # find a field name available to use for the record's uuid
        unique_field = indexer.get_uuid_name(format_)

        # generate morphline
        morphline = indexer.generate_morphline_config(collection_name, format_, unique_field)

        schema_fields = [{"name": unique_field, "type": "string"}] + indexer.get_kept_field_list(format_['columns'])

        # create the collection from the specified fields
        collection_manager = CollectionManagerController("test")
        if collection_manager.collection_exists(collection_name):
            collection_manager.delete_collection(collection_name, None)
        collection_manager.create_collection(collection_name, schema_fields, unique_key_field=unique_field)

        # index the file
        indexer.run_morphline(collection_name, morphline, input_loc)
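The comment in Example #2 describes get_uuid_name as finding "a field name available to use for the record's uuid". One way such a lookup could work is sketched below. This is an assumption, not the Hue implementation: the function name pick_uuid_field_name, the "_uuid" base name, and the underscore-prefix strategy are all hypothetical, and the sample columns are made up.

```python
def pick_uuid_field_name(columns, base_name="_uuid"):
    """Return a field name that does not clash with any existing column name.

    Hypothetical stand-in for illustration; prefixes the candidate with an
    underscore until it is unused.
    """
    taken = {column["name"] for column in columns}
    candidate = base_name
    while candidate in taken:
        candidate = "_" + candidate
    return candidate


# Made-up columns, one of which already uses the default name.
columns = [
    {"name": "id", "type": "string"},
    {"name": "_uuid", "type": "string"},
]
unique_field = pick_uuid_field_name(columns)

# Same list construction as in the examples: the unique field comes first.
# The real code passes the columns through get_kept_field_list; here they
# are used unfiltered.
schema_fields = [{"name": unique_field, "type": "string"}] + columns
```

Because the loop only stops once the candidate is absent from the set of column names, the returned name is always safe to use as the collection's unique key field.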