Python load_datasource 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: azureml.dataprep.datasource

메소드/함수: load_datasource

hotexamples.com에서의 예제들: 3

Python load_datasource - 3개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 azureml.dataprep.datasource.load_datasource에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

예제 #1

파일 보기

파일: score.py 프로젝트: isabella232/AzureML-Recommendation-RRS

def init():
    from azureml.dataprep import datasource
    df = datasource.load_datasource('ratings.dsource')

    from pyspark.ml.recommendation import ALS
    als = ALS() \
        .setUserCol("userId") \
        .setRatingCol("rating") \
        .setItemCol("movieId") \

    alsModel = als.fit(df)
    global userRecs
    userRecs = alsModel.recommendForAllUsers(10)

    # Query them in SQL
    import pydocumentdb.documents as documents
    import pydocumentdb.document_client as document_client
    import pydocumentdb.errors as errors
    import datetime

    MASTER_KEY = 'oX6tWPep8FCah8RM258s7cC3x9Kl8tWdbDxmNknXCP34ShW1Ag1ladvb5QWuBmMxuRISBO2HfrRFv3QeJYCSYg=='
    HOST = 'https://dcibrecommendationhack.documents.azure.com:443/'
    DATABASE_ID = "recommendation_engine"
    COLLECTION_ID = "user_recommendations"
    database_link = 'dbs/' + DATABASE_ID
    collection_link = database_link + '/colls/' + COLLECTION_ID

    global client, collection
    client = document_client.DocumentClient(HOST, {'masterKey': MASTER_KEY})
    collection = client.ReadCollection(collection_link=collection_link)

예제 #2

파일 보기

# initialize logger
run_logger = get_azureml_logger()

from azureml.dataprep import datasource

# start Spark session
spark = pyspark.sql.SparkSession.builder.appName(
    'classification').getOrCreate()
# print runtime versions
print('****************')
print('Python version: {}'.format(sys.version))
print('Spark version: {}'.format(spark.version))
print('****************')
print('***Prepare Input Data to get required attributes***')
inputdata = datasource.load_datasource('POLines.dsource')
data = inputdata.dropna(subset=['Category'])

print('***Filtering Training + Testing + Validation records***')
dsinput = data[data['Category'] != ""]
rawdata = dsinput[[
    'Category', 'Scenario', 'Company Code', 'Type', 'PGr', 'Created',
    'Short Text', 'Storage Location', 'Vendor Material Number',
    'Base Unit of Measure', 'Unit of Weight', 'Acct Assignment Cat',
    'Material freight grp', 'Plant', 'Profit Center'
]]
pdf = rawdata.toPandas()

print('Preparing a String Column for Classification')
pdf['inputstring'] = pdf[[
    'Scenario', 'Company Code', 'Type', 'PGr', 'Created', 'Short Text',

예제 #3

파일 보기

파일: iris.py 프로젝트: AtmaMani/azml-end-to-end

# Use the Azure Machine Learning data source package
from azureml.dataprep import datasource

# Use the Azure Machine Learning data collector to log various metrics
from azureml.logging import get_azureml_logger
logger = get_azureml_logger()

# This call will load the referenced data source and return a DataFrame.
# If run in a PySpark environment, this call returns a
# Spark DataFrame. If not, it will return a Pandas DataFrame.
df = datasource.load_datasource('iris.dsource')

# Remove this line and add code that uses the DataFrame
df.head(10)