Python songs_by_user examples

Programming language: Python

Namespace/package name: util

Method/function: songs_by_user

Examples at hotexamples.com: 4

Python songs_by_user - 4 examples found. These are the top-rated real-world Python examples of util.songs_by_user, extracted from open-source projects.

Example #1

File: colisten.py Project: RedSunCMX/Kaggle-5

# Build colisten matrix from triplet CSV and save in mtx format
# Usage: python colisten.py <infile> <outfile>

import scipy.sparse, scipy.io
import sys
import util

infile, outfile = sys.argv[1:]

colisten = scipy.sparse.lil_matrix((util.N_SONGS, util.N_SONGS))

for listens in util.songs_by_user(infile):
  for s, _ in listens:
    for t, _ in listens:
      colisten[s-1, t-1] += 1 # Songs are 1-indexed, but scipy uses 0-indexing

# file() exists only in Python 2; open() works in both Python 2 and 3
scipy.io.mmwrite(open(outfile, 'wb'), colisten)
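
None of the examples on this page show util.songs_by_user itself. The following is a minimal sketch of what such a helper might look like, assuming the triplet CSV has rows of the form user,song,count and that all rows for one user are adjacent. The row layout, the integer song IDs, and the fact that this version takes an iterable of lines rather than a filename are all assumptions made for illustration, not the project's actual code.

```python
import csv
import io
from itertools import groupby
from operator import itemgetter


def songs_by_user(lines):
    """Yield one list of (song, count) tuples per user.

    Hypothetical stand-in for util.songs_by_user: assumes rows of the
    form user,song,count, with each user's rows adjacent to each other.
    """
    rows = csv.reader(lines)
    # groupby only merges consecutive rows, hence the adjacency assumption
    for _, group in groupby(rows, key=itemgetter(0)):
        yield [(int(song), int(count)) for _, song, count in group]


# Tiny in-memory stand-in for the triplet CSV file
sample = io.StringIO("u1,1,3\nu1,2,1\nu2,2,5\n")
histories = list(songs_by_user(sample))
```

With this sample, the first user contributes two (song, count) pairs and the second user contributes one, which is the shape the loops in the examples iterate over.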

Example #2

# Build colisten matrix from triplet CSV and save in mtx format
# Usage: python colisten.py <infile> <outfile>

import scipy.sparse, scipy.io
import sys
import util

infile, outfile = sys.argv[1:]

colisten = scipy.sparse.lil_matrix((util.N_SONGS, util.N_SONGS))

for listens in util.songs_by_user(infile):
    for s, _ in listens:
        for t, _ in listens:
            # Songs are 1-indexed, but scipy uses 0-indexing
            colisten[s - 1, t - 1] += 1

# file() exists only in Python 2; open() works in both Python 2 and 3
scipy.io.mmwrite(open(outfile, 'wb'), colisten)
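
To make the counting in the nested loops concrete, here is a small worked sketch on made-up data. It swaps the scipy.sparse.lil_matrix for a dense NumPy array so the result is easy to inspect; N_SONGS and the histories list are hypothetical values, not taken from the project.

```python
import numpy

N_SONGS = 3  # hypothetical; the scripts above read this from util.N_SONGS

# Hypothetical histories: one list of (song, count) pairs per user
histories = [[(1, 3), (2, 1)], [(2, 5)]]

# Dense stand-in for the sparse colisten matrix
colisten = numpy.zeros((N_SONGS, N_SONGS), dtype=int)

for listens in histories:
    for s, _ in listens:
        for t, _ in listens:
            # Songs are 1-indexed, array positions are 0-indexed
            colisten[s - 1, t - 1] += 1

# Entry (i, j) counts users who listened to both songs; the diagonal
# therefore counts the users who listened to each individual song.
diagonal = colisten.diagonal()
```

Note that the play counts are ignored here, as in the scripts above: each user adds exactly 1 to every pair of songs in their history, including the pair of a song with itself.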

Example #3

File: predict_colisten.py Project: fisheuler/MSR

print("it takes %f secs to read the colisten matrix " % timetoread)

listens = colisten.diagonal()

listenranked = numpy.argsort(-listens)[:500]

# time.clock() was removed in Python 3.8; time.time() returns wall-clock seconds
predict_start = time.time()

print(" predict starts at %f :\n" % predict_start)

with open(outfile, 'w') as out:
    i = 0
    for history in util.songs_by_user(evalfile):
        i = i + 1
        print(" we are predict for %d user" % i)

        songs, counts = zip(*history)

        sim = numpy.array(counts)[numpy.newaxis, :] * \
              colisten[numpy.array(songs) - 1, :]

        simidxs = sim.nonzero()[1]

        srt = numpy.lexsort((-listens[simidxs], -sim[0, simidxs]))

        rankidxs = simidxs[srt]
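
The ranking step at the end of this excerpt relies on numpy.lexsort, which treats the last key in the tuple as the primary sort key. The sketch below replays that step on small dense arrays with made-up values; in the script itself sim comes from a sparse matrix product, so the shapes and numbers here are assumptions for illustration only.

```python
import numpy

# Hypothetical values: listens[i] is the overall listener count of song i,
# sim is a 1 x n_songs row of similarity scores for a single user.
listens = numpy.array([10, 40, 30, 20])
sim = numpy.array([[0, 2, 5, 2]])

# Column indices of the songs with a nonzero similarity score
simidxs = sim.nonzero()[1]

# lexsort sorts by the LAST key first, so this orders candidates by
# similarity (descending) and breaks ties by listener count (descending).
srt = numpy.lexsort((-listens[simidxs], -sim[0, simidxs]))

# Map positions within simidxs back to 0-based song indices
rankidxs = simidxs[srt]
```

Here song index 2 ranks first because it has the highest similarity, and the tie between indices 1 and 3 is resolved in favour of index 1, which has more listeners overall.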

Example #4

# Usage: python predict_colisten.py <mtxfile> <evalfile> <outfile>

import sys
import scipy.io
import numpy
import util

mtxfile, evalfile, outfile = sys.argv[1:]

# file() exists only in Python 2; mmread also accepts a filename directly
colisten = scipy.io.mmread(mtxfile).tocsr()
listens = colisten.diagonal()

listenranked = numpy.argsort(-listens)[:500]

with open(outfile, 'w') as out:
    for history in util.songs_by_user(evalfile):
        songs, counts = zip(*history)

        sim = numpy.array(counts)[numpy.newaxis, :] * colisten[
            numpy.array(songs) - 1, :]

        # All this nonsense is an optimization to avoid the fact that
        # sorting 300,000 numbers 110,000 times is bad for your health.
        # I only sort the songs where sim > 0
        simidxs = sim.nonzero()[1]
        srt = numpy.lexsort((-listens[simidxs], -sim[0, simidxs]))
        rankidxs = simidxs[srt]

        guess = []
        for s in rankidxs:
            if s + 1 in songs: