Python P2.sum_values_for_partitions примеры использования

Язык программирования: Python

Класс/Тип: P2

Метод/Функция: sum_values_for_partitions

Примеров на hotexamples.com: 4

Python P2.sum_values_for_partitions - 4 примера найдено. Это лучшие примеры Python кода для P2.sum_values_for_partitions из пакета books, полученные из open source проектов. Вы можете ставить оценку каждому примеру, чтобы помочь нам улучшить качество примеров.

Основные методы

Показать Скрыть

P2(5)

MinHeap(4)

getValidMove(3)

my_sort(3)

integra_polinomio(3)

AlignByDP(2)

sum_values_for_partitions(2)

HOAX(2)

getValidPlayerAction(2)

solve_problem(1)

predict_decade(1)

p_x_given_y(1)

p3(1)

p2(1)

p1(1)

mandelbrot(1)

f23(1)

find_index_optimized(1)

AutoTransaction(1)

draw_image(1)

compare_votes_map(1)

compare_votes(1)

automatedMove(1)

all_x_all_y(1)

Puzzle15MisDist(1)

Puzzle15MisColRow(1)

Puzzle15(1)

NPuzzleProblem(1)

Cohort(1)

year_stats(1)

Пример #1

Показать файл

Файл: P2b_alternative.py Проект: btweinstein/cs205-homework

num_pixels = 2000
rows = sc.range(num_pixels, numSlices=10)
cols = sc.range(num_pixels, numSlices=10)

indices = rows.cartesian(cols)

def mandelbrot_wrapper(row, col):
    x = col/(num_pixels/4.) - 2.
    y = row/(num_pixels/4.) - 2.

    return ((row, col), P2.mandelbrot(x, y))

########### Different from part A: load balancing! ########
new_indices = indices.repartition(100) # Randomly throw jobs between partitions

mandelbrot_load_balanced = new_indices.map(lambda a: mandelbrot_wrapper(*a))

summed_rdd = P2.sum_values_for_partitions(mandelbrot_load_balanced)
summed_result = summed_rdd.collect()

# Now collect the data & plot
plt.hist(summed_result, bins=np.logspace(3, 8, 20))
sns.rugplot(summed_result, color='red')
plt.gca().set_xscale('log')
plt.xlabel('Total Number of Iterations on Partition')
plt.ylabel('Partition Count')
plt.title('Number of Iterations on each Partition')

plt.savefig('P2b_alternative_hist.png', dpi=200, bbox_inches='tight')

Пример #2

Показать файл

    return ((row, col), P2.mandelbrot(x, y))


mandelbrot_rdd = indices.map(lambda a: mandelbrot_wrapper(*a))

# Now collect the data & plot
mandelbrot_result = mandelbrot_rdd.collect()

plt.grid(False)
# I slightly redefined the draw image function as the original
# implementation annoyed me...I did not want to collect in a draw function!
P2.draw_image(data=mandelbrot_result)

plt.savefig('P2a_mandelbrot.png', dpi=200, bbox_inches='tight')

plt.clf()

# Now create the histogram...I recognize that mandelbrot is computed twice
# but it is for my sanity
summed_rdd = P2.sum_values_for_partitions(mandelbrot_rdd)
summed_result = summed_rdd.collect()

plt.hist(summed_result, bins=np.logspace(3, 8, 20))
sns.rugplot(summed_result, color='red')
plt.gca().set_xscale('log')
plt.xlabel('Total Number of Iterations on Partition')
plt.ylabel('Partition Count')
plt.title('Number of Iterations on each Partition')

plt.savefig('P2a_hist.png', dpi=200, bbox_inches='tight')

Пример #3

Показать файл

Файл: P2a.py Проект: btweinstein/cs205-homework

    y = row/(num_pixels/4.) - 2.

    return ((row, col), P2.mandelbrot(x, y))

mandelbrot_rdd = indices.map(lambda a: mandelbrot_wrapper(*a))

# Now collect the data & plot
mandelbrot_result = mandelbrot_rdd.collect()

plt.grid(False)
# I slightly redefined the draw image function as the original
# implementation annoyed me...I did not want to collect in a draw function!
P2.draw_image(data=mandelbrot_result)

plt.savefig('P2a_mandelbrot.png', dpi=200, bbox_inches='tight')

plt.clf()

# Now create the histogram...I recognize that mandelbrot is computed twice
# but it is for my sanity
summed_rdd = P2.sum_values_for_partitions(mandelbrot_rdd)
summed_result = summed_rdd.collect()

plt.hist(summed_result, bins=np.logspace(3, 8, 20))
sns.rugplot(summed_result, color='red')
plt.gca().set_xscale('log')
plt.xlabel('Total Number of Iterations on Partition')
plt.ylabel('Partition Count')
plt.title('Number of Iterations on each Partition')

plt.savefig('P2a_hist.png', dpi=200, bbox_inches='tight')

Пример #4

Показать файл

Файл: P2b.py Проект: btweinstein/cs205-homework

partition_vs_expensive_task = labeled_expensive_tasks.map(
    lambda x: (x[1] % num_partitions, x[0]))

# Get cheap tasks ready to process
cheap_tasks = indices_vs_expensive.filter(lambda x: x[1] == 0)
cheap_tasks = cheap_tasks.map(lambda x: x[0])
labeled_cheap_tasks = cheap_tasks.zipWithIndex()
partition_vs_cheap_task = labeled_cheap_tasks.map(
    lambda x: (x[1] % num_partitions, x[0]))

# Combine cheap & expensive tasks, now designated to an appropriate partition
partition_vs_ij = partition_vs_expensive_task.union(partition_vs_cheap_task)
# Sort data into the correct partition...sorted by key!
sorted_by_partition = partition_vs_ij.sortByKey(numPartitions=100)

mandelbrot_load_balanced = sorted_by_partition.map(
    lambda a: mandelbrot_wrapper(*a[1]))

summed_rdd = P2.sum_values_for_partitions(mandelbrot_load_balanced)
summed_result = summed_rdd.collect()

# Now collect the data & plot
plt.hist(summed_result, bins=np.logspace(3, 8, 20))
sns.rugplot(summed_result, color='red')
plt.gca().set_xscale('log')
plt.xlabel('Total Number of Iterations on Partition')
plt.ylabel('Partition Count')
plt.title('Number of Iterations on each Partition')

plt.savefig('P2b_hist.png', dpi=200, bbox_inches='tight')