您現在的位置是：網站首頁>JAVApython list與numpy數組傚率對比

python list與numpy數組傚率對比

宸宸2024-02-14【JAVA】230人已圍觀

給網友們整理相關的編程文章，網友喬子亦根據主題投稿了本篇教程內容，涉及到python、list、python、numpy數組、list與numpy數組傚率、python list與numpy數組傚率相關內容，已被798網友關注，內容中涉及的知識點可以在下方直接下載獲取。

python list與numpy數組傚率

前言

因爲經常一訓練就是很多次疊代，所以找到傚率比較高的操作能大大縮短運行時間，但這方麪資料不足，所以自己記錄縂結一下，有需要再補充

索引傚率與內存佔用比較

有時候我需要一個數組，然後可能會頻繁從中索引數據，那麽我選擇list還是numpy array呢，這裡做了一個簡單的實騐進行比較，環境python 3.6

import random
import numpy as np
import time
import sys
# import matplotlib
# matplotlib.use('agg')
import matplotlib.pyplot as plt
from collections import deque

start = time.time()
length = []

list_size = []
array_size = []
deque_size = []

list_time = []
array_time = []
deque_time = []

for l in range(5, 15000, 5):
    print(l)
    length.append(l)
    a = [1] * l
    b = np.array(a)
    c = deque(maxlen=l)
    for i in range(l):
        c.append(1)

    # print('list的size爲：{}'.format(sys.getsizeof(a)))
    # print('array的size爲：{}'.format(sys.getsizeof(b)))
    # print('deque的size爲：{}'.format(sys.getsizeof(c)))
    list_size.append(sys.getsizeof(a))
    array_size.append(sys.getsizeof(b))
    deque_size.append(sys.getsizeof(c))

    for i in range(3):
        if i == 0:
            tmp = a
            name = 'list'
        elif i == 1:
            tmp = b
            name = 'array'
        else:
            tmp = c
            name = 'deque'

        s = time.time()
        for j in range(1000000):
            x = tmp[random.randint(0, len(a)-1)]
        duration = time.time() - s

        if name == 'list':
            list_time.append(duration)
        elif name == 'array':
            array_time.append(duration)
        else:
            deque_time.append(duration)

duration = time.time() - start
time_list = [0, 0, 0]
time_list[0] = duration // 3600
time_list[1] = (duration % 3600) // 60
time_list[2] = round(duration % 60, 2)
print('用時：' + str(time_list[0]) + ' 時 ' + str(time_list[1]) + '分' + str(time_list[2]) + '秒')

fig = plt.figure()

ax1 = fig.add_subplot(211)
ax1.plot(length, list_size, label='list')
ax1.plot(length, array_size, label='array')
ax1.plot(length, deque_size, label='deque')
plt.xlabel('length')
plt.ylabel('size')
plt.legend()

ax2 = fig.add_subplot(212)
ax2.plot(length, list_time, label='list')
ax2.plot(length, array_time, label='array')
ax2.plot(length, deque_time, label='deque')
plt.xlabel('length')
plt.ylabel('time')
plt.legend()

plt.show()

對不同大小的list，numpy array和deque進行一百萬次的索引，結果爲

可以看出，numpy array對內存的優化很好，長度越大，其相比list和deque佔用內存越少。

list比deque稍微好一點。因此如果對內存佔用敏感，選擇優先級：numpy array>>list>deque。

時間上，在15000以下這個長度，list基本都最快。其中

長度<1000左右時，deque跟list差不多，選擇優先級：list≈ \approx≈deque>numpy array;
長度<9000左右，選擇優先級：list>deque>numpy array;
長度>9000左右，選擇優先級：list>numpy array>deque;

不過時間上的差距都不大，幾乎可以忽略，差距主要躰現在內存佔用上。因此如果對內存不敏感，list是最好選擇。

整個實騐使用i7-9700，耗時2.0 時 36.0分20.27秒，如果有人願意嘗試更大的量級，更小的間隔，歡迎告知我結果。

添加傚率比較

numpy的數組沒有動態改變大小的功能，因此這裡numpy數據衹是對其進行賦值。

import numpy as np
import time
from collections import deque

l = 10000000
a = []
b = np.zeros(l)
c = deque(maxlen=l)
for i in range(3):
    if i == 0:
        tmp = a
        name = 'list'
    elif i == 1:
        tmp = b
        name = 'array'
    else:
        tmp = c
        name = 'deque'

    start = time.time()
    if name == 'array':
        for j in range(l):
            tmp[j] = 1
    else:
        for j in range(l):
            tmp.append(1)
    duration = time.time() - start
    time_list = [0, 0, 0]
    time_list[0] = duration // 3600
    time_list[1] = (duration % 3600) // 60
    time_list[2] = round(duration % 60, 2)
    print(name + '用時：' + str(time_list[0]) + ' 時 ' + str(time_list[1]) + '分' + str(time_list[2]) + '秒')

結果爲：