您現在的位置是：網站首頁>JAVAPython中的常見數據集打亂方法

Python中的常見數據集打亂方法

宸宸2024-06-13【JAVA】221人已圍觀

爲找教程的網友們整理了相關的編程文章，網友宰俊喆根據主題投稿了本篇教程內容，涉及到Python數據集打亂、Python數據集、Python數據集打亂方法、Python數據集打亂方法相關內容，已被161網友關注，如果對知識點想更進一步了解可以在下方電子資料中獲取。

Python數據集打亂方法

python常見的數據集打亂方法

第一種方法

通過index

x_train, y_train=train_load()

index = [i for i in range(len(x_train))]

np.random.shuffle(index)

x_train= x_train[index]

y_train = y_train[index]

第二種方法

zip()+shuffle()方法

x_train, y_train=train_load()
result = list(zip(x_train, y_train))  # 打亂的索引序列
np.random.shuffle(result)
x_train,y_train = zip(*result)

第三種方法

seed()+shuffle

x_batch, y_batch = train_load()
#加載我所有的數據,這裡想x_batch,Y_batch是list的格式,要注意

seed=100
random.seed(seed)
random.shuffle(x_batch)
random.seed(seed)#一定得重複在寫一遍,和上麪的seed要相同,不然y_batch和x_batch打亂順序會不一樣
random.shuffle(y_batch)

PS：numpy中函數shuffle與permutation都是對原來的數組隨機打亂原來的順序，shuffle中文含義爲洗牌，permutation中文含義爲排列，區別在於shuffle直接在原來的數組上進行操作，改變原來數組的順序，無返廻值。

而permutation不直接在原來的數組上進行操作，而是返廻一個新的打亂順序的數組，竝不改變原來的數組。

python手動打亂數據集

x_train, y_train = np.array(x_train),np.array(y_train)
index = [i for i in range(len(y_train))]
np.random.shuffle(index)
x_train = x_train[index]
y_train = y_train[index]