python无法打开hdf5_保存到hdf5非常慢(Python冻结)

我正在尝试将瓶颈值保存到新创建的hdf5文件中.

瓶颈值分批形成(120,10,10,2048).

保存一个单独的批次占用超过16个演出,python似乎在那一批冻结.根据最近的调查结果(见更新,似乎hdf5占用大内存是可以的,但冻结部分似乎是一个小故障.

我只是试图保存前两批用于测试目的,而且仅用于测试

训练数据集(再次,这是一个测试运行),但我甚至无法通过第一批.它只是在第一批停止并且不循环到下一次迭代.如果我尝试检查hdf5,资源管理器将变得缓慢,Python将冻结.如果我试图杀死Python(即使没有检查hdf5文件),Python也无法正常关闭并强制重启.

这是相关的代码和数据：

总数据点约为90,000 ish,分批发布120个.

Bottleneck shape is (120,10,10,2048)

所以我试图保存的第一批是(120,10,10,2048)

以下是我尝试保存数据集的方法：

with h5py.File(hdf5_path, mode='w') as hdf5:

hdf5.create_dataset("train_bottle", train_shape, np.float32)

hdf5.create_dataset("train_labels", (len(train.filenames), params['bottle_labels']),np.uint8)

hdf5.create_dataset("validation_bottle", validation_shape, np.float32)

hdf5.create_dataset("validation_labels",

(len(valid.filenames),params['bottle_labels']),np.uint8)

#this first part above works fine

current_iteration = 0

print('created_datasets')

for x, y in train:

number_of_examples = len(train.filenames) # number of images

prediction = model.predict(x)

labels = y

print(prediction.shape) # (120,10,10,2048)

print(y.shape) # (120, 12)

print('start',current_iteration*params['batch_size']) # 0

print('end',(current_iteration+1) * params['batch_size']) # 120

hdf5["train_bottle"][current_iteration*params['batch_size']: (current_iteration+1) * params['batch_size'],...] = prediction

hdf5["train_labels"][current_iteration*params['batch_size']: (current_iteration+1) * params['batch_size'],...] = labels

current_iteration += 1

print(current_iteration)

if current_iteration == 3:

break

这是print语句的输出：

(90827, 10, 10, 2048) # print(train_shape)

(6831, 10, 10, 2048) # print(validation_shape)

created_datasets

(120, 10, 10, 2048) # print(prediction.shape)

(120, 12) #label.shape

start 0 #start of batch

end 120 #end of batch

# Just stalls here instead of printing `print(current_iteration)`

它只是在这里暂停(20分钟),并且hdf5文件的大小逐渐增大(现在大约20演出,在我强行杀死之前).实际上我甚至不能用任务管理器强制杀死,我必须重新启动操作系统,在这种情况下实际杀死Python.

更新

在玩了我的代码之后,似乎有一个奇怪的错误/行为.

python无法打开hdf5_保存到hdf5非常慢(Python冻结)

相关文章

python request 留位置4

收藏表数据库_选择您的收藏库

matlab norm向量和矩阵的范数

不能启动的问题社区版安装后_CentOS7下安装docker（亲测+完整）

关于数据可视化页面制作

GitHub Research：超过50％的Java记录语句写错了

matlab rgb2gray的实现

8k分辨率需要多大带宽_又一支持8K分辨率的接口标准发布

白话解说TCP/IP协议三次握手和四次挥手

matlab 去除pdf文档水印

音频信号发生器_1957年，DIY的Hi-Fi 电唱机单电子管音频发生器的音质保真度高...

tensorflow 启动Session(tf.Session()，tf.InteractivesSession()，tf.train.Supervisor().managed_session() )

终极Java日志字典：开发人员最常记录的单词是什么？

iwrite提交不了作业_iWrite英语写作教学与评阅系统移动端——学生使用手册

阻塞IO与非阻塞IO

matlab的输出(命令窗口、fprint函数、disp函数)

g2 折线图点与点之间直线_科学网—ggplot2实现散点折线图 - 肖斌的博文

matlab 字符串处理

【c#基础】泛型

nacos怎么修改服务分组_nacos服务注册如何配置分组？