python /home/admin/mtr/script_for_cron.py -j default -m 20 -a 'python3 ~/workarea/git/Velours/python/prod/datou.py -j batch_current -C 2497290' -s traitement_3459 -M 0 -S 0 -U 100,80,95
 import MySQLdb succeeded 
Import error (python version)
['/Users/moilerat/Documents/Fotonower/install/caffe/distribute/python', '/home/admin/workarea/git/Velours/python/prod', '/home/admin/workarea/install/darknet', '/home/admin/workarea/git/Velours/python', '/home/admin/workarea/install/caffe_frcnn_python3/py-faster-rcnn/caffe-fast-rcnn/python', '/home/admin/mtr/.credentials', '/home/admin/workarea/install/caffe/python', '/home/admin/workarea/install/caffe_frcnn/py-faster-rcnn/tools', '/home/admin/workarea/git/fotonowerpip', '/home/admin/workarea/install/segment-anything', '/home/admin/workarea/git/pyfvs', '/home/admin/workarea/git/apy', '/usr/lib/python38.zip', '/usr/lib/python3.8', '/usr/lib/python3.8/lib-dynload', '/home/admin/.local/lib/python3.8/site-packages', '/usr/local/lib/python3.8/dist-packages', '/usr/lib/python3/dist-packages']
process id : 3207435
load datou : 0
# VR 17-11-17 : to create in DB !
Here we check the datou graph and we reorder steps !
 Tree builded and cycle checked, now we need to re-order the steps ! 
We have currenlty an error because there is no dependence between the last step for the case tile - detect - glue
We can either keep the depence of, it is better to keep an order compatible with the id of steps if we do not have sons, so a lexical order : (number_son, step_id)
 DONE and to test : checkNoCycle !
Here we check the consistency of inputs/outputs number between the given ones and the db !
eke 1-6-18 : checkConsistencyNbInputNbOutput should be processed after step reordering !
WARNING : step 0 init_dummy_multi_datou is not linked in the step_by_step architecture !
WARNING : step 1294 init_dummy_multi_datou is not linked in the step_by_step architecture !
Number of inputs / outputs for each step checked !
Here we check the consistency of outputs/inputs types during steps connections
eke 1-6-18 : checkConsistencyTypeOutputInput should be processed after checkConsistencyNbInputNbOutput !
DataTypes for each output/input checked !
Unexpected type for variable list_input_json
ERROR or WARNING : can't parse json string Expecting value: line 1 column 1 (char 0)
 Tried to parse : 
(photo_id, hashtag_id, score_max) was removed should we ?
(x0, y0, x1, y1) was removed should we ?
chemin de la photo was removed should we ?
(photo_id, hashtag_id, score_max) was removed should we ?
(x0, y0, x1, y1) was removed should we ?
chemin de la photo was removed should we ?
load thcls
load pdts
Running datou job : batch_current
 TODO datou_current to load to do maybe to take outside batchDatouExec 
updating current state to 1
list_input_json: []
Current got : datou_id : 3459, datou_cur_ids : ['2497290'] with mtr_portfolio_ids : ['19763324'] and first list_photo_ids : []
new path :  /proc/3207435/
Inside batchDatouExec : verbose : 0
# VR 17-11-17 : to create in DB !
Here we check the datou graph and we reorder steps !
 Tree builded and cycle checked, now we need to re-order the steps ! 
We have currenlty an error because there is no dependence between the last step for the case tile - detect - glue
We can either keep the depence of, it is better to keep an order compatible with the id of steps if we do not have sons, so a lexical order : (number_son, step_id)
All sons are already in current list !
All sons are already in current list !
All sons are already in current list !
All sons are already in current list !
All sons are already in current list !
All sons are already in current list !
All sons are already in current list !
All sons are already in current list !
All sons are already in current list !
All sons are already in current list !
 DONE and to test : checkNoCycle !
Here we check the consistency of inputs/outputs number between the given ones and the db !
eke 1-6-18 : checkConsistencyNbInputNbOutput should be processed after step reordering !
WARNING : number of outputs for step 11449 mask_detect is not consistent : 3 used against 2 in the step definition !
Step 11452 crop_condition have less inputs used (1) than in the step definition (2) : maybe we manage optionnal inputs !
Step 11452 crop_condition have less outputs used (2) than in the step definition (3) : some outputs may be not used !
Step 11453 merge_mask_thcl_custom have less inputs used (2) than in the step definition (3) : maybe we manage optionnal inputs !
WARNING : number of outputs for step 11453 merge_mask_thcl_custom is not consistent : 4 used against 2 in the step definition !
WARNING : number of inputs for step 11454 rle_unique_nms_with_priority is not consistent : 2 used against 1 in the step definition !
WARNING : number of outputs for step 11454 rle_unique_nms_with_priority is not consistent : 2 used against 1 in the step definition !
Step 11478 crop_condition have less inputs used (1) than in the step definition (2) : maybe we manage optionnal inputs !
WARNING : number of outputs for step 11478 crop_condition is not consistent : 4 used against 3 in the step definition !
WARNING : number of inputs for step 11456 ventilate_hashtags_in_portfolio is not consistent : 2 used against 1 in the step definition !
WARNING : number of outputs for step 11456 ventilate_hashtags_in_portfolio is not consistent : 2 used against 1 in the step definition !
Step 11455 final have less inputs used (2) than in the step definition (3) : maybe we manage optionnal inputs !
Step 11455 final have less outputs used (1) than in the step definition (2) : some outputs may be not used !
Step 11458 send_mail_cod have less inputs used (3) than in the step definition (5) : maybe we manage optionnal inputs !
Number of inputs / outputs for each step checked !
Here we check the consistency of outputs/inputs types during steps connections
eke 1-6-18 : checkConsistencyTypeOutputInput should be processed after checkConsistencyNbInputNbOutput !
WARNING : type of output 2 of step 11449 doesn't seem to be define in the database(
WARNING : type of input 2 of step 11452 doesn't seem to be define in the database(
WARNING : output 1 of step 11449 have datatype=2 whereas input 1 of step 11453 have datatype=7
WARNING : type of output 2 of step 11453 doesn't seem to be define in the database(
WARNING : type of input 1 of step 11454 doesn't seem to be define in the database(
We ignore checkConsistencyTypeOutputInput for datou_step final !
WARNING : type of output 3 of step 11453 doesn't seem to be define in the database(
WARNING : type of input 1 of step 11456 doesn't seem to be define in the database(
WARNING : type of output 1 of step 11456 doesn't seem to be define in the database(
WARNING : type of input 3 of step 11455 doesn't seem to be define in the database(
We ignore checkConsistencyTypeOutputInput for datou_step final !
We ignore checkConsistencyTypeOutputInput for datou_step final !
WARNING : output 0 of step 11456 have datatype=10 whereas input 3 of step 11458 have datatype=6
WARNING : type of input 5 of step 11458 doesn't seem to be define in the database(
WARNING : output 0 of step 11477 have datatype=11 whereas input 5 of step 11458 have datatype=None
WARNING : output 0 of step 11456 have datatype=10 whereas input 0 of step 11477 have datatype=18
WARNING : type of input 2 of step 11478 doesn't seem to be define in the database(
WARNING : output 1 of step 11454 have datatype=7 whereas input 2 of step 11478 have datatype=None
WARNING : type of output 3 of step 11478 doesn't seem to be define in the database(
WARNING : type of input 2 of step 11456 doesn't seem to be define in the database(
WARNING : output 0 of step 11453 have datatype=1 whereas input 0 of step 11454 have datatype=2
DataTypes for each output/input checked !
List Step Type Loaded in datou : mask_detect, crop_condition, thcl, merge_mask_thcl_custom, rle_unique_nms_with_priority, crop_condition, ventilate_hashtags_in_portfolio, final, velours_tree, send_mail_cod, split_time_score
over limit max, limiting to limit_max 20
list_input_json : []
origin
We have 1 , 
WARNING: data may be incomplete, need to offset and complete !
BFBFBFBFBFBFwe have missing 0 photos in the step downloads : 
photo missing : []
try to delete the photos missing in DB
length of list_filenames : 6 ; length of list_pids : 6 ; length of list_args : 6
time to download the photos : 1.3552672863006592
About to test input to load 
we should then remove the video here, and this would fix the bug of datou_current ! 
Calling datou_exec

Inside datou_exec : verbose : 0
number of steps :  11
step1:mask_detect
Fri Feb  7 11:21:32 2025
VR 17-11-17 : now, only for linear exec dependencies tree, some output goes to fill the input of the next
VR 22-3-18 : now we test the dependencies tree, but keep two separate code for datou_prepare_output_input until the code is correctly tested, clean and works in both case
VR 22-3-18 : but we use the first code for the first step id = -1, build in the code of datou_exec 
VR 22-3-18 : we should manage here the case when we are at the first step instead of building this step before datou_exec
Beginning of datou step mask_detect !
save_polygon : True

 begin detect 

begin to check gpu status 
inside check gpu memory
l 3637 free memory gpu now : 4899  max_wait_temp :  1 max_wait :  0
 gpu_flag :  0
2025-02-07 11:21:35.598463: I tensorflow/core/platform/cpu_feature_guard.cc:143] Your CPU supports instructions that this TensorFlow binary was not compiled to use: AVX2 FMA
2025-02-07 11:21:35.623419: I tensorflow/core/platform/profile_utils/cpu_utils.cc:102] CPU Frequency: 3493065000 Hz
2025-02-07 11:21:35.624940: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x7f8260000b60 initialized for platform Host (this does not guarantee that XLA will be used). Devices:
2025-02-07 11:21:35.624962: I tensorflow/compiler/xla/service/service.cc:176]   StreamExecutor device (0): Host, Default Version
2025-02-07 11:21:35.627928: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcuda.so.1
2025-02-07 11:21:35.771835: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x2e0e7030 initialized for platform CUDA (this does not guarantee that XLA will be used). Devices:
2025-02-07 11:21:35.771893: I tensorflow/compiler/xla/service/service.cc:176]   StreamExecutor device (0): NVIDIA GeForce RTX 2080 Ti, Compute Capability 7.5
2025-02-07 11:21:35.773142: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1561] Found device 0 with properties: 
pciBusID: 0000:41:00.0 name: NVIDIA GeForce RTX 2080 Ti computeCapability: 7.5
coreClock: 1.545GHz coreCount: 68 deviceMemorySize: 10.76GiB deviceMemoryBandwidth: 573.69GiB/s
2025-02-07 11:21:35.773548: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcudart.so.10.1
2025-02-07 11:21:35.776654: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcublas.so.10
2025-02-07 11:21:35.780141: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcufft.so.10
2025-02-07 11:21:35.780525: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcurand.so.10
2025-02-07 11:21:35.783298: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcusolver.so.10
2025-02-07 11:21:35.784704: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcusparse.so.10
2025-02-07 11:21:35.789820: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcudnn.so.7
2025-02-07 11:21:35.791496: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1703] Adding visible gpu devices: 0
2025-02-07 11:21:35.791581: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcudart.so.10.1
2025-02-07 11:21:35.792226: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1102] Device interconnect StreamExecutor with strength 1 edge matrix:
2025-02-07 11:21:35.792241: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1108]      0 
2025-02-07 11:21:35.792250: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1121] 0:   N 
2025-02-07 11:21:35.793330: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1247] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 6435 MB memory) -> physical GPU (device: 0, name: NVIDIA GeForce RTX 2080 Ti, pci bus id: 0000:41:00.0, compute capability: 7.5)
WARNING:tensorflow:From /home/admin/workarea/git/Velours/python/mtr/mask_rcnn/mask_detection.py:69: The name tf.keras.backend.set_session is deprecated. Please use tf.compat.v1.keras.backend.set_session instead.

2025-02-07 11:21:36.053544: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1561] Found device 0 with properties: 
pciBusID: 0000:41:00.0 name: NVIDIA GeForce RTX 2080 Ti computeCapability: 7.5
coreClock: 1.545GHz coreCount: 68 deviceMemorySize: 10.76GiB deviceMemoryBandwidth: 573.69GiB/s
2025-02-07 11:21:36.053668: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcudart.so.10.1
2025-02-07 11:21:36.054248: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcublas.so.10
2025-02-07 11:21:36.054273: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcufft.so.10
2025-02-07 11:21:36.054293: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcurand.so.10
2025-02-07 11:21:36.054313: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcusolver.so.10
2025-02-07 11:21:36.054332: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcusparse.so.10
2025-02-07 11:21:36.054352: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcudnn.so.7
2025-02-07 11:21:36.055648: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1703] Adding visible gpu devices: 0
2025-02-07 11:21:36.057010: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1561] Found device 0 with properties: 
pciBusID: 0000:41:00.0 name: NVIDIA GeForce RTX 2080 Ti computeCapability: 7.5
coreClock: 1.545GHz coreCount: 68 deviceMemorySize: 10.76GiB deviceMemoryBandwidth: 573.69GiB/s
2025-02-07 11:21:36.057141: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcudart.so.10.1
2025-02-07 11:21:36.057169: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcublas.so.10
2025-02-07 11:21:36.057194: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcufft.so.10
2025-02-07 11:21:36.057218: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcurand.so.10
2025-02-07 11:21:36.057242: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcusolver.so.10
2025-02-07 11:21:36.057266: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcusparse.so.10
2025-02-07 11:21:36.057291: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcudnn.so.7
2025-02-07 11:21:36.058710: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1703] Adding visible gpu devices: 0
2025-02-07 11:21:36.058756: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1102] Device interconnect StreamExecutor with strength 1 edge matrix:
2025-02-07 11:21:36.058768: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1108]      0 
2025-02-07 11:21:36.058778: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1121] 0:   N 
2025-02-07 11:21:36.060256: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1247] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 6435 MB memory) -> physical GPU (device: 0, name: NVIDIA GeForce RTX 2080 Ti, pci bus id: 0000:41:00.0, compute capability: 7.5)
Using TensorFlow backend.
WARNING:tensorflow:From /home/admin/workarea/install/Mask_RCNN/model.py:396: calling crop_and_resize_v1 (from tensorflow.python.ops.image_ops_impl) with box_ind is deprecated and will be removed in a future version.
Instructions for updating:
box_ind is deprecated, use box_indices instead
WARNING:tensorflow:From /home/admin/workarea/install/Mask_RCNN/model.py:703: to_int32 (from tensorflow.python.ops.math_ops) is deprecated and will be removed in a future version.
Instructions for updating:
Use `tf.cast` instead.
WARNING:tensorflow:From /home/admin/workarea/install/Mask_RCNN/model.py:729: to_float (from tensorflow.python.ops.math_ops) is deprecated and will be removed in a future version.
Instructions for updating:
Use `tf.cast` instead.
Inside mask_sub_process
Inside mask_detect 
About to load cache.load_thcl_param 
To do loadFromThcl(), then load ParamDescType : thcl2896
 thcls : [{'id': 2896, 'mtr_user_id': 31, 'name': 'learn_convoyeur_qualipapia_nantes_poly_100521_1', 'pb_hashtag_id': 0, 'live': b'\x00', 'list_hashtags': 'background,carton_brun,carton_gris,cartonnette,kraft,autre_refus,metal,plastique,teint_dans_la_masse,environnement', 'svm_portfolios_learning': '0,0,0,0,0,0,0,0,0,0', 'photo_hashtag_type': 3663, 'photo_desc_type': 5309, 'type_classification': 'mask_rcnn', 'hashtag_id_list': '0,0,0,0,0,0,0,0,0,0'}]
 thcl {'id': 2896, 'mtr_user_id': 31, 'name': 'learn_convoyeur_qualipapia_nantes_poly_100521_1', 'pb_hashtag_id': 0, 'live': b'\x00', 'list_hashtags': 'background,carton_brun,carton_gris,cartonnette,kraft,autre_refus,metal,plastique,teint_dans_la_masse,environnement', 'svm_portfolios_learning': '0,0,0,0,0,0,0,0,0,0', 'photo_hashtag_type': 3663, 'photo_desc_type': 5309, 'type_classification': 'mask_rcnn', 'hashtag_id_list': '0,0,0,0,0,0,0,0,0,0'}
 Update svm_hashtag_type_desc : 5309
FOUND : 1
 Here is data_from_sql_as_vec to set the ParamDescriptorType : (5309, 'learn_convoyeur_qualipapia_nantes_poly_100521_1', 16384, 25088, 'learn_convoyeur_qualipapia_nantes_poly_100521_1', 'pool5', 10.0, None, None, 256, None, 0, None, 8, None, None, -1000.0, 1, datetime.datetime(2021, 5, 10, 19, 20, 46), datetime.datetime(2021, 5, 10, 19, 20, 46))
{'thcl': {'id': 2896, 'mtr_user_id': 31, 'name': 'learn_convoyeur_qualipapia_nantes_poly_100521_1', 'pb_hashtag_id': 0, 'live': b'\x00', 'list_hashtags': 'background,carton_brun,carton_gris,cartonnette,kraft,autre_refus,metal,plastique,teint_dans_la_masse,environnement', 'svm_portfolios_learning': '0,0,0,0,0,0,0,0,0,0', 'photo_hashtag_type': 3663, 'photo_desc_type': 5309, 'type_classification': 'mask_rcnn', 'hashtag_id_list': '0,0,0,0,0,0,0,0,0,0'}, 'list_hashtags': ['background', 'carton_brun', 'carton_gris', 'cartonnette', 'kraft', 'autre_refus', 'metal', 'plastique', 'teint_dans_la_masse', 'environnement'], 'list_hashtags_csv': 'background,carton_brun,carton_gris,cartonnette,kraft,autre_refus,metal,plastique,teint_dans_la_masse,environnement', 'svm_portfolios_learning': '0,0,0,0,0,0,0,0,0,0', 'photo_hashtag_type': 3663, 'svm_hashtag_type_desc': 5309, 'photo_desc_type': 5309, 'pb_hashtag_id_or_classifier': 0}
list_class_names : 
['background', 'carton_brun', 'carton_gris', 'cartonnette', 'kraft', 'autre_refus', 'metal', 'plastique', 'teint_dans_la_masse', 'environnement']

Configurations:
BACKBONE                       resnet101
BACKBONE_SHAPES                [[160 160]
 [ 80  80]
 [ 40  40]
 [ 20  20]
 [ 10  10]]
BACKBONE_STRIDES               [4, 8, 16, 32, 64]
BATCH_SIZE                     1
BBOX_STD_DEV                   [0.1 0.1 0.2 0.2]
DETECTION_MAX_INSTANCES        100
DETECTION_MIN_CONFIDENCE       0.3
DETECTION_NMS_THRESHOLD        0.3
GPU_COUNT                      1
IMAGES_PER_GPU                 1
IMAGE_MAX_DIM                  640
IMAGE_MIN_DIM                  640
IMAGE_PADDING                  True
IMAGE_SHAPE                    [640 640   3]
LEARNING_MOMENTUM              0.9
LEARNING_RATE                  0.001
LOSS_WEIGHTS                   {'rpn_class_loss': 1.0, 'rpn_bbox_loss': 1.0, 'mrcnn_class_loss': 1.0, 'mrcnn_bbox_loss': 1.0, 'mrcnn_mask_loss': 1.0}
MASK_POOL_SIZE                 14
MASK_SHAPE                     [28, 28]
MAX_GT_INSTANCES               100
MEAN_PIXEL                     [123.7 116.8 103.9]
MINI_MASK_SHAPE                (56, 56)
NAME                           learn_convoyeur_qualipapia_nantes_poly_100521_1
NUM_CLASSES                    10
POOL_SIZE                      7
POST_NMS_ROIS_INFERENCE        1000
POST_NMS_ROIS_TRAINING         2000
ROI_POSITIVE_RATIO             0.33
RPN_ANCHOR_RATIOS              [0.5, 1, 2]
RPN_ANCHOR_SCALES              (16, 32, 64, 128, 256)
RPN_ANCHOR_STRIDE              1
RPN_BBOX_STD_DEV               [0.1 0.1 0.2 0.2]
RPN_NMS_THRESHOLD              0.7
RPN_TRAIN_ANCHORS_PER_IMAGE    256
STEPS_PER_EPOCH                1000
TRAIN_ROIS_PER_IMAGE           200
USE_MINI_MASK                  True
USE_RPN_ROIS                   True
VALIDATION_STEPS               50
WEIGHT_DECAY                   0.0001


model_param file didn't exist 
model_name : learn_convoyeur_qualipapia_nantes_poly_100521_1
model_type : mask_rcnn
list file need :
['mask_model.h5']
file exist in s3 :
['mask_model.h5']
file manque in s3 : 
[]
2025-02-07 11:21:45.870563: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcublas.so.10
2025-02-07 11:21:46.247699: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcudnn.so.7
2025-02-07 11:21:47.699100: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.699728: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 3.60G (3865470464 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.700326: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 3.24G (3478923264 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.700885: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 2.92G (3131030784 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.701451: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 2.62G (2817927680 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.702054: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 2.36G (2536134912 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.702618: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 2.12G (2282521344 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.702654: W tensorflow/core/common_runtime/bfc_allocator.cc:245] Allocator (GPU_0_bfc) ran out of memory trying to allocate 2.06GiB with freed_by_count=0. The caller indicates that this is not a failure, but may mean that there could be performance gains if more memory were available.
2025-02-07 11:21:47.703268: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.703287: W tensorflow/core/common_runtime/bfc_allocator.cc:245] Allocator (GPU_0_bfc) ran out of memory trying to allocate 2.06GiB with freed_by_count=0. The caller indicates that this is not a failure, but may mean that there could be performance gains if more memory were available.
2025-02-07 11:21:47.709590: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.709634: W tensorflow/core/common_runtime/bfc_allocator.cc:245] Allocator (GPU_0_bfc) ran out of memory trying to allocate 2.06GiB with freed_by_count=0. The caller indicates that this is not a failure, but may mean that there could be performance gains if more memory were available.
2025-02-07 11:21:47.710207: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.710241: W tensorflow/core/common_runtime/bfc_allocator.cc:245] Allocator (GPU_0_bfc) ran out of memory trying to allocate 2.06GiB with freed_by_count=0. The caller indicates that this is not a failure, but may mean that there could be performance gains if more memory were available.
2025-02-07 11:21:47.716305: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.716347: W tensorflow/core/common_runtime/bfc_allocator.cc:245] Allocator (GPU_0_bfc) ran out of memory trying to allocate 466.56MiB with freed_by_count=0. The caller indicates that this is not a failure, but may mean that there could be performance gains if more memory were available.
2025-02-07 11:21:47.716925: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.716943: W tensorflow/core/common_runtime/bfc_allocator.cc:245] Allocator (GPU_0_bfc) ran out of memory trying to allocate 466.56MiB with freed_by_count=0. The caller indicates that this is not a failure, but may mean that there could be performance gains if more memory were available.
2025-02-07 11:21:47.743087: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.743145: W tensorflow/core/common_runtime/bfc_allocator.cc:245] Allocator (GPU_0_bfc) ran out of memory trying to allocate 2.06GiB with freed_by_count=0. The caller indicates that this is not a failure, but may mean that there could be performance gains if more memory were available.
2025-02-07 11:21:47.743723: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.743739: W tensorflow/core/common_runtime/bfc_allocator.cc:245] Allocator (GPU_0_bfc) ran out of memory trying to allocate 2.06GiB with freed_by_count=0. The caller indicates that this is not a failure, but may mean that there could be performance gains if more memory were available.
2025-02-07 11:21:47.749137: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.749163: W tensorflow/core/common_runtime/bfc_allocator.cc:245] Allocator (GPU_0_bfc) ran out of memory trying to allocate 243.25MiB with freed_by_count=0. The caller indicates that this is not a failure, but may mean that there could be performance gains if more memory were available.
2025-02-07 11:21:47.749734: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.749750: W tensorflow/core/common_runtime/bfc_allocator.cc:245] Allocator (GPU_0_bfc) ran out of memory trying to allocate 243.25MiB with freed_by_count=0. The caller indicates that this is not a failure, but may mean that there could be performance gains if more memory were available.
2025-02-07 11:21:47.777986: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.778595: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.780245: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.780827: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.816627: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.817255: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.819215: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.819798: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.844759: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.845366: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.846912: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.847491: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.852991: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.853584: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.855281: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.855863: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.861572: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.862165: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.863715: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.864296: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.891406: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.892027: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.892602: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.893173: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.896823: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.897425: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.913015: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.913617: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.914206: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.914779: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.927350: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.927971: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.928547: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.929119: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.933625: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.934229: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.938998: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.939609: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.951817: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.952416: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.956607: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.957191: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.957763: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.958334: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.959235: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.959826: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.971034: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.971658: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.972258: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.972832: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.973404: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.973994: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.974568: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.975149: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.984740: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.985342: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.991685: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:47.992272: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:48.044924: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:48.046076: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:48.055320: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:48.056231: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:48.072932: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:48.073734: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:48.074472: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:48.075225: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:48.080099: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:48.081092: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:48.082015: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:48.082947: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:48.084635: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:48.084667: W tensorflow/core/kernels/gpu_utils.cc:49] Failed to allocate memory for convolution redzone checking; skipping this check. This is benign and only means that we won't check cudnn for out-of-bounds reads and writes. This message will only be printed once.
2025-02-07 11:21:48.095125: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:48.096097: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:48.105766: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:48.106640: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:48.107528: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:48.108361: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:48.109201: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
2025-02-07 11:21:48.110032: I tensorflow/stream_executor/cuda/cuda_driver.cc:763] failed to allocate 4.00G (4294967296 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory
local folder  : /data/models_weight/learn_convoyeur_qualipapia_nantes_poly_100521_1


/data/models_weight/learn_convoyeur_qualipapia_nantes_poly_100521_1/mask_model.h5
size_local : 256031040 size in s3 : 256031040
create time local : 2021-08-09 05:45:48 create time in s3 : 2021-08-06 18:59:51
mask_model.h5 already exist and didn't need to update
 list_images length : 6
NEW PHOTO
Processing 1 images
image                    shape: (2160, 3840, 3)       min:    1.00000  max:  255.00000
molded_images            shape: (1, 640, 640, 3)      min: -123.70000  max:  140.10000
image_metas              shape: (1, 18)               min:    0.00000  max: 3840.00000
nb d'objets trouves : 8
NEW PHOTO
Processing 1 images
image                    shape: (2160, 3840, 3)       min:    5.00000  max:  255.00000
molded_images            shape: (1, 640, 640, 3)      min: -123.70000  max:  140.85000
image_metas              shape: (1, 18)               min:    0.00000  max: 3840.00000
nb d'objets trouves : 9
NEW PHOTO
Processing 1 images
image                    shape: (2160, 3840, 3)       min:   17.00000  max:  255.00000
molded_images            shape: (1, 640, 640, 3)      min: -123.70000  max:  142.60000
image_metas              shape: (1, 18)               min:    0.00000  max: 3840.00000
nb d'objets trouves : 2
NEW PHOTO
Processing 1 images
image                    shape: (2160, 3840, 3)       min:    1.00000  max:  255.00000
molded_images            shape: (1, 640, 640, 3)      min: -123.70000  max:  138.60000
image_metas              shape: (1, 18)               min:    0.00000  max: 3840.00000
nb d'objets trouves : 2
NEW PHOTO
Processing 1 images
image                    shape: (2160, 3840, 3)       min:    0.00000  max:  255.00000
molded_images            shape: (1, 640, 640, 3)      min: -123.70000  max:  139.35000
image_metas              shape: (1, 18)               min:    0.00000  max: 3840.00000
nb d'objets trouves : 11
NEW PHOTO
Processing 1 images
image                    shape: (2160, 3840, 3)       min:    9.00000  max:  253.00000
molded_images            shape: (1, 640, 640, 3)      min: -123.70000  max:  137.85000
image_metas              shape: (1, 18)               min:    0.00000  max: 3840.00000
nb d'objets trouves : 3
Detection mask done !
Trying to reset tf kernel
3207537
begin to check gpu status 
inside check gpu memory
l 3610 free memory gpu now : 1707
tf kernel not reseted
sub process len(results) : 6 len(list_Values) 0
None
max_time_sub_proc : 3600
parent process len(results) : 6 len(list_Values) 0
process is alive
finish correctly or not : True

 after detect 

begin to check gpu status 
inside check gpu memory
l 3610 free memory gpu now : 2900
list_Values should be empty []
To do loadFromThcl(), then load ParamDescType : thcl2896
Catched exception !
Connect or reconnect !
 thcls : [{'id': 2896, 'mtr_user_id': 31, 'name': 'learn_convoyeur_qualipapia_nantes_poly_100521_1', 'pb_hashtag_id': 0, 'live': b'\x00', 'list_hashtags': 'background,carton_brun,carton_gris,cartonnette,kraft,autre_refus,metal,plastique,teint_dans_la_masse,environnement', 'svm_portfolios_learning': '0,0,0,0,0,0,0,0,0,0', 'photo_hashtag_type': 3663, 'photo_desc_type': 5309, 'type_classification': 'mask_rcnn', 'hashtag_id_list': '0,0,0,0,0,0,0,0,0,0'}]
 thcl {'id': 2896, 'mtr_user_id': 31, 'name': 'learn_convoyeur_qualipapia_nantes_poly_100521_1', 'pb_hashtag_id': 0, 'live': b'\x00', 'list_hashtags': 'background,carton_brun,carton_gris,cartonnette,kraft,autre_refus,metal,plastique,teint_dans_la_masse,environnement', 'svm_portfolios_learning': '0,0,0,0,0,0,0,0,0,0', 'photo_hashtag_type': 3663, 'photo_desc_type': 5309, 'type_classification': 'mask_rcnn', 'hashtag_id_list': '0,0,0,0,0,0,0,0,0,0'}
 Update svm_hashtag_type_desc : 5309
['background', 'carton_brun', 'carton_gris', 'cartonnette', 'kraft', 'autre_refus', 'metal', 'plastique', 'teint_dans_la_masse', 'environnement']
time for calcul the mask position with numpy : 0.011785507202148438
nb_pixel_total : 443927
time to create 1 rle with new method : 0.03153872489929199
length of segment :  847
time for calcul the mask position with numpy : 0.0014276504516601562
nb_pixel_total : 90987
time to create 1 rle with old method : 0.09703278541564941
length of segment :  323
time for calcul the mask position with numpy : 0.0034596920013427734
nb_pixel_total : 193635
time to create 1 rle with new method : 0.01049184799194336
length of segment :  816
time for calcul the mask position with numpy : 0.0021300315856933594
nb_pixel_total : 156439
time to create 1 rle with new method : 0.005483865737915039
length of segment :  656
time for calcul the mask position with numpy : 0.00228118896484375
nb_pixel_total : 102815
time to create 1 rle with old method : 0.10900259017944336
length of segment :  965
time for calcul the mask position with numpy : 0.0005121231079101562
nb_pixel_total : 28596
time to create 1 rle with old method : 0.03141498565673828
length of segment :  308
time for calcul the mask position with numpy : 0.0014767646789550781
nb_pixel_total : 93950
time to create 1 rle with old method : 0.10030889511108398
length of segment :  334
time for calcul the mask position with numpy : 0.003772258758544922
nb_pixel_total : 274752
time to create 1 rle with new method : 0.009830474853515625
length of segment :  815
time for calcul the mask position with numpy : 0.0012962818145751953
nb_pixel_total : 61721
time to create 1 rle with old method : 0.0676724910736084
length of segment :  338
time for calcul the mask position with numpy : 0.0016891956329345703
nb_pixel_total : 91027
time to create 1 rle with old method : 0.1015932559967041
length of segment :  446
time for calcul the mask position with numpy : 0.0016467571258544922
nb_pixel_total : 100435
time to create 1 rle with old method : 0.10652971267700195
length of segment :  385
time for calcul the mask position with numpy : 0.009001731872558594
nb_pixel_total : 456156
time to create 1 rle with new method : 0.023820877075195312
length of segment :  876
time for calcul the mask position with numpy : 0.011943578720092773
nb_pixel_total : 348757
time to create 1 rle with new method : 0.021344900131225586
length of segment :  846
time for calcul the mask position with numpy : 0.011934995651245117
nb_pixel_total : 704280
time to create 1 rle with new method : 0.028340578079223633
length of segment :  1309
time for calcul the mask position with numpy : 0.00804901123046875
nb_pixel_total : 164025
time to create 1 rle with new method : 0.020115137100219727
length of segment :  819
time for calcul the mask position with numpy : 0.004823923110961914
nb_pixel_total : 220646
time to create 1 rle with new method : 0.00785684585571289
length of segment :  559
time for calcul the mask position with numpy : 0.0022504329681396484
nb_pixel_total : 118410
time to create 1 rle with old method : 0.12497758865356445
length of segment :  513
time for calcul the mask position with numpy : 0.0005218982696533203
nb_pixel_total : 14638
time to create 1 rle with old method : 0.015202522277832031
length of segment :  196
time for calcul the mask position with numpy : 0.04750180244445801
nb_pixel_total : 1917110
time to create 1 rle with new method : 0.09564995765686035
length of segment :  2408
time for calcul the mask position with numpy : 0.01057291030883789
nb_pixel_total : 300693
time to create 1 rle with new method : 0.015141010284423828
length of segment :  1978
time for calcul the mask position with numpy : 0.002279996871948242
nb_pixel_total : 177528
time to create 1 rle with new method : 0.004608154296875
length of segment :  512
time for calcul the mask position with numpy : 0.0006072521209716797
nb_pixel_total : 17306
time to create 1 rle with old method : 0.019428014755249023
length of segment :  192
time for calcul the mask position with numpy : 0.0026595592498779297
nb_pixel_total : 131668
time to create 1 rle with old method : 0.1447465419769287
length of segment :  699
time for calcul the mask position with numpy : 0.002498626708984375
nb_pixel_total : 119933
time to create 1 rle with old method : 0.13018369674682617
length of segment :  904
time for calcul the mask position with numpy : 0.0031299591064453125
nb_pixel_total : 122433
time to create 1 rle with old method : 0.13452887535095215
length of segment :  457
time for calcul the mask position with numpy : 0.0006725788116455078
nb_pixel_total : 38410
time to create 1 rle with old method : 0.04268670082092285
length of segment :  256
time for calcul the mask position with numpy : 0.0007798671722412109
nb_pixel_total : 58809
time to create 1 rle with old method : 0.06623435020446777
length of segment :  271
time for calcul the mask position with numpy : 0.08164024353027344
nb_pixel_total : 5564541
time to create 1 rle with new method : 0.256345272064209
length of segment :  3515
time for calcul the mask position with numpy : 0.020578861236572266
nb_pixel_total : 891760
time to create 1 rle with new method : 0.08463740348815918
length of segment :  2207
time for calcul the mask position with numpy : 0.0004937648773193359
nb_pixel_total : 18662
time to create 1 rle with old method : 0.028512001037597656
length of segment :  124

time spent for convertir_results : 6.738558530807495
Inside saveOutput : final : False verbose : 0
eke 12-6-18 : saveMask need to be cleaned for new output !
Number saved : None
batch 1
Loaded 30 chid ids of type : 3663
Number RLEs to save : 24874

save missing photos in datou_result :

time spend for datou_step_exec : 32.39345383644104
time spend to save output : 1.6845078468322754
total time spend for step 1 : 34.077961683273315

step2:crop_condition
Fri Feb  7 11:22:06 2025
VR 17-11-17 : now, only for linear exec dependencies tree, some output goes to fill the input of the next
VR 22-3-18 : now we test the dependencies tree, but keep two separate code for datou_prepare_output_input until the code is correctly tested, clean and works in both case
VR 22-3-18 : but we use the first code for the first step id = -1, build in the code of datou_exec 
VR 22-3-18 : we should manage here the case when we are at the first step instead of building this step before datou_exec
Currently we do not manage missing dependencies information, that could maybe be correctly interpreted with default behavior 
Some of the step done at execution of the step could be done before when the tree of execution is build and the dependencies of different step analysed
We should have FATAL ERROR but same_nb_input_output==True : this should be an optionnal input !
We should have FATAL ERROR but same_nb_input_output==True : this should be an optionnal input !
VR 22-3-18 : For now we do not clean correctly the datou structure 
Loading chi in step crop with photo_hashtag_type : 3663
Loading chi in step crop for list_pids : 6 !
batch 1
Loaded 30 chid ids of type : 3663
+++++++++++++++++++++++++++++++++++++++++++++++++
begin to crop the class : teint_dans_la_masse
param for this class : {'min_score': 0.7}
filtre for class : teint_dans_la_masse 
hashtag_id of this class : 2107752385
we have both polygon and rles
Next one !
we have both polygon and rles
Next one !
we have both polygon and rles
Next one !
we have both polygon and rles
Next one !
we have both polygon and rles
Next one !
we have both polygon and rles
Next one !
we have both polygon and rles
Next one !
 map_result returned by crop_photo_return_map_crop : length : 7
 About to insert : list_path_to_insert length 7 new photo from crops !
we have finished the crop for the class : teint_dans_la_masse


begin to crop the class : autre_refus
param for this class : {'min_score': 0.5}
filtre for class : autre_refus 
hashtag_id of this class : 2107752406

begin to crop the class : carton_gris
param for this class : {'min_score': 0.5}
filtre for class : carton_gris 
hashtag_id of this class : 2107753020
we have both polygon and rles
Next one !
 map_result returned by crop_photo_return_map_crop : length : 1
 About to insert : list_path_to_insert length 1 new photo from crops !
we have finished the crop for the class : carton_gris


begin to crop the class : cartonnette
param for this class : {'min_score': 0.5}
filtre for class : cartonnette 
hashtag_id of this class : 702398920
we have both polygon and rles
Next one !
we have both polygon and rles
Next one !
we have both polygon and rles
Next one !
we have both polygon and rles
Next one !
we have both polygon and rles
Next one !
we have both polygon and rles
Next one !
we have both polygon and rles
Next one !
we have both polygon and rles
Next one !
we have both polygon and rles
Next one !
 map_result returned by crop_photo_return_map_crop : length : 9
 About to insert : list_path_to_insert length 9 new photo from crops !
we have finished the crop for the class : cartonnette


begin to crop the class : carton_brun
param for this class : {'min_score': 0.7}
filtre for class : carton_brun 
hashtag_id of this class : 2107753024
we have both polygon and rles
Next one !
we have both polygon and rles
Next one !
 map_result returned by crop_photo_return_map_crop : length : 2
 About to insert : list_path_to_insert length 2 new photo from crops !
we have finished the crop for the class : carton_brun


begin to crop the class : plastique
param for this class : {'min_score': 0.5}
filtre for class : plastique 
hashtag_id of this class : 492725882
we have both polygon and rles
Next one !
 map_result returned by crop_photo_return_map_crop : length : 1
 About to insert : list_path_to_insert length 1 new photo from crops !
we have finished the crop for the class : plastique


begin to crop the class : kraft
param for this class : {'min_score': 0.5}
filtre for class : kraft 
hashtag_id of this class : 493202403

begin to crop the class : metal
param for this class : {'min_score': 0.5}
filtre for class : metal 
hashtag_id of this class : 492628673
delete rles for these photos
Inside saveOutput : final : False verbose : 0
 saveOutput not yet implemented for datou_step.type : crop_condition we use saveGeneral 
[1330234361, 1330234357, 1330234356, 1330234354, 1330234344, 1330234340]
Looping around the photos to save general results
len do output : 20
/-3659578709Didn't retrieve data
.Didn't retrieve data
.Didn't retrieve data
.
/-3659578707Didn't retrieve data
.Didn't retrieve data
.Didn't retrieve data
.
/-3659578708Didn't retrieve data
.Didn't retrieve data
.Didn't retrieve data
.
/-3659578719Didn't retrieve data
.Didn't retrieve data
.Didn't retrieve data
.
/-3659578716Didn't retrieve data
.Didn't retrieve data
.Didn't retrieve data
.
/-3659578715Didn't retrieve data
.Didn't retrieve data
.Didn't retrieve data
.
/-3659578723Didn't retrieve data
.Didn't retrieve data
.Didn't retrieve data
.
/-3659578722Didn't retrieve data
.Didn't retrieve data
.Didn't retrieve data
.
/-3659578711Didn't retrieve data
.Didn't retrieve data
.Didn't retrieve data
.
/-3659578710Didn't retrieve data
.Didn't retrieve data
.Didn't retrieve data
.
/-3659578714Didn't retrieve data
.Didn't retrieve data
.Didn't retrieve data
.
/-3659578721Didn't retrieve data
.Didn't retrieve data
.Didn't retrieve data
.
/-3659578718Didn't retrieve data
.Didn't retrieve data
.Didn't retrieve data
.
/-3659578717Didn't retrieve data
.Didn't retrieve data
.Didn't retrieve data
.
/-3659578727Didn't retrieve data
.Didn't retrieve data
.Didn't retrieve data
.
/-3659578731Didn't retrieve data
.Didn't retrieve data
.Didn't retrieve data
.
/-3659578732Didn't retrieve data
.Didn't retrieve data
.Didn't retrieve data
.
/-3659578713Didn't retrieve data
.Didn't retrieve data
.Didn't retrieve data
.
/-3659578736Didn't retrieve data
.Didn't retrieve data
.Didn't retrieve data
.
/-3659578733Didn't retrieve data
.Didn't retrieve data
.Didn't retrieve data
.
before output type
Here is an output not treated by saveGeneral : <mtr.datou.datou_lib_object.DatouData object at 0x7f8390fbcc70>
Here is an output not treated by saveGeneral : <mtr.datou.datou_lib_object.DatouData object at 0x7f8390fbcbe0>
Here is an output not treated by saveGeneral : <mtr.datou.datou_lib_object.DatouData object at 0x7f8390fbcca0>
Managing all output in save final without adding information in the mtr_datou_result 
('3459', None, None, None, None, None, None, None, '2497290')
('3459', None, '1330234361', None, None, None, None, None, '2497290')
('3459', None, None, None, None, None, None, None, '2497290')
('3459', None, '1330234357', None, None, None, None, None, '2497290')
('3459', None, None, None, None, None, None, None, '2497290')
('3459', None, '1330234356', None, None, None, None, None, '2497290')
('3459', None, None, None, None, None, None, None, '2497290')
('3459', None, '1330234354', None, None, None, None, None, '2497290')
('3459', None, None, None, None, None, None, None, '2497290')
('3459', None, '1330234344', None, None, None, None, None, '2497290')
('3459', None, None, None, None, None, None, None, '2497290')
('3459', None, '1330234340', None, None, None, None, None, '2497290')


begin to insert list_values into mtr_datou_result : 
length of list_values in save_final : 66
time used for this insertion : 0.029804229736328125
 save_final 

save missing photos in datou_result :

time spend for datou_step_exec : 20.755817651748657
time spend to save output : 0.06439208984375
total time spend for step 2 : 20.820209741592407

step3:thcl
Fri Feb  7 11:22:27 2025
VR 17-11-17 : now, only for linear exec dependencies tree, some output goes to fill the input of the next
VR 22-3-18 : now we test the dependencies tree, but keep two separate code for datou_prepare_output_input until the code is correctly tested, clean and works in both case
VR 22-3-18 : but we use the first code for the first step id = -1, build in the code of datou_exec 
VR 22-3-18 : we should manage here the case when we are at the first step instead of building this step before datou_exec
Currently we do not manage missing dependencies information, that could maybe be correctly interpreted with default behavior 
Some of the step done at execution of the step could be done before when the tree of execution is build and the dependencies of different step analysed
complete output_args for input 0
VR 22-3-18 : For now we do not clean correctly the datou structure 

Beginning of datou step Thcl !
nombre de thcls : 2
we are using the classfication for multi_thcl 
[2456, 2868]


time to import caffe and check if the image exist :  0.014051198959350586
time to convert the images to numpy array :  0.021723508834838867
time to import caffe and check if the image exist :  0.012817621231079102
time to convert the images to numpy array :  0.04346013069152832
time to import caffe and check if the image exist :  0.011813879013061523
time to convert the images to numpy array :  0.05452275276184082
time to import caffe and check if the image exist :  0.016588687896728516
time to convert the images to numpy array :  1.436948537826538
time to import caffe and check if the image exist :  0.013433456420898438
time to convert the images to numpy array :  1.4404661655426025
time to import caffe and check if the image exist :  0.014639854431152344
time to convert the images to numpy array :  1.4416289329528809
time to import caffe and check if the image exist :  0.016585588455200195
time to convert the images to numpy array :  1.4484531879425049
time to import caffe and check if the image exist :  0.0160369873046875
time to convert the images to numpy array :  1.4497148990631104
time to import caffe and check if the image exist :  0.016413450241088867
time to convert the images to numpy array :  1.4617846012115479
time to import caffe and check if the image exist :  0.011970758438110352
time to convert the images to numpy array :  1.4765996932983398
total time to convert the images to numpy array :  1.9934766292572021
list photo_ids error:
[]
list photo_ids correct :
[-3659578736, -3659578733, -3659578732, -3659578713, -3659578716, -3659578715, -3659578727, -3659578731, -3659578723, -3659578722, -3659578711, -3659578710, -3659578708, -3659578719, -3659578714, -3659578721, -3659578718, -3659578717, -3659578709, -3659578707]
number of photos to traite : 20
try to delete the photos incorrect in DB

tagging for thcl : 2456
To do loadFromThcl(), then load ParamDescType : thcl2456
 thcls : [{'id': 2456, 'mtr_user_id': 31, 'name': 'learn_qualipapia_papier_refus_from_vlg_data_aug', 'pb_hashtag_id': 0, 'live': b'\x00', 'list_hashtags': 'papier,refus', 'svm_portfolios_learning': '3028087,3028251', 'photo_hashtag_type': 3049, 'photo_desc_type': 4999, 'type_classification': 'caffe', 'hashtag_id_list': '492668766,538914404'}]
 thcl {'id': 2456, 'mtr_user_id': 31, 'name': 'learn_qualipapia_papier_refus_from_vlg_data_aug', 'pb_hashtag_id': 0, 'live': b'\x00', 'list_hashtags': 'papier,refus', 'svm_portfolios_learning': '3028087,3028251', 'photo_hashtag_type': 3049, 'photo_desc_type': 4999, 'type_classification': 'caffe', 'hashtag_id_list': '492668766,538914404'}
 Update svm_hashtag_type_desc : 4999
FOUND : 1
 Here is data_from_sql_as_vec to set the ParamDescriptorType : (4999, 'learn_qualipapia_papier_refus_from_vlg_data_aug', 16384, 25088, 'learn_qualipapia_papier_refus_from_vlg_data_aug', 'res5b', 10.0, None, None, 256, None, 0, None, 8, None, None, -1000.0, 1, datetime.datetime(2020, 10, 23, 14, 27, 22), datetime.datetime(2020, 10, 23, 14, 27, 22))
To loadFromThcl() : net_4999
begin to check gpu status 
inside check gpu memory
l 3637 free memory gpu now : 2900  max_wait_temp :  1 max_wait :  0
FOUND : 1
 Here is data_from_sql_as_vec to set the ParamDescriptorType : (4999, 'learn_qualipapia_papier_refus_from_vlg_data_aug', 16384, 25088, 'learn_qualipapia_papier_refus_from_vlg_data_aug', 'res5b', 10.0, None, None, 256, None, 0, None, 8, None, None, -1000.0, 1, datetime.datetime(2020, 10, 23, 14, 27, 22), datetime.datetime(2020, 10, 23, 14, 27, 22))
None
 mean_file_type : 
 mean_file_path : 
 prototxt_file_path : 
 model : learn_qualipapia_papier_refus_from_vlg_data_aug
 Inside get_net  
 Inside get_net before cache_data_model 
model_param file didn't exist 
 Inside get_net before  CDM.load_model_par_type  
model_name : learn_qualipapia_papier_refus_from_vlg_data_aug
model_type : caffe
list file need :
['caffemodel', 'deploy_conv_normal.prototxt', 'deploy_fc.prototxt', 'deploy.prototxt', 'mean.npy', 'synset_words.txt']
file exist in s3 :
['caffemodel', 'deploy.prototxt', 'mean.npy', 'synset_words.txt']
file manque in s3 : 
['deploy_conv_normal.prototxt', 'deploy_fc.prototxt']
local folder  : /data/models_weight/learn_qualipapia_papier_refus_from_vlg_data_aug


/data/models_weight/learn_qualipapia_papier_refus_from_vlg_data_aug/caffemodel
size_local : 44972172 size in s3 : 44972172
create time local : 2021-08-09 05:55:48 create time in s3 : 2021-08-06 19:28:49
caffemodel already exist and didn't need to update


/data/models_weight/learn_qualipapia_papier_refus_from_vlg_data_aug/deploy.prototxt
size_local : 17311 size in s3 : 17311
create time local : 2021-08-09 05:55:48 create time in s3 : 2021-08-06 19:28:49
deploy.prototxt already exist and didn't need to update


/data/models_weight/learn_qualipapia_papier_refus_from_vlg_data_aug/mean.npy
size_local : 1572992 size in s3 : 1572992
create time local : 2021-08-09 05:55:48 create time in s3 : 2021-08-06 19:28:51
mean.npy already exist and didn't need to update


/data/models_weight/learn_qualipapia_papier_refus_from_vlg_data_aug/synset_words.txt
size_local : 57 size in s3 : 57
create time local : 2021-08-09 05:55:48 create time in s3 : 2021-08-06 19:28:49
synset_words.txt already exist and didn't need to update
 Inside get_net after CDM.load_model_par_type  
  After if not only_with_local_cache:
/home/admin/workarea/install/darknet/:/home/admin/workarea/git/Velours/python:/home/admin/workarea/install/caffe_frcnn_python3/py-faster-rcnn/caffe-fast-rcnn/python:/home/admin/mtr/.credentials:/home/admin/workarea/install/caffe/python:/home/admin/workarea/install/caffe_frcnn/py-faster-rcnn/tools/:/home/admin/workarea/git/fotonowerpip/:/home/admin/workarea/install/segment-anything:/home/admin//workarea/git/pyfvs/:/home/admin/workarea/git/apy/
Here before set mode gpu
Doing nothing but we could set mode gpu
after set mode gpu
 prototxt_filename : /data/models_weight/learn_qualipapia_papier_refus_from_vlg_data_aug/deploy.prototxt
 caffemodel_filename : /data/models_weight/learn_qualipapia_papier_refus_from_vlg_data_aug/caffemodel
now we set caffe to gpu mode


 before predict
begin to check gpu status 
inside check gpu memory
l 3637 free memory gpu now : 2681  max_wait_temp :  1 max_wait :  0
dict_keys(['prob'])
time used to do the prepocess of the images : 0.5020976066589355
time used to do the prediction : 0.3421928882598877
we don't save the descriptors for this thcl 2456


tagging for thcl : 2868
To do loadFromThcl(), then load ParamDescType : thcl2868
 thcls : [{'id': 2868, 'mtr_user_id': 31, 'name': 'learn_papier_nantes_300421', 'pb_hashtag_id': 0, 'live': b'\x00', 'list_hashtags': 'Carton_brun,Carton_gris,Teint_Dans_La_Masse,autre_refus,cartonnette,environnement,kraft,metal,papier,plastique', 'svm_portfolios_learning': '3752117,3752118,3752123,3752106,3752116,3752124,3752119,3581575,3486029,3752122', 'photo_hashtag_type': 3632, 'photo_desc_type': 5288, 'type_classification': 'caffe', 'hashtag_id_list': '2107753024,2107753020,2107752385,2107752406,702398920,493012381,493202403,492628673,492668766,492725882'}]
 thcl {'id': 2868, 'mtr_user_id': 31, 'name': 'learn_papier_nantes_300421', 'pb_hashtag_id': 0, 'live': b'\x00', 'list_hashtags': 'Carton_brun,Carton_gris,Teint_Dans_La_Masse,autre_refus,cartonnette,environnement,kraft,metal,papier,plastique', 'svm_portfolios_learning': '3752117,3752118,3752123,3752106,3752116,3752124,3752119,3581575,3486029,3752122', 'photo_hashtag_type': 3632, 'photo_desc_type': 5288, 'type_classification': 'caffe', 'hashtag_id_list': '2107753024,2107753020,2107752385,2107752406,702398920,493012381,493202403,492628673,492668766,492725882'}
 Update svm_hashtag_type_desc : 5288
FOUND : 1
 Here is data_from_sql_as_vec to set the ParamDescriptorType : (5288, 'learn_papier_nantes_300421', 512, 512, 'learn_papier_nantes_300421', 'pool5', 10.0, None, None, 256, None, 0, None, 8, None, None, -1000.0, 1, datetime.datetime(2021, 4, 30, 17, 9, 41), datetime.datetime(2021, 4, 30, 17, 9, 41))
To loadFromThcl() : net_5288
begin to check gpu status 
inside check gpu memory
l 3637 free memory gpu now : 2522  max_wait_temp :  1 max_wait :  0
FOUND : 1
 Here is data_from_sql_as_vec to set the ParamDescriptorType : (5288, 'learn_papier_nantes_300421', 512, 512, 'learn_papier_nantes_300421', 'pool5', 10.0, None, None, 256, None, 0, None, 8, None, None, -1000.0, 1, datetime.datetime(2021, 4, 30, 17, 9, 41), datetime.datetime(2021, 4, 30, 17, 9, 41))
None
 mean_file_type : 
 mean_file_path : 
 prototxt_file_path : 
 model : learn_papier_nantes_300421
 Inside get_net  
 Inside get_net before cache_data_model 
model_param file didn't exist 
 Inside get_net before  CDM.load_model_par_type  
model_name : learn_papier_nantes_300421
model_type : caffe
list file need :
['caffemodel', 'deploy_conv_normal.prototxt', 'deploy_fc.prototxt', 'deploy.prototxt', 'mean.npy', 'synset_words.txt']
file exist in s3 :
['caffemodel', 'deploy.prototxt', 'mean.npy', 'synset_words.txt']
file manque in s3 : 
['deploy_conv_normal.prototxt', 'deploy_fc.prototxt']
local folder  : /data/models_weight/learn_papier_nantes_300421


/data/models_weight/learn_papier_nantes_300421/caffemodel
size_local : 44791983 size in s3 : 44791983
create time local : 2021-08-09 05:55:59 create time in s3 : 2021-08-06 19:22:13
caffemodel already exist and didn't need to update


/data/models_weight/learn_papier_nantes_300421/deploy.prototxt
size_local : 17255 size in s3 : 17255
create time local : 2021-08-09 05:55:59 create time in s3 : 2021-08-06 19:22:12
deploy.prototxt already exist and didn't need to update


/data/models_weight/learn_papier_nantes_300421/mean.npy
size_local : 1572992 size in s3 : 1572992
create time local : 2021-08-09 05:55:59 create time in s3 : 2021-08-06 19:22:14
mean.npy already exist and didn't need to update


/data/models_weight/learn_papier_nantes_300421/synset_words.txt
size_local : 331 size in s3 : 331
create time local : 2021-08-09 05:56:00 create time in s3 : 2021-08-06 19:22:12
synset_words.txt already exist and didn't need to update
 Inside get_net after CDM.load_model_par_type  
  After if not only_with_local_cache:
/home/admin/workarea/install/darknet/:/home/admin/workarea/git/Velours/python:/home/admin/workarea/install/caffe_frcnn_python3/py-faster-rcnn/caffe-fast-rcnn/python:/home/admin/mtr/.credentials:/home/admin/workarea/install/caffe/python:/home/admin/workarea/install/caffe_frcnn/py-faster-rcnn/tools/:/home/admin/workarea/git/fotonowerpip/:/home/admin/workarea/install/segment-anything:/home/admin//workarea/git/pyfvs/:/home/admin/workarea/git/apy/
Here before set mode gpu
Doing nothing but we could set mode gpu
after set mode gpu
 prototxt_filename : /data/models_weight/learn_papier_nantes_300421/deploy.prototxt
 caffemodel_filename : /data/models_weight/learn_papier_nantes_300421/caffemodel
now we set caffe to gpu mode


 before predict
begin to check gpu status 
inside check gpu memory
havn't enough memory gpu , need / 2500
l 3632 free memory gpu now : 2434
wait 20 seconds
WARNING: Logging before InitGoogleLogging() is written to STDERR
F0207 11:22:58.451659 3207435 syncedmem.cpp:71] Check failed: error == cudaSuccess (2 vs. 0)  out of memory
*** Check failure stack trace: ***
Command terminated by signal 6
40.06user 29.27system 1:29.29elapsed 77%CPU (0avgtext+0avgdata 3146532maxresident)k
576968inputs+26400outputs (1956major+2503111minor)pagefaults 0swaps