Skip to content

Instantly share code, notes, and snippets.

@trevor-m
Created June 5, 2023 18:27
Show Gist options
  • Save trevor-m/fb711ae034a37a9c117253f09b92dd76 to your computer and use it in GitHub Desktop.
Save trevor-m/fb711ae034a37a9c117253f09b92dd76 to your computer and use it in GitHub Desktop.
2023-06-05 17:37:07.425028: W tensorflow/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libcudart.so.11.0'; dlerror: libcudart.so.11.0: cannot open shared object file: No such file or directory; LD_LIBRARY_PATH: /usr/local/nvidia/lib:/usr/local/nvidia/lib64:/usr/lib/x86_64-linux-gnu
2023-06-05 17:37:15.664716: W tensorflow/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libcudart.so.11.0'; dlerror: libcudart.so.11.0: cannot open shared object file: No such file or directory; LD_LIBRARY_PATH: /usr/local/nvidia/lib:/usr/local/nvidia/lib64:/usr/lib/x86_64-linux-gnu
2023-06-05 17:37:15.664797: W tensorflow/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libcublas.so.11'; dlerror: libcublas.so.11: cannot open shared object file: No such file or directory; LD_LIBRARY_PATH: /usr/local/nvidia/lib:/usr/local/nvidia/lib64:/usr/lib/x86_64-linux-gnu
2023-06-05 17:37:15.664863: W tensorflow/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libcublasLt.so.11'; dlerror: libcublasLt.so.11: cannot open shared object file: No such file or directory; LD_LIBRARY_PATH: /usr/local/nvidia/lib:/usr/local/nvidia/lib64:/usr/lib/x86_64-linux-gnu
2023-06-05 17:37:15.664927: W tensorflow/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libcufft.so.10'; dlerror: libcufft.so.10: cannot open shared object file: No such file or directory; LD_LIBRARY_PATH: /usr/local/nvidia/lib:/usr/local/nvidia/lib64:/usr/lib/x86_64-linux-gnu
2023-06-05 17:37:15.696031: W tensorflow/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libcusparse.so.11'; dlerror: libcusparse.so.11: cannot open shared object file: No such file or directory; LD_LIBRARY_PATH: /usr/local/nvidia/lib:/usr/local/nvidia/lib64:/usr/lib/x86_64-linux-gnu
2023-06-05 17:37:15.696218: W tensorflow/core/common_runtime/gpu/gpu_device.cc:1850] Cannot dlopen some GPU libraries. Please make sure the missing libraries mentioned above are installed properly if you would like to use GPU. Follow the guide at https://www.tensorflow.org/install/gpu for how to download and setup the required libraries for your platform.
Skipping registering GPU devices...
[IREE-PJRT] DEBUG: Using IREE compiler binary: /usr/local/lib/python3.10/dist-packages/iree/compiler/_mlir_libs/libIREECompiler.so
[IREE-PJRT] DEBUG: Compiler Version: 20230604.542 @ 7400c8546f33cc76680c5e235d17617a5ec8c18c (API version 1.2)
[IREE-PJRT] DEBUG: Using partitioner binary: /workspace/openxla-pjrt-plugin/bazel-bin/partitioner/libOpenXLAPartitioner.so
[IREE-PJRT] DEBUG: Partitioner version: <unknown> (API version 1.1)
[IREE-PJRT] DEBUG: CUDA driver created
I0605 17:37:15.740515 139756486557120 setup_jax.py:72] JAX process: 0 / 1
I0605 17:37:15.740673 139756486557120 setup_jax.py:73] JAX devices: [GPU-b0fbccec-7593-9c0c-35de-cbfc04b9d09a]
I0605 17:37:15.740910 139756486557120 setup_jax.py:74] jax.device_count(): 1
I0605 17:37:15.741026 139756486557120 setup_jax.py:75] jax.local_device_count(): 1
I0605 17:37:15.741060 139756486557120 setup_jax.py:76] jax.process_count(): 1
Registered experiment `paxml.tasks.lm.params.lm_cloud.LargeMlp`
Registered experiment `paxml.tasks.lm.params.lm_cloud.SmallMlp`
Registered experiment `paxml.tasks.lm.params.lm_cloud.LmCloudTransformerAdam`
Registered experiment `paxml.tasks.lm.params.lm_cloud.LmCloudTransformerAdamTest`
Registered experiment `paxml.tasks.lm.params.lm_cloud.LmCloudTransformerAdamLimitSteps`
Registered experiment `paxml.tasks.lm.params.lm_cloud.LmCloudSpmdTest`
Registered experiment `paxml.tasks.lm.params.lm_cloud.LmCloudSpmd2B`
Registered experiment `paxml.tasks.lm.params.lm_cloud.LmCloudSpmd2BLimitSteps`
Registered experiment `paxml.tasks.lm.params.lm_cloud.LmCloudSpmd32B`
Registered experiment `paxml.tasks.lm.params.lm_cloud.LmCloudSpmd64B`
Registered experiment `paxml.tasks.lm.params.lm_cloud.LmCloudSpmd128B`
Registered experiment `paxml.tasks.lm.params.lm_cloud.LmCloudSpmd256B`
Registered experiment `paxml.tasks.lm.params.lm_cloud.LmCloudSpmd512B`
Registered experiment `paxml.tasks.lm.params.lm_cloud.LmCloudSpmd1024B`
Registered experiment `paxml.tasks.lm.params.lm_cloud.LmCloudSpmdPipeline9B`
Registered experiment `paxml.tasks.lm.params.lm_cloud.LmCloudSpmdPipeline175B`
Registered experiment `paxml.tasks.lm.params.lm_cloud.LmCloudSpmdMultislice2B`
Registered experiment `paxml.tasks.lm.params.lm_cloud.LmCloudSpmdPipelineMultislice2B`
Registered experiment `paxml.tasks.lm.params.lm_cloud.LmCloudSpmdPipelineMultislice2BCircular`
Registered experiment `paxml.tasks.lm.params.c4.LmCloudSpmdAdam`
Registered experiment `paxml.tasks.lm.params.c4.LmCloudSpmdAdamLimitSteps`
Registered experiment `paxml.tasks.lm.params.c4.C4SpmdAdam`
Registered experiment `paxml.tasks.lm.params.c4.C4SpmdGpt3AdamOrgHPBS1p5k1536Replicas`
Registered experiment `paxml.tasks.lm.params.c4.C4SpmdPipelineAdam`
Registered experiment `paxml.tasks.lm.params.c4.C4SpmdPipelineGpt3AdamOrgHPBS1p5k768Replicas`
Registered experiment `paxml.tasks.lm.params.c4.C4SpmdPipelineGpt3AdamMLPerfHPBS1p5k768Replicas`
Registered experiment `paxml.tasks.lm.params.c4.C4SpmdPipelineGpt3AdamMLPerfHPBS2k512Replicas`
Registered experiment `paxml.tasks.lm.params.c4.C4SpmdPipelineGpt3AdamMLPerfHPBS3k768Replicas`
Registered experiment `paxml.tasks.lm.params.c4.C4SpmdPipelineGpt3AdamMLPerfHPBS4k1024Replicas`
Registered experiment `paxml.tasks.lm.params.c4.C4SpmdPipelineGpt3AdamMLPerfHPBS8k1024Replicas`
Registered experiment `paxml.tasks.lm.params.c4.C4Spmd1BAdam4Replicas`
Registered experiment `paxml.tasks.lm.params.c4.C4Spmd1BAdam4ReplicasLimitSteps`
Registered experiment `paxml.tasks.lm.params.c4.C4Spmd2BAdam4Replicas`
Registered experiment `paxml.tasks.lm.params.c4.C4Spmd16BAdam32Replicas`
Registered experiment `paxml.tasks.lm.params.c4.C4Spmd32BAdam64Replicas`
Registered experiment `paxml.tasks.lm.params.c4.C4SpmdGpt3L16AdamOrgHP`
Registered experiment `paxml.tasks.lm.params.c4.C4SpmdPipelineGpt3SmallAdam8Replicas`
W0605 17:37:16.975340 139756486557120 gpu_fast_attention.py:41] jax_triton not found, please `pip install jax-triton`
Registered experiment `tasks.lm.params.nvidia.NVIDIA1_3B`
Registered experiment `tasks.lm.params.nvidia.NVIDIA5B`
Registered experiment `tasks.lm.params.nvidia.NVIDIA8_3B`
Registered experiment `tasks.lm.params.nvidia.NVIDIA10B`
Registered experiment `tasks.lm.params.nvidia.NVIDIA40BProxy`
Registered experiment `tasks.lm.params.nvidia.NVIDIA70BProxy`
Registered experiment `tasks.lm.params.nvidia.NVIDIA116BProxy`
Registered experiment `tasks.lm.params.nvidia.NVIDIA175BProxy`
Registered experiment `tasks.lm.params.nvidia.TestSmallConfig`
Registered experiment `tasks.lm.params.nvidia.NVIDIA1_3BPmap`
I0605 17:37:16.984491 139756486557120 local.py:45] Setting task status: process_index: 0, process_count: 1
I0605 17:37:16.984750 139756486557120 local.py:50] Created artifact job_log_dir of type ArtifactType.DIRECTORY and value log_NVIDIA1_3BPmap.
I0605 17:37:17.464085 139756486557120 local.py:45] Setting task status: Train experiment tasks.lm.params.nvidia.NVIDIA1_3BPmap at log_NVIDIA1_3BPmap
I0605 17:37:17.464244 139756486557120 train.py:139] [PAX STATUS] Starting `train_and_evaluate`
I0605 17:37:17.601573 139756486557120 train.py:146] [PAX STATUS] Obtaining and initializing datasets.
I0605 17:37:17.604288 139756486557120 train.py:162] [PAX STATUS]: Done initializing dataset objects
I0605 17:37:17.604343 139756486557120 train.py:164] train_input_p:
I0605 17:37:17.604944 139756486557120 train.py:168] allow_fixed_file_random_seed : False
I0605 17:37:17.604993 139756486557120 train.py:168] batch_padding_size : 0
I0605 17:37:17.605027 139756486557120 train.py:168] batch_size : NoneType
I0605 17:37:17.605060 139756486557120 train.py:168] cls : type/praxis.base_input/LingvoInputAdaptor
I0605 17:37:17.605092 139756486557120 train.py:168] cluster_do_eval : False
I0605 17:37:17.605139 139756486557120 train.py:168] custom_device_order : NoneType
I0605 17:37:17.605173 139756486557120 train.py:168] eval_loop_num_batches : 1
I0605 17:37:17.605206 139756486557120 train.py:168] experimental_remote_input : False
I0605 17:37:17.605238 139756486557120 train.py:168] infeed_host_index : 0
I0605 17:37:17.605269 139756486557120 train.py:168] input.activation_split_dims_mapping : NoneType
I0605 17:37:17.605301 139756486557120 train.py:168] input.add_name_to_theta : False
I0605 17:37:17.605332 139756486557120 train.py:168] input.allow_implicit_capture : NoneType
I0605 17:37:17.605364 139756486557120 train.py:168] input.batch_size : 1
I0605 17:37:17.605396 139756486557120 train.py:168] input.cls : type/paxml.tasks.lm.input_generator/SyntheticLmData
I0605 17:37:17.605428 139756486557120 train.py:168] input.decoder_samples_per_summary : NoneType
I0605 17:37:17.605461 139756486557120 train.py:168] input.device_mesh : NoneType
I0605 17:37:17.605493 139756486557120 train.py:168] input.dtype : float32
I0605 17:37:17.605524 139756486557120 train.py:168] input.eval_samples_per_summary : NoneType
I0605 17:37:17.605556 139756486557120 train.py:168] input.file_datasource : NoneType
I0605 17:37:17.605586 139756486557120 train.py:168] input.filter_sparse_tensors : False
I0605 17:37:17.605615 139756486557120 train.py:168] input.fprop_dtype : NoneType
I0605 17:37:17.605644 139756486557120 train.py:168] input.inference_driver_name : NoneType
I0605 17:37:17.605674 139756486557120 train.py:168] input.input_stats_summary_interval_steps : 10
I0605 17:37:17.605703 139756486557120 train.py:168] input.is_inference : NoneType
I0605 17:37:17.605733 139756486557120 train.py:168] input.name : 'input'
I0605 17:37:17.605762 139756486557120 train.py:168] input.num_partitions : NoneType
I0605 17:37:17.605792 139756486557120 train.py:168] input.num_samples : 0
I0605 17:37:17.605822 139756486557120 train.py:168] input.outfeed_in_logical_order : False
I0605 17:37:17.605851 139756486557120 train.py:168] input.params_init.custom_v_init : NoneType
I0605 17:37:17.605880 139756486557120 train.py:168] input.params_init.method : 'xavier'
I0605 17:37:17.605910 139756486557120 train.py:168] input.params_init.scale : 1.000001
I0605 17:37:17.605938 139756486557120 train.py:168] input.params_init.seed : NoneType
I0605 17:37:17.605968 139756486557120 train.py:168] input.random_seed : NoneType
I0605 17:37:17.605997 139756486557120 train.py:168] input.remote.max_inflights_per_target : 32
I0605 17:37:17.606026 139756486557120 train.py:168] input.resettable : False
I0605 17:37:17.606055 139756486557120 train.py:168] input.seq_len : 2048
I0605 17:37:17.606085 139756486557120 train.py:168] input.skip_lp_regularization : NoneType
I0605 17:37:17.606114 139756486557120 train.py:168] input.tpu_embedding_mode : 'train'
I0605 17:37:17.606144 139756486557120 train.py:168] input.tpu_infeed_parallelism : 1
I0605 17:37:17.606173 139756486557120 train.py:168] input.use_partitioned_infeed_queue : False
I0605 17:37:17.606202 139756486557120 train.py:168] input.use_per_core_infeed : False
I0605 17:37:17.606231 139756486557120 train.py:168] input.use_per_host_infeed : False
I0605 17:37:17.606261 139756486557120 train.py:168] input.vn.deterministic : NoneType
I0605 17:37:17.606290 139756486557120 train.py:168] input.vn.global_vn : False
I0605 17:37:17.606320 139756486557120 train.py:168] input.vn.per_step_vn : False
I0605 17:37:17.606349 139756486557120 train.py:168] input.vn.scale : NoneType
I0605 17:37:17.606379 139756486557120 train.py:168] input.vn.seed : NoneType
I0605 17:37:17.606409 139756486557120 train.py:168] input.vn.start_step : 0
I0605 17:37:17.606438 139756486557120 train.py:168] input.weight_split_dims_mapping : NoneType
I0605 17:37:17.606467 139756486557120 train.py:168] input_checkpointing_enabled : False
I0605 17:37:17.606497 139756486557120 train.py:168] input_random_seed : NoneType
I0605 17:37:17.606526 139756486557120 train.py:168] is_training : True
I0605 17:37:17.606556 139756486557120 train.py:168] name : ''
I0605 17:37:17.606590 139756486557120 train.py:168] num_batches : NoneType
I0605 17:37:17.606620 139756486557120 train.py:168] num_infeed_hosts : 0
I0605 17:37:17.606650 139756486557120 train.py:168] reset_for_eval : False
I0605 17:37:17.606679 139756486557120 train.py:168] tf_data_service_address : NoneType
I0605 17:37:17.606710 139756486557120 train.py:169] task_p:
I0605 17:37:17.628924 139756486557120 train.py:171] cls : type/paxml.tasks_lib/SingleTask
I0605 17:37:17.629092 139756486557120 train.py:171] decode.cls : type/paxml.tasks_lib/SingleTask.Decode
I0605 17:37:17.629150 139756486557120 train.py:171] decode.prng_key_fold_with_batch_index : False
I0605 17:37:17.629184 139756486557120 train.py:171] decode.prng_key_fold_with_global_step : True
I0605 17:37:17.629215 139756486557120 train.py:171] decode.random_seed : 1234
I0605 17:37:17.629246 139756486557120 train.py:171] early_stopping_fn : NoneType
I0605 17:37:17.629276 139756486557120 train.py:171] evaluate.apply_mutable_list : ['aux_loss', 'summaries', 'non_trainable']
I0605 17:37:17.629308 139756486557120 train.py:171] evaluate.cls : type/paxml.tasks_lib/SingleTask.Evaluate
I0605 17:37:17.629338 139756486557120 train.py:171] evaluate.random_seed : 1234
I0605 17:37:17.629369 139756486557120 train.py:171] infer.cls : type/paxml.tasks_lib/SingleTask.Infer
I0605 17:37:17.629403 139756486557120 train.py:171] infer.random_seed : 1234
I0605 17:37:17.629433 139756486557120 train.py:171] infer_writer : NoneType
I0605 17:37:17.629464 139756486557120 train.py:171] loss_aggregator : NoneType
I0605 17:37:17.629495 139756486557120 train.py:171] metrics : NoneType
I0605 17:37:17.629525 139756486557120 train.py:171] model.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.629556 139756486557120 train.py:171] model.apply_eval_sample_weights : False
I0605 17:37:17.629586 139756486557120 train.py:171] model.cls : type/praxis.layers.models/LanguageModel
I0605 17:37:17.629616 139756486557120 train.py:171] model.contiguous_submeshes : NoneType
I0605 17:37:17.629646 139756486557120 train.py:171] model.count_tokens : False
I0605 17:37:17.629676 139756486557120 train.py:171] model.dcn_mesh_shape : NoneType
I0605 17:37:17.629707 139756486557120 train.py:171] model.decoder_tpl.cls : type/praxis.decoder_hparams/GreedyDecoderHParams
I0605 17:37:17.629737 139756486557120 train.py:171] model.decoder_tpl.decode_loop_mesh_axes_transpose : NoneType
I0605 17:37:17.629767 139756486557120 train.py:171] model.decoder_tpl.emb_lookup_style : 'matmul'
I0605 17:37:17.629796 139756486557120 train.py:171] model.decoder_tpl.eos_id : 2
I0605 17:37:17.629826 139756486557120 train.py:171] model.decoder_tpl.fprop_for_prefix : False
I0605 17:37:17.629856 139756486557120 train.py:171] model.decoder_tpl.lazy_prefix_broadcast : False
I0605 17:37:17.629886 139756486557120 train.py:171] model.decoder_tpl.max_decode_steps : NoneType
I0605 17:37:17.629916 139756486557120 train.py:171] model.decoder_tpl.min_prefix_len : 5
I0605 17:37:17.629946 139756486557120 train.py:171] model.decoder_tpl.process_result_fn : NoneType
I0605 17:37:17.629976 139756486557120 train.py:171] model.decoder_tpl.seqlen : 0
I0605 17:37:17.630006 139756486557120 train.py:171] model.dtype : type/jax.numpy/float32
I0605 17:37:17.630036 139756486557120 train.py:171] model.fprop_dtype : dtype[float32]
I0605 17:37:17.630066 139756486557120 train.py:171] model.ici_mesh_shape : NoneType
I0605 17:37:17.630096 139756486557120 train.py:171] model.lm_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.630126 139756486557120 train.py:171] model.lm_tpl.cls : type/praxis.layers.transformer_models/TransformerLm
I0605 17:37:17.630156 139756486557120 train.py:171] model.lm_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.630186 139756486557120 train.py:171] model.lm_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.630216 139756486557120 train.py:171] model.lm_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.630246 139756486557120 train.py:171] model.lm_tpl.final_ln_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.630275 139756486557120 train.py:171] model.lm_tpl.final_ln_tpl.cls : type/praxis.layers.normalizations/LayerNorm
I0605 17:37:17.630306 139756486557120 train.py:171] model.lm_tpl.final_ln_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.630335 139756486557120 train.py:171] model.lm_tpl.final_ln_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.630365 139756486557120 train.py:171] model.lm_tpl.final_ln_tpl.dim : 0
I0605 17:37:17.630395 139756486557120 train.py:171] model.lm_tpl.final_ln_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.630425 139756486557120 train.py:171] model.lm_tpl.final_ln_tpl.epsilon : 1e-06
I0605 17:37:17.630455 139756486557120 train.py:171] model.lm_tpl.final_ln_tpl.fprop_dtype : NoneType
I0605 17:37:17.630485 139756486557120 train.py:171] model.lm_tpl.final_ln_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.630515 139756486557120 train.py:171] model.lm_tpl.final_ln_tpl.mesh_axis_names : NoneType
I0605 17:37:17.630545 139756486557120 train.py:171] model.lm_tpl.final_ln_tpl.name : NoneType
I0605 17:37:17.630575 139756486557120 train.py:171] model.lm_tpl.final_ln_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.630605 139756486557120 train.py:171] model.lm_tpl.final_ln_tpl.params_init.method : 'xavier'
I0605 17:37:17.630635 139756486557120 train.py:171] model.lm_tpl.final_ln_tpl.params_init.scale : 1.000001
I0605 17:37:17.630665 139756486557120 train.py:171] model.lm_tpl.final_ln_tpl.reductions_in_fp32 : False
I0605 17:37:17.630694 139756486557120 train.py:171] model.lm_tpl.final_ln_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.630724 139756486557120 train.py:171] model.lm_tpl.final_ln_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.630754 139756486557120 train.py:171] model.lm_tpl.final_ln_tpl.use_bias : True
I0605 17:37:17.630784 139756486557120 train.py:171] model.lm_tpl.final_ln_tpl.use_scale : True
I0605 17:37:17.630813 139756486557120 train.py:171] model.lm_tpl.final_ln_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.630843 139756486557120 train.py:171] model.lm_tpl.fprop_dtype : NoneType
I0605 17:37:17.630873 139756486557120 train.py:171] model.lm_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.630903 139756486557120 train.py:171] model.lm_tpl.mesh_axis_names : NoneType
I0605 17:37:17.630933 139756486557120 train.py:171] model.lm_tpl.model_dims : 2048
I0605 17:37:17.630962 139756486557120 train.py:171] model.lm_tpl.model_type : 'causal'
I0605 17:37:17.630992 139756486557120 train.py:171] model.lm_tpl.name : NoneType
I0605 17:37:17.631022 139756486557120 train.py:171] model.lm_tpl.ngrammer_tpl : NoneType
I0605 17:37:17.631052 139756486557120 train.py:171] model.lm_tpl.packed_input : True
I0605 17:37:17.631083 139756486557120 train.py:171] model.lm_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.631113 139756486557120 train.py:171] model.lm_tpl.params_init.method : 'xavier'
I0605 17:37:17.631142 139756486557120 train.py:171] model.lm_tpl.params_init.scale : 1.000001
I0605 17:37:17.631172 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.activation_split_dims_mapping.emb_out_split_dims_mapping : NoneType
I0605 17:37:17.631202 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.631233 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.array_lookup_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.631264 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.array_lookup_tpl.cls : type/praxis.layers.base_ops/ArrayLookup
I0605 17:37:17.631293 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.array_lookup_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.631323 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.array_lookup_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.631354 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.array_lookup_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.631383 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.array_lookup_tpl.fprop_dtype : NoneType
I0605 17:37:17.631413 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.array_lookup_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.631443 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.array_lookup_tpl.mesh_axis_names : NoneType
I0605 17:37:17.631474 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.array_lookup_tpl.name : NoneType
I0605 17:37:17.631504 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.array_lookup_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.631534 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.array_lookup_tpl.params_init.method : 'xavier'
I0605 17:37:17.631564 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.array_lookup_tpl.params_init.scale : 1.000001
I0605 17:37:17.631594 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.array_lookup_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.631624 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.array_lookup_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.631654 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.array_lookup_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.631683 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.cls : type/praxis.layers.embedding_softmax/TrainablePositionalEmbedding
I0605 17:37:17.631715 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.631745 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.631775 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.631805 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.einsum_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.631835 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.einsum_tpl.cls : type/praxis.layers.base_ops/Einsum
I0605 17:37:17.631865 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.einsum_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.631894 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.einsum_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.631924 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.einsum_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.631954 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.einsum_tpl.fprop_dtype : NoneType
I0605 17:37:17.631984 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.einsum_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.632014 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.einsum_tpl.mesh_axis_names : NoneType
I0605 17:37:17.632044 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.einsum_tpl.name : NoneType
I0605 17:37:17.632073 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.einsum_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.632104 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.einsum_tpl.params_init.method : 'xavier'
I0605 17:37:17.632134 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.einsum_tpl.params_init.scale : 1.000001
I0605 17:37:17.632164 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.einsum_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.632194 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.einsum_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.632224 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.einsum_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.632253 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.embedding_dims : 0
I0605 17:37:17.632283 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.fprop_dtype : NoneType
I0605 17:37:17.632313 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.632343 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.lookup_style : 'matmul'
I0605 17:37:17.632373 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.max_seq_length : 2048
I0605 17:37:17.632403 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.max_timescale : 10000
I0605 17:37:17.632433 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.mesh_axis_names : NoneType
I0605 17:37:17.632463 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.min_timescale : 1
I0605 17:37:17.632493 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.name : NoneType
I0605 17:37:17.632523 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.632553 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.params_init.method : 'xavier'
I0605 17:37:17.632583 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.params_init.scale : 1.000001
I0605 17:37:17.632613 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.632643 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.632673 139756486557120 train.py:171] model.lm_tpl.position_emb_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.632703 139756486557120 train.py:171] model.lm_tpl.post_attention_ngrammer_tpls : NoneType
I0605 17:37:17.632733 139756486557120 train.py:171] model.lm_tpl.record_activations_in_xent_output : False
I0605 17:37:17.632762 139756486557120 train.py:171] model.lm_tpl.separate_embedding_tpl : NoneType
I0605 17:37:17.632792 139756486557120 train.py:171] model.lm_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.632822 139756486557120 train.py:171] model.lm_tpl.skip_aux_loss : False
I0605 17:37:17.632852 139756486557120 train.py:171] model.lm_tpl.skip_compute_loss : False
I0605 17:37:17.632882 139756486557120 train.py:171] model.lm_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.632912 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.activation_split_dims_mapping.emb_out_split_dims_mapping : NoneType
I0605 17:37:17.632942 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.632971 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.array_lookup_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.633002 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.array_lookup_tpl.cls : type/praxis.layers.base_ops/ArrayLookup
I0605 17:37:17.633031 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.array_lookup_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.633061 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.array_lookup_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.633091 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.array_lookup_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.633129 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.array_lookup_tpl.fprop_dtype : NoneType
I0605 17:37:17.633161 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.array_lookup_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.633191 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.array_lookup_tpl.mesh_axis_names : NoneType
I0605 17:37:17.633221 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.array_lookup_tpl.name : NoneType
I0605 17:37:17.633251 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.array_lookup_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.633281 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.array_lookup_tpl.params_init.method : 'xavier'
I0605 17:37:17.633311 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.array_lookup_tpl.params_init.scale : 1.000001
I0605 17:37:17.633340 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.array_lookup_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.633370 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.array_lookup_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.633401 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.array_lookup_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.633431 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.bi_tempered_loss_tpl : NoneType
I0605 17:37:17.633461 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.bias_init : 0.0
I0605 17:37:17.633491 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.cls : type/praxis.layers.embedding_softmax/SharedEmbeddingSoftmax
I0605 17:37:17.633521 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.633551 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.633581 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.633611 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.einsum_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.633641 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.einsum_tpl.cls : type/praxis.layers.base_ops/Einsum
I0605 17:37:17.633671 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.einsum_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.633702 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.einsum_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.633732 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.einsum_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.633762 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.einsum_tpl.fprop_dtype : NoneType
I0605 17:37:17.633792 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.einsum_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.633822 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.einsum_tpl.mesh_axis_names : NoneType
I0605 17:37:17.633852 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.einsum_tpl.name : NoneType
I0605 17:37:17.633882 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.einsum_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.633912 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.einsum_tpl.params_init.method : 'xavier'
I0605 17:37:17.633942 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.einsum_tpl.params_init.scale : 1.000001
I0605 17:37:17.633972 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.einsum_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.634001 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.einsum_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.634031 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.einsum_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.634061 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.634091 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.activation_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.634122 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.activation_tpl.cls : type/praxis.layers.activations/ReLU
I0605 17:37:17.634152 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.activation_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.634182 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.activation_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.634212 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.activation_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.634242 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.activation_tpl.fprop_dtype : NoneType
I0605 17:37:17.634272 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.activation_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.634302 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.activation_tpl.mesh_axis_names : NoneType
I0605 17:37:17.634332 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.activation_tpl.name : NoneType
I0605 17:37:17.634362 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.activation_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.634393 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.activation_tpl.params_init.method : 'xavier'
I0605 17:37:17.634423 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.activation_tpl.params_init.scale : 1.000001
I0605 17:37:17.634453 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.activation_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.634483 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.activation_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.634512 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.activation_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.634542 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.bias_init : 0.0
I0605 17:37:17.634572 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.cls : type/praxis.layers.linears/FeedForward
I0605 17:37:17.634603 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.634633 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.634662 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.634692 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.fprop_dtype : NoneType
I0605 17:37:17.634722 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.has_bias : True
I0605 17:37:17.634752 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.634781 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.input_dims : 0
I0605 17:37:17.634811 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.linear_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.634842 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.linear_tpl.cls : type/praxis.layers.linears/Linear
I0605 17:37:17.634871 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.linear_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.634902 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.linear_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.634932 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.linear_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.634961 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.linear_tpl.einsum_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.634991 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.linear_tpl.einsum_tpl.cls : type/praxis.layers.base_ops/Einsum
I0605 17:37:17.635022 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.linear_tpl.einsum_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.635052 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.linear_tpl.einsum_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.635082 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.linear_tpl.einsum_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.635112 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.linear_tpl.einsum_tpl.fprop_dtype : NoneType
I0605 17:37:17.635141 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.linear_tpl.einsum_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.635172 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.linear_tpl.einsum_tpl.mesh_axis_names : NoneType
I0605 17:37:17.635202 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.linear_tpl.einsum_tpl.name : NoneType
I0605 17:37:17.635232 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.linear_tpl.einsum_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.635263 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.linear_tpl.einsum_tpl.params_init.method : 'xavier'
I0605 17:37:17.635293 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.linear_tpl.einsum_tpl.params_init.scale : 1.000001
I0605 17:37:17.635324 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.linear_tpl.einsum_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.635354 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.linear_tpl.einsum_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.635384 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.linear_tpl.einsum_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.635414 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.linear_tpl.fprop_dtype : NoneType
I0605 17:37:17.635444 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.linear_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.635474 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.linear_tpl.input_dims : 0
I0605 17:37:17.635504 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.linear_tpl.mesh_axis_names : NoneType
I0605 17:37:17.635534 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.linear_tpl.name : NoneType
I0605 17:37:17.635565 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.linear_tpl.output_dims : 0
I0605 17:37:17.635594 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.linear_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.635624 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.linear_tpl.params_init.method : 'xavier'
I0605 17:37:17.635654 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.linear_tpl.params_init.scale : 1.000001
I0605 17:37:17.635684 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.linear_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.635714 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.linear_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.635744 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.linear_tpl.weight_init : NoneType
I0605 17:37:17.635774 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.linear_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.635804 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.mesh_axis_names : NoneType
I0605 17:37:17.635834 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.name : NoneType
I0605 17:37:17.635864 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.output_dims : 0
I0605 17:37:17.635895 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.635925 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.params_init.method : 'xavier'
I0605 17:37:17.635955 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.params_init.scale : 1.000001
I0605 17:37:17.635985 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.636015 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.636045 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.weight_init : NoneType
I0605 17:37:17.636075 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.feed_forward_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.636105 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.fprop_dtype : NoneType
I0605 17:37:17.636136 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.636165 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.input_dims : 0
I0605 17:37:17.636195 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.label_smoothing_apply_for_eval : True
I0605 17:37:17.636226 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.label_smoothing_prob : 0.0
I0605 17:37:17.636256 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.lookup_style : 'index'
I0605 17:37:17.636286 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.mesh_axis_names : NoneType
I0605 17:37:17.636316 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.name : NoneType
I0605 17:37:17.636346 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.num_classes : 0
I0605 17:37:17.636376 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.params_init.method : 'gaussian'
I0605 17:37:17.636406 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.params_init.scale : 0.022097086912079608
I0605 17:37:17.636436 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.scale_sqrt_depth : True
I0605 17:37:17.636466 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.636496 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.636526 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.soft_cap_logits : 30.0
I0605 17:37:17.636556 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.636586 139756486557120 train.py:171] model.lm_tpl.softmax_tpl.z_loss_weight : 0.0
I0605 17:37:17.636616 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.636646 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.636676 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.atten_dropout_prob : NoneType
I0605 17:37:17.636706 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.cls : type/praxis.layers.transformers/StackedTransformer
I0605 17:37:17.636736 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.contiguous_submeshes : NoneType
I0605 17:37:17.636766 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.dcn_mesh_shape : NoneType
I0605 17:37:17.636796 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.dim_per_head : 64
I0605 17:37:17.636826 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.dropout_prob : 0.0
I0605 17:37:17.636857 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.dtype : type/jax.numpy/float32
I0605 17:37:17.636888 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.fold_padding_with_segment_mask : False
I0605 17:37:17.636918 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.fprop_dtype : NoneType
I0605 17:37:17.636949 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.gating_func : 'top2'
I0605 17:37:17.636978 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.hidden_dims : 8192
I0605 17:37:17.637008 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.ici_mesh_shape : NoneType
I0605 17:37:17.637038 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.input_dropout_prob : 0.0
I0605 17:37:17.637068 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.mask_self_attention : False
I0605 17:37:17.637098 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.mesh_axis_names : NoneType
I0605 17:37:17.637149 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.min_group_size : NoneType
I0605 17:37:17.637181 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.model_dims : 2048
I0605 17:37:17.637211 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.activation_split_dims_mapping.egch : NoneType
I0605 17:37:17.637242 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.activation_split_dims_mapping.egcm : NoneType
I0605 17:37:17.637273 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.activation_split_dims_mapping.gec : NoneType
I0605 17:37:17.637304 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.activation_split_dims_mapping.gecm : NoneType
I0605 17:37:17.637334 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.activation_split_dims_mapping.gecs : NoneType
I0605 17:37:17.637365 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.activation_split_dims_mapping.gs : NoneType
I0605 17:37:17.637395 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.activation_split_dims_mapping.gsec : NoneType
I0605 17:37:17.637425 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.activation_split_dims_mapping.gsm : NoneType
I0605 17:37:17.637456 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.637486 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.activation_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.637517 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.activation_tpl.cls : type/praxis.layers.activations/ReLU
I0605 17:37:17.637548 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.activation_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.637578 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.activation_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.637609 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.activation_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.637638 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.activation_tpl.fprop_dtype : NoneType
I0605 17:37:17.637669 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.activation_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.637699 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.activation_tpl.mesh_axis_names : NoneType
I0605 17:37:17.637729 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.activation_tpl.name : NoneType
I0605 17:37:17.637759 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.activation_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.637790 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.activation_tpl.params_init.method : 'xavier'
I0605 17:37:17.637820 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.activation_tpl.params_init.scale : 1.000001
I0605 17:37:17.637851 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.activation_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.637882 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.activation_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.637912 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.activation_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.637943 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.add_skip_connection : True
I0605 17:37:17.637974 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.apply_padding_first : False
I0605 17:37:17.638004 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.cls : type/praxis.layers.transformers/TransformerFeedForwardMoe
I0605 17:37:17.638034 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.638064 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.638095 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.638125 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.expert_capacity_dim : 0
I0605 17:37:17.638155 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.expert_weight_shards : 1
I0605 17:37:17.638185 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.explicit_fan_in_fan_out_axes : False
I0605 17:37:17.638216 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.fprop_dtype : NoneType
I0605 17:37:17.638245 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.gating_func : 'top2'
I0605 17:37:17.638275 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.gating_logit_cap : 0.0
I0605 17:37:17.638305 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.hidden_dims : 0
I0605 17:37:17.638335 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.638365 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.input_dims : 0
I0605 17:37:17.638394 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.internal_gshard_variance_scaling_fan_in_init : True
I0605 17:37:17.638424 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.ln_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.638454 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.ln_tpl.cls : type/praxis.layers.normalizations/LayerNorm
I0605 17:37:17.638485 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.ln_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.638515 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.ln_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.638545 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.ln_tpl.dim : 0
I0605 17:37:17.638575 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.ln_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.638604 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.ln_tpl.epsilon : 1e-06
I0605 17:37:17.638634 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.ln_tpl.fprop_dtype : NoneType
I0605 17:37:17.638664 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.ln_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.638695 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.ln_tpl.mesh_axis_names : NoneType
I0605 17:37:17.638724 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.ln_tpl.name : NoneType
I0605 17:37:17.638754 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.ln_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.638784 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.ln_tpl.params_init.method : 'xavier'
I0605 17:37:17.638813 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.ln_tpl.params_init.scale : 1.000001
I0605 17:37:17.638843 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.ln_tpl.reductions_in_fp32 : False
I0605 17:37:17.638873 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.ln_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.638903 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.ln_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.638933 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.ln_tpl.use_bias : True
I0605 17:37:17.638963 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.ln_tpl.use_scale : True
I0605 17:37:17.638993 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.ln_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.639023 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.mesh_axis_names : NoneType
I0605 17:37:17.639053 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.min_group_size : NoneType
I0605 17:37:17.639083 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.moe_gating_embedding_level : 'token'
I0605 17:37:17.639112 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.moe_load_balance_loss_weight : 1.0
I0605 17:37:17.639142 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.name : NoneType
I0605 17:37:17.639172 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.norm_policy : 'pre'
I0605 17:37:17.639203 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.num_experts : 0
I0605 17:37:17.639233 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.num_groups : 0
I0605 17:37:17.639262 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.639292 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.params_init.method : 'xavier'
I0605 17:37:17.639322 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.params_init.scale : 1.000001
I0605 17:37:17.639352 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.relu_dropout_prob : 0.0
I0605 17:37:17.639382 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.relu_dropout_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.639413 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.relu_dropout_tpl.cls : type/praxis.layers.stochastics/Dropout
I0605 17:37:17.639443 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.relu_dropout_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.639473 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.relu_dropout_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.639504 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.relu_dropout_tpl.dropout_at_eval : False
I0605 17:37:17.639533 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.relu_dropout_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.639564 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.relu_dropout_tpl.fprop_dtype : NoneType
I0605 17:37:17.639594 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.relu_dropout_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.639624 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.relu_dropout_tpl.keep_prob : 1.0
I0605 17:37:17.639654 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.relu_dropout_tpl.mesh_axis_names : NoneType
I0605 17:37:17.639684 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.relu_dropout_tpl.name : NoneType
I0605 17:37:17.639714 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.relu_dropout_tpl.noise_shape : NoneType
I0605 17:37:17.639744 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.relu_dropout_tpl.noise_shape_broadcast_dims : NoneType
I0605 17:37:17.639774 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.relu_dropout_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.639804 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.relu_dropout_tpl.params_init.method : 'xavier'
I0605 17:37:17.639834 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.relu_dropout_tpl.params_init.scale : 1.000001
I0605 17:37:17.639864 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.relu_dropout_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.639894 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.relu_dropout_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.639924 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.relu_dropout_tpl.transpose_qk : False
I0605 17:37:17.639954 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.relu_dropout_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.639984 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.residual_dropout_prob : 0.0
I0605 17:37:17.640014 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.residual_dropout_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.640045 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.residual_dropout_tpl.cls : type/praxis.layers.stochastics/Dropout
I0605 17:37:17.640075 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.residual_dropout_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.640106 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.residual_dropout_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.640136 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.residual_dropout_tpl.dropout_at_eval : False
I0605 17:37:17.640166 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.residual_dropout_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.640196 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.residual_dropout_tpl.fprop_dtype : NoneType
I0605 17:37:17.640226 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.residual_dropout_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.640257 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.residual_dropout_tpl.keep_prob : 1.0
I0605 17:37:17.640287 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.residual_dropout_tpl.mesh_axis_names : NoneType
I0605 17:37:17.640317 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.residual_dropout_tpl.name : NoneType
I0605 17:37:17.640347 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.residual_dropout_tpl.noise_shape : NoneType
I0605 17:37:17.640377 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.residual_dropout_tpl.noise_shape_broadcast_dims : NoneType
I0605 17:37:17.640407 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.residual_dropout_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.640438 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.residual_dropout_tpl.params_init.method : 'xavier'
I0605 17:37:17.640468 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.residual_dropout_tpl.params_init.scale : 1.000001
I0605 17:37:17.640498 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.residual_dropout_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.640528 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.residual_dropout_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.640558 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.residual_dropout_tpl.transpose_qk : False
I0605 17:37:17.640588 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.residual_dropout_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.640617 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.residual_droppath_prob : 0.0
I0605 17:37:17.640647 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.residual_weight : 1.0
I0605 17:37:17.640677 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.second_expert_policy : 'all'
I0605 17:37:17.640707 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.640737 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.640768 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.unadjusted_expert_capacity_factor : 2.0
I0605 17:37:17.640797 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.use_gated_activation : False
I0605 17:37:17.640828 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.weight_split_dims_mapping.ehm : NoneType
I0605 17:37:17.640858 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.weight_split_dims_mapping.emh : NoneType
I0605 17:37:17.640888 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.weight_split_dims_mapping.me : NoneType
I0605 17:37:17.640918 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.moe_layer_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.640948 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.name : NoneType
I0605 17:37:17.640979 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.ngrammer_tpls : NoneType
I0605 17:37:17.641009 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.num_experts : 0
I0605 17:37:17.641039 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.num_groups : 1
I0605 17:37:17.641069 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.num_heads : 32
I0605 17:37:17.641099 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.num_layers : 1
I0605 17:37:17.641137 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.packed_input : False
I0605 17:37:17.641168 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.641199 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.params_init.method : 'xavier'
I0605 17:37:17.641229 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.params_init.scale : 1.000001
I0605 17:37:17.641259 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.relu_dropout_prob : NoneType
I0605 17:37:17.641289 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.residual_dropout_prob : NoneType
I0605 17:37:17.641318 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.residual_droppath_prob : 0.0
I0605 17:37:17.641348 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.shared_weight_layer_id : NoneType
I0605 17:37:17.641378 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.skip_lp_regularization : NoneType
I0605 17:37:17.641408 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.641438 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.allow_skip_cross_attention : False
I0605 17:37:17.641468 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.atten_dropout_prob : 0.0
I0605 17:37:17.641499 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.cls : type/praxis.layers.transformers/Transformer
I0605 17:37:17.641529 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.641559 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.cross_atten_tpl : NoneType
I0605 17:37:17.641589 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.641619 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.dim_per_head : NoneType
I0605 17:37:17.641650 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.dropout_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.641684 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.dropout_tpl.cls : type/praxis.layers.stochastics/Dropout
I0605 17:37:17.641715 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.dropout_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.641745 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.dropout_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.641775 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.dropout_tpl.dropout_at_eval : False
I0605 17:37:17.641805 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.dropout_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.641834 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.dropout_tpl.fprop_dtype : NoneType
I0605 17:37:17.641865 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.dropout_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.641894 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.dropout_tpl.keep_prob : 1.0
I0605 17:37:17.641924 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.dropout_tpl.mesh_axis_names : NoneType
I0605 17:37:17.641954 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.dropout_tpl.name : NoneType
I0605 17:37:17.641984 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.dropout_tpl.noise_shape : NoneType
I0605 17:37:17.642015 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.dropout_tpl.noise_shape_broadcast_dims : NoneType
I0605 17:37:17.642046 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.dropout_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.642076 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.dropout_tpl.params_init.method : 'xavier'
I0605 17:37:17.642107 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.dropout_tpl.params_init.scale : 1.000001
I0605 17:37:17.642137 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.dropout_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.642167 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.dropout_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.642197 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.dropout_tpl.transpose_qk : False
I0605 17:37:17.642227 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.dropout_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.642257 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.642287 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.fprop_dtype : NoneType
I0605 17:37:17.642318 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.hidden_dims : 0
I0605 17:37:17.642349 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.642379 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.input_dims : 0
I0605 17:37:17.642410 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.ln_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.642441 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.ln_tpl.cls : type/praxis.layers.normalizations/LayerNorm
I0605 17:37:17.642472 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.ln_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.642502 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.ln_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.642532 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.ln_tpl.dim : 0
I0605 17:37:17.642563 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.ln_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.642593 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.ln_tpl.epsilon : 1e-06
I0605 17:37:17.642624 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.ln_tpl.fprop_dtype : NoneType
I0605 17:37:17.642654 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.ln_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.642684 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.ln_tpl.mesh_axis_names : NoneType
I0605 17:37:17.642715 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.ln_tpl.name : NoneType
I0605 17:37:17.642745 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.ln_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.642776 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.ln_tpl.params_init.method : 'xavier'
I0605 17:37:17.642806 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.ln_tpl.params_init.scale : 1.000001
I0605 17:37:17.642836 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.ln_tpl.reductions_in_fp32 : False
I0605 17:37:17.642866 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.ln_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.642896 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.ln_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.642927 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.ln_tpl.use_bias : True
I0605 17:37:17.642957 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.ln_tpl.use_scale : True
I0605 17:37:17.642987 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.ln_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.643017 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.mesh_axis_names : NoneType
I0605 17:37:17.643048 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.name : NoneType
I0605 17:37:17.643078 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.ngrammer_tpl : NoneType
I0605 17:37:17.643109 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.norm_policy : 'pre'
I0605 17:37:17.643139 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.num_heads : NoneType
I0605 17:37:17.643170 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.packed_input : False
I0605 17:37:17.643200 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.643230 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.params_init.method : 'xavier'
I0605 17:37:17.643261 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.params_init.scale : 1.000001
I0605 17:37:17.643291 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.relu_dropout_prob : 0.0
I0605 17:37:17.643321 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.residual_dropout_prob : 0.0
I0605 17:37:17.643351 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.residual_droppath_prob : 0.0
I0605 17:37:17.643382 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.643412 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.643442 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.activation_split_dims_mapping.bld : NoneType
I0605 17:37:17.643473 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.activation_split_dims_mapping.blnh : NoneType
I0605 17:37:17.643503 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.643534 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.atten_dropout_prob : 0.0
I0605 17:37:17.643564 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.atten_logit_cap : 50.0
I0605 17:37:17.643594 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.attention_extra_logit : NoneType
I0605 17:37:17.643625 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.attention_mask_summary : False
I0605 17:37:17.643655 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.cast_rotary_position_emb : True
I0605 17:37:17.643687 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.cls : type/praxis.layers.attentions/DotProductAttention
I0605 17:37:17.643719 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combine_qkv : True
I0605 17:37:17.643751 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.643784 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.attention_combine_dims : False
I0605 17:37:17.643817 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.cls : type/praxis.layers.attentions/CombinedQKVProjectionLayer
I0605 17:37:17.643851 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.643883 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.643916 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.dim_per_head : 0
I0605 17:37:17.643948 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.643980 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.einsum_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.644013 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.einsum_tpl.cls : type/praxis.layers.base_ops/Einsum
I0605 17:37:17.644045 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.einsum_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.644078 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.einsum_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.644111 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.einsum_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.644143 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.einsum_tpl.fprop_dtype : NoneType
I0605 17:37:17.644176 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.einsum_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.644207 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.einsum_tpl.mesh_axis_names : NoneType
I0605 17:37:17.644239 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.einsum_tpl.name : NoneType
I0605 17:37:17.644272 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.einsum_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.644304 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.einsum_tpl.params_init.method : 'xavier'
I0605 17:37:17.644336 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.einsum_tpl.params_init.scale : 1.000001
I0605 17:37:17.644369 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.einsum_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.644401 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.einsum_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.644433 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.einsum_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.644466 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.explicit_fan_in_fan_out_axes : False
I0605 17:37:17.644498 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.fprop_dtype : NoneType
I0605 17:37:17.644530 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.644562 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.input_dim : 0
I0605 17:37:17.644595 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.mesh_axis_names : NoneType
I0605 17:37:17.644627 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.name : NoneType
I0605 17:37:17.644659 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.num_heads : 0
I0605 17:37:17.644696 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.644729 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.params_init.method : 'xavier'
I0605 17:37:17.644761 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.params_init.scale : 1.000001
I0605 17:37:17.644793 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.644824 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.644855 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.use_bias : True
I0605 17:37:17.644885 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.combined_qkv_proj_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.644916 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.644946 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.644976 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.dconv_kernel_size : 3
I0605 17:37:17.645007 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.dconv_qkv : False
I0605 17:37:17.645037 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.decode_cache : True
I0605 17:37:17.645068 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.dim_per_head : NoneType
I0605 17:37:17.645099 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.dropout_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.645147 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.dropout_tpl.cls : type/praxis.layers.stochastics/Dropout
I0605 17:37:17.645180 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.dropout_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.645210 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.dropout_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.645241 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.dropout_tpl.dropout_at_eval : False
I0605 17:37:17.645272 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.dropout_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.645303 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.dropout_tpl.fprop_dtype : NoneType
I0605 17:37:17.645333 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.dropout_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.645364 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.dropout_tpl.keep_prob : 1.0
I0605 17:37:17.645395 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.dropout_tpl.mesh_axis_names : NoneType
I0605 17:37:17.645425 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.dropout_tpl.name : NoneType
I0605 17:37:17.645455 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.dropout_tpl.noise_shape : NoneType
I0605 17:37:17.645486 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.dropout_tpl.noise_shape_broadcast_dims : NoneType
I0605 17:37:17.645516 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.dropout_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.645547 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.dropout_tpl.params_init.method : 'xavier'
I0605 17:37:17.645577 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.dropout_tpl.params_init.scale : 1.000001
I0605 17:37:17.645608 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.dropout_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.645639 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.dropout_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.645669 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.dropout_tpl.transpose_qk : False
I0605 17:37:17.645700 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.dropout_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.645731 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.645761 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.fprop_dtype : NoneType
I0605 17:37:17.645791 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.hidden_dim : 0
I0605 17:37:17.645821 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.645852 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.input_dim : 0
I0605 17:37:17.645882 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.internal_enable_per_dim_scale : True
I0605 17:37:17.645913 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.internal_enable_query_scale : True
I0605 17:37:17.645943 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.internal_gshard_gaussian_init : False
I0605 17:37:17.645973 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.mesh_axis_names : NoneType
I0605 17:37:17.646003 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.name : NoneType
I0605 17:37:17.646034 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.ngrammer_tpl : NoneType
I0605 17:37:17.646064 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.num_heads : 1
I0605 17:37:17.646094 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.output_proj_use_nhd_shape : False
I0605 17:37:17.646125 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.646155 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.params_init.method : 'xavier'
I0605 17:37:17.646185 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.params_init.scale : 1.000001
I0605 17:37:17.646216 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.646246 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.attention_combine_dims : False
I0605 17:37:17.646277 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.cls : type/praxis.layers.attentions/AttentionProjection
I0605 17:37:17.646307 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.646338 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.646369 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.dim_per_head : 0
I0605 17:37:17.646399 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.646430 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.einsum_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.646461 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.einsum_tpl.cls : type/praxis.layers.base_ops/Einsum
I0605 17:37:17.646491 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.einsum_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.646521 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.einsum_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.646552 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.einsum_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.646582 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.einsum_tpl.fprop_dtype : NoneType
I0605 17:37:17.646613 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.einsum_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.646643 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.einsum_tpl.mesh_axis_names : NoneType
I0605 17:37:17.646673 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.einsum_tpl.name : NoneType
I0605 17:37:17.646703 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.einsum_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.646734 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.einsum_tpl.params_init.method : 'xavier'
I0605 17:37:17.646764 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.einsum_tpl.params_init.scale : 1.000001
I0605 17:37:17.646795 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.einsum_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.646825 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.einsum_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.646855 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.einsum_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.646886 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.explicit_fan_in_fan_out_axes : False
I0605 17:37:17.646916 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.fprop_dtype : NoneType
I0605 17:37:17.646946 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.646977 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.input_dim : 0
I0605 17:37:17.647007 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.is_output_projection : False
I0605 17:37:17.647037 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.mesh_axis_names : NoneType
I0605 17:37:17.647068 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.name : NoneType
I0605 17:37:17.647098 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.num_heads : 0
I0605 17:37:17.647128 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.647159 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.params_init.method : 'xavier'
I0605 17:37:17.647189 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.params_init.scale : 1.000001
I0605 17:37:17.647220 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.647250 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.647281 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.use_bias : True
I0605 17:37:17.647312 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.use_nhd_shape : False
I0605 17:37:17.647342 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.proj_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.647373 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.pv_einsum_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.647404 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.pv_einsum_tpl.cls : type/praxis.layers.base_ops/Einsum
I0605 17:37:17.647434 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.pv_einsum_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.647465 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.pv_einsum_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.647496 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.pv_einsum_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.647526 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.pv_einsum_tpl.fprop_dtype : NoneType
I0605 17:37:17.647557 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.pv_einsum_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.647588 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.pv_einsum_tpl.mesh_axis_names : NoneType
I0605 17:37:17.647619 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.pv_einsum_tpl.name : NoneType
I0605 17:37:17.647651 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.pv_einsum_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.647684 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.pv_einsum_tpl.params_init.method : 'xavier'
I0605 17:37:17.647715 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.pv_einsum_tpl.params_init.scale : 1.000001
I0605 17:37:17.647746 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.pv_einsum_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.647777 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.pv_einsum_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.647808 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.pv_einsum_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.647838 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.qk_einsum_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.647868 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.qk_einsum_tpl.cls : type/praxis.layers.base_ops/Einsum
I0605 17:37:17.647899 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.qk_einsum_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.647929 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.qk_einsum_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.647960 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.qk_einsum_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.647990 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.qk_einsum_tpl.fprop_dtype : NoneType
I0605 17:37:17.648021 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.qk_einsum_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.648051 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.qk_einsum_tpl.mesh_axis_names : NoneType
I0605 17:37:17.648082 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.qk_einsum_tpl.name : NoneType
I0605 17:37:17.648113 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.qk_einsum_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.648143 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.qk_einsum_tpl.params_init.method : 'xavier'
I0605 17:37:17.648173 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.qk_einsum_tpl.params_init.scale : 1.000001
I0605 17:37:17.648204 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.qk_einsum_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.648234 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.qk_einsum_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.648265 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.qk_einsum_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.648295 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.relative_bias_tpl : NoneType
I0605 17:37:17.648326 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.rotary_position_emb_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.648357 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.rotary_position_emb_tpl.cast_as_fprop_dtype : True
I0605 17:37:17.648388 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.rotary_position_emb_tpl.cls : type/praxis.layers.embedding_softmax/RotaryPositionalEmbedding
I0605 17:37:17.648419 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.rotary_position_emb_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.648449 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.rotary_position_emb_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.648479 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.rotary_position_emb_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.648510 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.rotary_position_emb_tpl.embedding_dims : 0
I0605 17:37:17.648540 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.rotary_position_emb_tpl.fprop_dtype : NoneType
I0605 17:37:17.648571 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.rotary_position_emb_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.648601 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.rotary_position_emb_tpl.max_timescale : 10000
I0605 17:37:17.648632 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.rotary_position_emb_tpl.mesh_axis_names : NoneType
I0605 17:37:17.648663 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.rotary_position_emb_tpl.min_timescale : 1
I0605 17:37:17.648693 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.rotary_position_emb_tpl.name : NoneType
I0605 17:37:17.648724 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.rotary_position_emb_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.648755 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.rotary_position_emb_tpl.params_init.method : 'xavier'
I0605 17:37:17.648786 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.rotary_position_emb_tpl.params_init.scale : 1.000001
I0605 17:37:17.648816 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.rotary_position_emb_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.648847 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.rotary_position_emb_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.648877 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.rotary_position_emb_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.648907 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.scale_logits_by_head_dims : False
I0605 17:37:17.648937 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.648968 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.648998 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.use_bias : False
I0605 17:37:17.649029 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.use_rotary_position_emb : False
I0605 17:37:17.649059 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.weight_split_dims_mapping.dconv : NoneType
I0605 17:37:17.649090 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.weight_split_dims_mapping.proj : NoneType
I0605 17:37:17.649127 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.649159 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_atten_tpl.zero_fully_masked : False
I0605 17:37:17.649190 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.activation_split_dims_mapping.ffn0 : NoneType
I0605 17:37:17.649220 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.activation_split_dims_mapping.ffn1 : NoneType
I0605 17:37:17.649250 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.649281 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.activation_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.649312 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.activation_tpl.approximate : True
I0605 17:37:17.649342 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.activation_tpl.cls : type/praxis.layers.activations/GELU
I0605 17:37:17.649373 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.activation_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.649403 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.activation_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.649433 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.activation_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.649464 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.activation_tpl.fprop_dtype : NoneType
I0605 17:37:17.649494 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.activation_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.649525 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.activation_tpl.mesh_axis_names : NoneType
I0605 17:37:17.649555 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.activation_tpl.name : NoneType
I0605 17:37:17.649586 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.activation_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.649616 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.activation_tpl.params_init.method : 'xavier'
I0605 17:37:17.649647 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.activation_tpl.params_init.scale : 1.000001
I0605 17:37:17.649677 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.activation_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.649708 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.activation_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.649739 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.activation_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.649769 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.add_skip_connection : True
I0605 17:37:17.649799 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.apply_padding_first : False
I0605 17:37:17.649830 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.cls : type/praxis.layers.transformers/TransformerFeedForward
I0605 17:37:17.649861 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.649891 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.649922 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.649952 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.649982 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.activation_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.650013 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.activation_tpl.cls : type/praxis.layers.activations/ReLU
I0605 17:37:17.650044 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.activation_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.650074 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.activation_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.650104 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.activation_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.650135 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.activation_tpl.fprop_dtype : NoneType
I0605 17:37:17.650165 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.activation_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.650195 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.activation_tpl.mesh_axis_names : NoneType
I0605 17:37:17.650226 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.activation_tpl.name : NoneType
I0605 17:37:17.650256 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.activation_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.650288 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.activation_tpl.params_init.method : 'xavier'
I0605 17:37:17.650318 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.activation_tpl.params_init.scale : 1.000001
I0605 17:37:17.650348 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.activation_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.650379 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.activation_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.650409 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.activation_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.650440 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.bias_init : 0.0
I0605 17:37:17.650470 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.cls : type/praxis.layers.linears/FeedForward
I0605 17:37:17.650501 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.650531 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.650561 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.650592 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.fprop_dtype : NoneType
I0605 17:37:17.650622 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.has_bias : True
I0605 17:37:17.650655 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.650688 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.input_dims : 0
I0605 17:37:17.650718 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.linear_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.650749 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.linear_tpl.cls : type/praxis.layers.linears/Linear
I0605 17:37:17.650779 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.linear_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.650809 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.linear_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.650840 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.linear_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.650871 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.linear_tpl.einsum_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.650901 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.linear_tpl.einsum_tpl.cls : type/praxis.layers.base_ops/Einsum
I0605 17:37:17.650932 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.linear_tpl.einsum_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.650963 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.linear_tpl.einsum_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.650994 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.linear_tpl.einsum_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.651025 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.linear_tpl.einsum_tpl.fprop_dtype : NoneType
I0605 17:37:17.651055 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.linear_tpl.einsum_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.651086 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.linear_tpl.einsum_tpl.mesh_axis_names : NoneType
I0605 17:37:17.651116 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.linear_tpl.einsum_tpl.name : NoneType
I0605 17:37:17.651147 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.linear_tpl.einsum_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.651178 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.linear_tpl.einsum_tpl.params_init.method : 'xavier'
I0605 17:37:17.651209 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.linear_tpl.einsum_tpl.params_init.scale : 1.000001
I0605 17:37:17.651240 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.linear_tpl.einsum_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.651270 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.linear_tpl.einsum_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.651301 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.linear_tpl.einsum_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.651332 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.linear_tpl.fprop_dtype : NoneType
I0605 17:37:17.651362 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.linear_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.651392 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.linear_tpl.input_dims : 0
I0605 17:37:17.651423 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.linear_tpl.mesh_axis_names : NoneType
I0605 17:37:17.651454 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.linear_tpl.name : NoneType
I0605 17:37:17.651484 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.linear_tpl.output_dims : 0
I0605 17:37:17.651514 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.linear_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.651545 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.linear_tpl.params_init.method : 'xavier'
I0605 17:37:17.651576 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.linear_tpl.params_init.scale : 1.000001
I0605 17:37:17.651606 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.linear_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.651637 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.linear_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.651667 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.linear_tpl.weight_init : NoneType
I0605 17:37:17.651697 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.linear_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.651728 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.mesh_axis_names : NoneType
I0605 17:37:17.651758 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.name : NoneType
I0605 17:37:17.651789 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.output_dims : 0
I0605 17:37:17.651820 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.651850 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.params_init.method : 'xavier'
I0605 17:37:17.651881 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.params_init.scale : 1.000001
I0605 17:37:17.651912 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.651942 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.651973 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.weight_init : NoneType
I0605 17:37:17.652003 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fflayer_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.652033 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.fprop_dtype : NoneType
I0605 17:37:17.652064 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.has_bias : True
I0605 17:37:17.652095 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.hidden_dims : 0
I0605 17:37:17.652125 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.652156 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.input_dims : 0
I0605 17:37:17.652187 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.internal_gshard_variance_scaling_fan_in_init : False
I0605 17:37:17.652217 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.ln_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.652247 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.ln_tpl.cls : type/praxis.layers.normalizations/LayerNorm
I0605 17:37:17.652277 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.ln_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.652308 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.ln_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.652339 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.ln_tpl.dim : 0
I0605 17:37:17.652369 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.ln_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.652400 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.ln_tpl.epsilon : 1e-06
I0605 17:37:17.652431 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.ln_tpl.fprop_dtype : NoneType
I0605 17:37:17.652461 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.ln_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.652492 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.ln_tpl.mesh_axis_names : NoneType
I0605 17:37:17.652522 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.ln_tpl.name : NoneType
I0605 17:37:17.652552 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.ln_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.652583 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.ln_tpl.params_init.method : 'xavier'
I0605 17:37:17.652613 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.ln_tpl.params_init.scale : 1.000001
I0605 17:37:17.652643 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.ln_tpl.reductions_in_fp32 : False
I0605 17:37:17.652673 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.ln_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.652704 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.ln_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.652735 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.ln_tpl.use_bias : True
I0605 17:37:17.652765 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.ln_tpl.use_scale : True
I0605 17:37:17.652795 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.ln_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.652826 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.mesh_axis_names : NoneType
I0605 17:37:17.652857 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.name : NoneType
I0605 17:37:17.652887 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.norm_policy : 'pre'
I0605 17:37:17.652917 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.output_dims : 0
I0605 17:37:17.652948 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.652978 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.params_init.method : 'xavier'
I0605 17:37:17.653009 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.params_init.scale : 1.000001
I0605 17:37:17.653039 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.relu_dropout_prob : 0.0
I0605 17:37:17.653069 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.relu_dropout_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.653105 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.relu_dropout_tpl.cls : type/praxis.layers.stochastics/Dropout
I0605 17:37:17.653138 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.relu_dropout_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.653170 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.relu_dropout_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.653200 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.relu_dropout_tpl.dropout_at_eval : False
I0605 17:37:17.653231 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.relu_dropout_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.653262 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.relu_dropout_tpl.fprop_dtype : NoneType
I0605 17:37:17.653292 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.relu_dropout_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.653323 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.relu_dropout_tpl.keep_prob : 1.0
I0605 17:37:17.653354 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.relu_dropout_tpl.mesh_axis_names : NoneType
I0605 17:37:17.653384 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.relu_dropout_tpl.name : NoneType
I0605 17:37:17.653415 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.relu_dropout_tpl.noise_shape : NoneType
I0605 17:37:17.653446 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.relu_dropout_tpl.noise_shape_broadcast_dims : NoneType
I0605 17:37:17.653477 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.relu_dropout_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.653507 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.relu_dropout_tpl.params_init.method : 'xavier'
I0605 17:37:17.653537 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.relu_dropout_tpl.params_init.scale : 1.000001
I0605 17:37:17.653568 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.relu_dropout_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.653599 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.relu_dropout_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.653630 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.relu_dropout_tpl.transpose_qk : False
I0605 17:37:17.653660 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.relu_dropout_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.653691 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.residual_dropout_prob : 0.0
I0605 17:37:17.653725 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.residual_dropout_tpl.activation_split_dims_mapping.out : NoneType
I0605 17:37:17.653755 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.residual_dropout_tpl.cls : type/praxis.layers.stochastics/Dropout
I0605 17:37:17.653786 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.residual_dropout_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.653817 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.residual_dropout_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.653847 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.residual_dropout_tpl.dropout_at_eval : False
I0605 17:37:17.653878 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.residual_dropout_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.653909 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.residual_dropout_tpl.fprop_dtype : NoneType
I0605 17:37:17.653939 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.residual_dropout_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.653970 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.residual_dropout_tpl.keep_prob : 1.0
I0605 17:37:17.654000 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.residual_dropout_tpl.mesh_axis_names : NoneType
I0605 17:37:17.654031 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.residual_dropout_tpl.name : NoneType
I0605 17:37:17.654061 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.residual_dropout_tpl.noise_shape : NoneType
I0605 17:37:17.654092 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.residual_dropout_tpl.noise_shape_broadcast_dims : NoneType
I0605 17:37:17.654122 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.residual_dropout_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.654153 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.residual_dropout_tpl.params_init.method : 'xavier'
I0605 17:37:17.654186 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.residual_dropout_tpl.params_init.scale : 1.000001
I0605 17:37:17.654217 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.residual_dropout_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.654248 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.residual_dropout_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.654278 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.residual_dropout_tpl.transpose_qk : False
I0605 17:37:17.654308 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.residual_dropout_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.654339 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.residual_droppath_prob : 0.0
I0605 17:37:17.654369 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.residual_weight : 1.0
I0605 17:37:17.654399 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.654429 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.654459 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.use_gated_activation : False
I0605 17:37:17.654489 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.weight_split_dims_mapping.ffn0 : NoneType
I0605 17:37:17.654520 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.weight_split_dims_mapping.ffn1 : NoneType
I0605 17:37:17.654550 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.tr_fflayer_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.654581 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.use_cross_attention : False
I0605 17:37:17.654611 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.transformer_layer_params_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.654642 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.unadjusted_expert_capacity_factor : 2.0
I0605 17:37:17.654674 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.use_cross_attention : False
I0605 17:37:17.654704 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.block.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.654735 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.checkpoint_policy : 'save_nothing'
I0605 17:37:17.654766 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.cls : type/praxis.layers.transformers/StackedTransformerRepeated
I0605 17:37:17.654796 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.contiguous_submeshes : NoneType
I0605 17:37:17.654826 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.dcn_mesh_shape : NoneType
I0605 17:37:17.654856 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.dtype : type/jax.numpy/float32
I0605 17:37:17.654886 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.fprop_dtype : NoneType
I0605 17:37:17.654917 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.ici_mesh_shape : NoneType
I0605 17:37:17.654947 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.mesh_axis_names : NoneType
I0605 17:37:17.654977 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.name : NoneType
I0605 17:37:17.655007 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.nd_prefix_shape : NoneType
I0605 17:37:17.655038 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.params_init.cls : type/praxis.base_layer/WeightInit
I0605 17:37:17.655068 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.params_init.method : 'xavier'
I0605 17:37:17.655098 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.params_init.scale : 1.000001
I0605 17:37:17.655129 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.repeat_layer_name : 'repeat'
I0605 17:37:17.655158 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.repeat_optimizer_dims_mapping : NoneType
I0605 17:37:17.655188 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.shared_weight_layer_id : NoneType
I0605 17:37:17.655219 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.skip_lp_regularization : NoneType
I0605 17:37:17.655249 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.sublayer_name : 'sub'
I0605 17:37:17.655279 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.unroll_in_decode : True
I0605 17:37:17.655310 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.weight_split_dims_mapping.block : NoneType
I0605 17:37:17.655340 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.655370 139756486557120 train.py:171] model.lm_tpl.stacked_transformer_tpl.x_times : 24
I0605 17:37:17.655400 139756486557120 train.py:171] model.lm_tpl.vocab_size : 51200
I0605 17:37:17.655430 139756486557120 train.py:171] model.lm_tpl.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.655461 139756486557120 train.py:171] model.mesh_axis_names : NoneType
I0605 17:37:17.655491 139756486557120 train.py:171] model.model_type : 'causal'
I0605 17:37:17.655521 139756486557120 train.py:171] model.name : 'xformer_lm'
I0605 17:37:17.655550 139756486557120 train.py:171] model.params_init.method : 'gaussian'
I0605 17:37:17.655580 139756486557120 train.py:171] model.params_init.scale : 0.023
I0605 17:37:17.655610 139756486557120 train.py:171] model.report_strict_acc : False
I0605 17:37:17.655640 139756486557120 train.py:171] model.return_predictions : False
I0605 17:37:17.655670 139756486557120 train.py:171] model.shared_weight_layer_id : NoneType
I0605 17:37:17.655700 139756486557120 train.py:171] model.skip_lp_regularization : NoneType
I0605 17:37:17.655730 139756486557120 train.py:171] model.weight_split_dims_mapping.wt : NoneType
I0605 17:37:17.655761 139756486557120 train.py:171] name : 'xformer_task'
I0605 17:37:17.655791 139756486557120 train.py:171] summary_verbosity : 3
I0605 17:37:17.655821 139756486557120 train.py:171] train.always_use_train_for_model_init : True
I0605 17:37:17.655851 139756486557120 train.py:171] train.apply_mutable_list : ['aux_loss', 'summaries', 'non_trainable', 'batch_stats', 'params_axes']
I0605 17:37:17.655882 139756486557120 train.py:171] train.async_summary_writing : True
I0605 17:37:17.655912 139756486557120 train.py:171] train.cls : type/paxml.tasks_lib/SingleTask.Train
I0605 17:37:17.655942 139756486557120 train.py:171] train.decode_interval_steps : NoneType
I0605 17:37:17.655972 139756486557120 train.py:171] train.decode_start_after_n_steps : 0
I0605 17:37:17.656003 139756486557120 train.py:171] train.decode_use_ema_states : False
I0605 17:37:17.656033 139756486557120 train.py:171] train.device_sync_interval_steps : NoneType
I0605 17:37:17.656064 139756486557120 train.py:171] train.enable_input_checkpointing : False
I0605 17:37:17.656093 139756486557120 train.py:171] train.enforce_input_specs : False
I0605 17:37:17.656123 139756486557120 train.py:171] train.eval_interval_steps : 100
I0605 17:37:17.656153 139756486557120 train.py:171] train.eval_skip_train : False
I0605 17:37:17.656183 139756486557120 train.py:171] train.eval_use_ema_states : False
I0605 17:37:17.656213 139756486557120 train.py:171] train.external_checkpoint_handler : NoneType
I0605 17:37:17.656244 139756486557120 train.py:171] train.external_checkpoint_path : NoneType
I0605 17:37:17.656274 139756486557120 train.py:171] train.inputs_split_mapping : NoneType
I0605 17:37:17.656304 139756486557120 train.py:171] train.learner.check_valid_step : True
I0605 17:37:17.656334 139756486557120 train.py:171] train.learner.cls : type/paxml.learners/Learner
I0605 17:37:17.656365 139756486557120 train.py:171] train.learner.enable_skip_step_on_gradient_anomalies : True
I0605 17:37:17.656395 139756486557120 train.py:171] train.learner.force_repeat_prefix_structure : False
I0605 17:37:17.656425 139756486557120 train.py:171] train.learner.grad_norm_individual_vars : False
I0605 17:37:17.656456 139756486557120 train.py:171] train.learner.grad_norm_summary : True
I0605 17:37:17.656486 139756486557120 train.py:171] train.learner.keep_optimizer_state_for_excluded_vars : False
I0605 17:37:17.656517 139756486557120 train.py:171] train.learner.loss_name : 'total_loss'
I0605 17:37:17.656547 139756486557120 train.py:171] train.learner.name : ''
I0605 17:37:17.656577 139756486557120 train.py:171] train.learner.optimizer.beta1 : 0.9
I0605 17:37:17.656607 139756486557120 train.py:171] train.learner.optimizer.beta2 : 0.95
I0605 17:37:17.656638 139756486557120 train.py:171] train.learner.optimizer.clip_gradient_norm_to_value : 1.0
I0605 17:37:17.656668 139756486557120 train.py:171] train.learner.optimizer.clip_gradient_single_norm_to_value : 0.0
I0605 17:37:17.656698 139756486557120 train.py:171] train.learner.optimizer.clip_threshold : 1.0
I0605 17:37:17.656729 139756486557120 train.py:171] train.learner.optimizer.cls : type/praxis.optimizers/Adam
I0605 17:37:17.656759 139756486557120 train.py:171] train.learner.optimizer.decoupled_weight_decay : NoneType
I0605 17:37:17.656790 139756486557120 train.py:171] train.learner.optimizer.ema_decay : 0.0
I0605 17:37:17.656820 139756486557120 train.py:171] train.learner.optimizer.epsilon : 1e-08
I0605 17:37:17.656850 139756486557120 train.py:171] train.learner.optimizer.epsilon_root : 0.0
I0605 17:37:17.656880 139756486557120 train.py:171] train.learner.optimizer.ewc_regularizer_weight : 0.0
I0605 17:37:17.656910 139756486557120 train.py:171] train.learner.optimizer.ewc_weight_per_var : NoneType
I0605 17:37:17.656940 139756486557120 train.py:171] train.learner.optimizer.l1_regularizer_weight : NoneType
I0605 17:37:17.656970 139756486557120 train.py:171] train.learner.optimizer.l2_regularizer_weight : NoneType
I0605 17:37:17.657001 139756486557120 train.py:171] train.learner.optimizer.learning_rate : 0.0006
I0605 17:37:17.657031 139756486557120 train.py:171] train.learner.optimizer.lr_schedule.cls : type/praxis.schedules/LinearRampupCosineDecay
I0605 17:37:17.657062 139756486557120 train.py:171] train.learner.optimizer.lr_schedule.decay_end : 500000
I0605 17:37:17.657092 139756486557120 train.py:171] train.learner.optimizer.lr_schedule.decay_start : 1
I0605 17:37:17.657137 139756486557120 train.py:171] train.learner.optimizer.lr_schedule.max : 1.0
I0605 17:37:17.657169 139756486557120 train.py:171] train.learner.optimizer.lr_schedule.min_ratio : 0.1
I0605 17:37:17.657199 139756486557120 train.py:171] train.learner.optimizer.lr_schedule.name : ''
I0605 17:37:17.657230 139756486557120 train.py:171] train.learner.optimizer.lr_schedule.warmup_steps : 0
I0605 17:37:17.657260 139756486557120 train.py:171] train.learner.optimizer.maybe_inf_to_nan : True
I0605 17:37:17.657290 139756486557120 train.py:171] train.learner.optimizer.name : ''
I0605 17:37:17.657320 139756486557120 train.py:171] train.learner.optimizer.sharded_adam : True
I0605 17:37:17.657351 139756486557120 train.py:171] train.learner.optimizer.skip_lp_1d_vectors : False
I0605 17:37:17.657380 139756486557120 train.py:171] train.learner.optimizer.weight_decay : 0.001
I0605 17:37:17.657411 139756486557120 train.py:171] train.learner.repeat_prefix_sep : '#'
I0605 17:37:17.657441 139756486557120 train.py:171] train.learner.skip_step_gradient_norm_value : 0.0
I0605 17:37:17.657471 139756486557120 train.py:171] train.learner.skip_zero_gradients : NoneType
I0605 17:37:17.657502 139756486557120 train.py:171] train.learner.stochastic_gradient : NoneType
I0605 17:37:17.657532 139756486557120 train.py:171] train.learner.var_norm_summary : True
I0605 17:37:17.657562 139756486557120 train.py:171] train.learner.vectorize_on_repeat_prefix : True
I0605 17:37:17.657592 139756486557120 train.py:171] train.log_train_output_interval_steps : NoneType
I0605 17:37:17.657623 139756486557120 train.py:171] train.max_inflight_steps : 2
I0605 17:37:17.657653 139756486557120 train.py:171] train.num_train_steps : 10000000.0
I0605 17:37:17.657683 139756486557120 train.py:171] train.profiler_capture_step : NoneType
I0605 17:37:17.657713 139756486557120 train.py:171] train.profiler_max_num_hosts : NoneType
I0605 17:37:17.657743 139756486557120 train.py:171] train.profiler_min_duration_sec : 1
I0605 17:37:17.657773 139756486557120 train.py:171] train.profiler_num_steps : 2
I0605 17:37:17.657804 139756486557120 train.py:171] train.random_seed : 1234
I0605 17:37:17.657834 139756486557120 train.py:171] train.restore_transformations : NoneType
I0605 17:37:17.657865 139756486557120 train.py:171] train.save_interval_steps : 100000
I0605 17:37:17.657895 139756486557120 train.py:171] train.save_keep_interval_duration : '12h'
I0605 17:37:17.657925 139756486557120 train.py:171] train.save_max_to_keep : 10
I0605 17:37:17.657955 139756486557120 train.py:171] train.summary_accumulate_interval_steps : NoneType
I0605 17:37:17.657985 139756486557120 train.py:171] train.summary_interval_steps : 100
I0605 17:37:17.658015 139756486557120 train.py:171] train.tensorstore_metadata_key : NoneType
I0605 17:37:17.658048 139756486557120 train.py:171] train.variable_norm_summary : True
I0605 17:37:17.658079 139756486557120 train.py:171] vn.cls : type/paxml.tasks_lib/SingleTask.VariationalNoise
I0605 17:37:17.658113 139756486557120 train.py:171] vn.vn_regex : ''
I0605 17:37:17.658157 139756486557120 train.py:171] vn.vn_scale : 0.0
I0605 17:37:17.658191 139756486557120 train.py:171] vn.vn_start_step : 0
I0605 17:37:17.658233 139756486557120 train.py:173] [PAX STATUS]: Initializing decoder
I0605 17:37:17.658369 139756486557120 checkpoint_creators.py:564] [PAX STATUS]: Creating checkpointer.
I0605 17:37:17.658570 139756486557120 py_utils.py:338] Starting sync_global_devices checkpointer:makedirs:log_NVIDIA1_3BPmap/checkpoints across 1 devices globally
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c0000d1b00 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:0}, signal={0x6060001682c0:1} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:1}, signal={0x6060001682c0:2} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000167a80 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000167960 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000167a80, semaphore=0x6060001682c0, value=2 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000167960, semaphore=0x6060001682c0, value=2 (OK)
W0605 17:37:17.731722 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.0006318092346191406 sec
W0605 17:37:17.732191 139756486557120 dispatch.py:272] Finished tracing + transforming _psum for pjit in 0.0015490055084228516 sec
W0605 17:37:17.733086 139756486557120 pxla.py:1882] Compiling _psum for with global shapes and types [ShapedArray(uint32[1])]. Argument mapping: (GSPMDSharding({maximal device=0}),).
W0605 17:37:17.738058 139756486557120 dispatch.py:272] Finished jaxpr to MLIR module conversion jit(_psum) in 0.004811763763427734 sec
W0605 17:37:17.966061 139756486557120 dispatch.py:272] Finished XLA compilation of jit(_psum) in 0.22601723670959473 sec
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000203720 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000203720, semaphore=0x6060001683e0, value=0 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=1, fence=0x60400081fad0 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x606000203720, from_fence=0x606000167a80 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000167960, semaphore=0x6060001683e0, value=1 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00038c290, f=0, wait_fence=0x606000203720 {0x6060001683e0:0, 0x6060001682c0:2}, signal_fence=0x60400081fad0 {0x6060001683e0:1} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000220ee0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000221300 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000220ee0, semaphore=0x6060001683e0, value=1 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000221300, semaphore=0x6060001683e0, value=1 (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc14a40 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:1}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c0000d1c80, wait={0x6060001682c0:2, 0x6060001683e0:1}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c0000d1c80, wait={0x6060001683e0:1}, signal={} (OK)
I0605 17:37:17.968034 139756486557120 py_utils.py:341] Finished sync_global_devices checkpointer:makedirs:log_NVIDIA1_3BPmap/checkpoints across 1 devices globally
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c0000a5a00 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:2}, signal={0x6060001682c0:3} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:3}, signal={0x6060001682c0:4} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060002cfe40 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060002cfcc0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060002cfe40, semaphore=0x6060001682c0, value=4 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060002cfcc0, semaphore=0x6060001682c0, value=4 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060002cec40 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060002cec40, semaphore=0x6060001683e0, value=1 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=2, fence=0x60400081f1d0 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060002cec40, from_fence=0x6060002cfe40 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060002cfcc0, semaphore=0x6060001683e0, value=2 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00038c290, f=0, wait_fence=0x6060002cec40 {0x6060001683e0:1, 0x6060001682c0:4}, signal_fence=0x60400081f1d0 {0x6060001683e0:2} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060002ceb80 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060002ceac0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060002ceb80, semaphore=0x6060001683e0, value=2 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060002ceac0, semaphore=0x6060001683e0, value=2 (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc14300 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:2}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c0000a5ac0, wait={0x6060001682c0:4, 0x6060001683e0:2}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c0000a5ac0, wait={0x6060001683e0:2}, signal={} (OK)
I0605 17:37:17.971108 139756486557120 utils.py:366] Cleaning up existing temporary directories at log_NVIDIA1_3BPmap/checkpoints.
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c00010e280 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:4}, signal={0x6060001682c0:5} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:5}, signal={0x6060001682c0:6} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060002cd9e0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060002cdb00 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060002cd9e0, semaphore=0x6060001682c0, value=6 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060002cdb00, semaphore=0x6060001682c0, value=6 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060002d3d40 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060002d3d40, semaphore=0x6060001683e0, value=2 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=3, fence=0x60400081e990 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060002d3d40, from_fence=0x6060002cd9e0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060002cdb00, semaphore=0x6060001683e0, value=3 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00038c290, f=0, wait_fence=0x6060002d3d40 {0x6060001683e0:2, 0x6060001682c0:6}, signal_fence=0x60400081e990 {0x6060001683e0:3} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060002d3ec0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060002d3f20 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060002d3ec0, semaphore=0x6060001683e0, value=3 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060002d3f20, semaphore=0x6060001683e0, value=3 (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc13f60 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:3}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c00010df80, wait={0x6060001682c0:6, 0x6060001683e0:3}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c00010df80, wait={0x6060001683e0:3}, signal={} (OK)
I0605 17:37:17.973646 139756486557120 train.py:206] [PAX STATUS]: Creating task
I0605 17:37:18.160361 139756486557120 train.py:217] [PAX STATUS]: Initializing partitioner
I0605 17:37:18.160518 139756486557120 partitioning.py:576] Using pmap for data parallelism.
I0605 17:37:18.160575 139756486557120 train.py:245] [PAX STATUS]: Creating executor.
I0605 17:37:18.160630 139756486557120 train.py:249] [PAX STATUS]: Setting up executor.
W0605 17:37:18.164295 139756486557120 dispatch.py:272] Finished tracing + transforming jit(convert_element_type) in 0.0002434253692626953 sec
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c00010d740 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:6}, signal={0x6060001682c0:7} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:7}, signal={0x6060001682c0:8} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060002d30e0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060002d3080 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060002d30e0, semaphore=0x6060001682c0, value=8 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060002d3080, semaphore=0x6060001682c0, value=8 (OK)
W0605 17:37:18.166568 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.00041937828063964844 sec
W0605 17:37:18.167441 139756486557120 dispatch.py:272] Finished tracing + transforming _threefry_seed for pjit in 0.0018720626831054688 sec
W0605 17:37:18.168067 139756486557120 pxla.py:1882] Compiling _threefry_seed for with global shapes and types [ShapedArray(int32[])]. Argument mapping: (GSPMDSharding({replicated}),).
W0605 17:37:18.172546 139756486557120 dispatch.py:272] Finished jaxpr to MLIR module conversion jit(_threefry_seed) in 0.004347562789916992 sec
W0605 17:37:18.443973 139756486557120 dispatch.py:272] Finished XLA compilation of jit(_threefry_seed) in 0.2711031436920166 sec
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060000a70c0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060000a70c0, semaphore=0x6060001683e0, value=3 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=4, fence=0x60400055c890 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060000a70c0, from_fence=0x6060002d30e0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060002d3080, semaphore=0x6060001683e0, value=4 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00053f590, f=0, wait_fence=0x6060000a70c0 {0x6060001683e0:3, 0x6060001682c0:8}, signal_fence=0x60400055c890 {0x6060001683e0:4} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060000a6f40 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060000a6ee0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060000a6f40, semaphore=0x6060001683e0, value=4 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060000a6ee0, semaphore=0x6060001683e0, value=4 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c00010d500, wait={0x6060001682c0:8, 0x6060001683e0:4}, signal={} (OK)
I0605 17:37:18.445732 139756486557120 partitioning.py:420] input_p.tf_data_service_address: None
I0605 17:37:18.445963 139756486557120 executors.py:163] [PAX STATUS]: Instantiating train input pipeline.
I0605 17:37:18.449376 139756486557120 executors.py:222] [PAX STATUS]: Setting up partitioner
I0605 17:37:18.449437 139756486557120 partitioning.py:353] [PAX STATUS]: Getting input shapes from first batch.
I0605 17:37:19.157606 139756486557120 local.py:50] Created artifact Input specs of type ArtifactType.FILE and value log_NVIDIA1_3BPmap/input_specs.json.
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000164ec0 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:8}, signal={0x6060001682c0:9} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:9}, signal={0x6060001682c0:10} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600040c5e0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600040c640 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600040c5e0, semaphore=0x6060001682c0, value=10 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600040c640, semaphore=0x6060001682c0, value=10 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600040c700 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600040c700, semaphore=0x6060001683e0, value=4 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=5, fence=0x6040002c8ad0 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600040c700, from_fence=0x60600040c5e0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600040c640, semaphore=0x6060001683e0, value=5 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00053f590, f=0, wait_fence=0x60600040c700 {0x6060001683e0:4, 0x6060001682c0:10}, signal_fence=0x6040002c8ad0 {0x6060001683e0:5} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600040c820 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600040c880 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600040c820, semaphore=0x6060001683e0, value=5 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600040c880, semaphore=0x6060001683e0, value=5 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000164f80, wait={0x6060001682c0:10, 0x6060001683e0:5}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000165280, wait={0x6060001683e0:5}, signal={} (OK)
W0605 17:37:19.679517 139756486557120 optimizers.py:1170] DEPRECATION WARNING: p.weight_decay will be deprecated. In future, we will do a migration to remove p.weight_decay and after that, setting it will throw an exception. In future, we will use p.l2_regularizer_weight for coupled weight decay (i.e., weight decays that affect optimizer slots), and use p.decoupled_weight_decay for decoupled weight decay (i.e., weight decays that are added only to the final update).
I0605 17:37:19.679631 139756486557120 optimizers.py:1173] Using sharded_adam.
W0605 17:37:19.679672 139756486557120 optimizers.py:580] DEPRECATION WARNING: p.weight_decay will be deprecated. In future, we will do a migration to remove p.weight_decay and after that, setting it will throw an exception. In future, we will use p.l2_regularizer_weight for coupled weight decay (i.e., weight decays that affect optimizer slots), and use p.decoupled_weight_decay for decoupled weight decay (i.e., weight decays that are added only to the final update).
W0605 17:37:19.693978 139756486557120 optimizers.py:1170] DEPRECATION WARNING: p.weight_decay will be deprecated. In future, we will do a migration to remove p.weight_decay and after that, setting it will throw an exception. In future, we will use p.l2_regularizer_weight for coupled weight decay (i.e., weight decays that affect optimizer slots), and use p.decoupled_weight_decay for decoupled weight decay (i.e., weight decays that are added only to the final update).
I0605 17:37:19.694039 139756486557120 optimizers.py:1173] Using sharded_adam.
W0605 17:37:19.694076 139756486557120 optimizers.py:580] DEPRECATION WARNING: p.weight_decay will be deprecated. In future, we will do a migration to remove p.weight_decay and after that, setting it will throw an exception. In future, we will use p.l2_regularizer_weight for coupled weight decay (i.e., weight decays that affect optimizer slots), and use p.decoupled_weight_decay for decoupled weight decay (i.e., weight decays that are added only to the final update).
I0605 17:37:19.708143 139756486557120 trainer_lib.py:197] post_init_model_params: log_NVIDIA1_3BPmap/post_init_model_params.txt
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000177c40 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:10}, signal={0x6060001682c0:11} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:11}, signal={0x6060001682c0:12} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600046c340 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600046c3a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600046c340, semaphore=0x6060001682c0, value=12 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600046c3a0, semaphore=0x6060001682c0, value=12 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600046c460 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600046c460, semaphore=0x6060001683e0, value=5 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=6, fence=0x6040002b6590 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600046c460, from_fence=0x60600046c340 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600046c3a0, semaphore=0x6060001683e0, value=6 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00053f590, f=0, wait_fence=0x60600046c460 {0x6060001683e0:5, 0x6060001682c0:12}, signal_fence=0x6040002b6590 {0x6060001683e0:6} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600046c580 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600046c5e0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600046c580, semaphore=0x6060001683e0, value=6 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600046c5e0, semaphore=0x6060001683e0, value=6 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000177d00, wait={0x6060001682c0:12, 0x6060001683e0:6}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000178000, wait={0x6060001683e0:6}, signal={} (OK)
W0605 17:37:19.902000 139756486557120 dispatch.py:272] Finished tracing + transforming ravel for pjit in 0.00021767616271972656 sec
W0605 17:37:19.902926 139756486557120 dispatch.py:272] Finished tracing + transforming threefry_2x32 for pjit in 0.0017635822296142578 sec
W0605 17:37:19.904009 139756486557120 dispatch.py:272] Finished tracing + transforming _threefry_split_original for pjit in 0.0032732486724853516 sec
W0605 17:37:19.904708 139756486557120 pxla.py:1882] Compiling _threefry_split_original for with global shapes and types [ShapedArray(uint32[2])]. Argument mapping: (GSPMDSharding({replicated}),).
W0605 17:37:19.909348 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0005061626434326172 sec
W0605 17:37:19.910399 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003032684326171875 sec
W0605 17:37:19.911373 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00029921531677246094 sec
W0605 17:37:19.912310 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0003139972686767578 sec
W0605 17:37:19.912992 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0002911090850830078 sec
W0605 17:37:19.959887 139756486557120 dispatch.py:272] Finished jaxpr to MLIR module conversion jit(_threefry_split_original) in 0.05504131317138672 sec
W0605 17:37:21.409307 139756486557120 dispatch.py:272] Finished XLA compilation of jit(_threefry_split_original) in 1.4490509033203125 sec
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000258fe0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000258fe0, semaphore=0x6060001683e0, value=6 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=7, fence=0x604000e5abd0 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x606000258fe0, from_fence=0x6060000a6f40 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060000a6ee0, semaphore=0x6060001683e0, value=7 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00047e130, f=0, wait_fence=0x606000258fe0 {0x6060001683e0:6}, signal_fence=0x604000e5abd0 {0x6060001683e0:7} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000258ec0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060002590a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000258ec0, semaphore=0x6060001683e0, value=7 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060002590a0, semaphore=0x6060001683e0, value=7 (OK)
W0605 17:37:21.413856 139756486557120 dispatch.py:272] Finished tracing + transforming _unstack for pjit in 0.0012726783752441406 sec
W0605 17:37:21.415073 139756486557120 pxla.py:1882] Compiling _unstack for with global shapes and types [ShapedArray(uint32[2,2])]. Argument mapping: (GSPMDSharding({replicated}),).
W0605 17:37:21.421748 139756486557120 dispatch.py:272] Finished jaxpr to MLIR module conversion jit(_unstack) in 0.0064165592193603516 sec
W0605 17:37:21.630789 139756486557120 dispatch.py:272] Finished XLA compilation of jit(_unstack) in 0.20858287811279297 sec
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000328d00 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000328d00, semaphore=0x6060001683e0, value=7 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=8, fence=0x604000cef690 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x606000328d00, from_fence=0x606000258ec0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060002590a0, semaphore=0x6060001683e0, value=8 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b000314e10, f=0, wait_fence=0x606000328d00 {0x6060001683e0:7}, signal_fence=0x604000cef690 {0x6060001683e0:8} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000328ee0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060003292a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000328ee0, semaphore=0x6060001683e0, value=8 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060003292a0, semaphore=0x6060001683e0, value=8 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000328e20 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000329300 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000328e20, semaphore=0x6060001683e0, value=8 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000329300, semaphore=0x6060001683e0, value=8 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c00035a100, wait={0x6060001683e0:8}, signal={} (OK)
W0605 17:37:21.633492 139756486557120 dispatch.py:272] Finished tracing + transforming ravel for pjit in 0.00022339820861816406 sec
W0605 17:37:21.634345 139756486557120 dispatch.py:272] Finished tracing + transforming threefry_2x32 for pjit in 0.001689910888671875 sec
W0605 17:37:21.635529 139756486557120 dispatch.py:272] Finished tracing + transforming _threefry_split_original for pjit in 0.00327301025390625 sec
W0605 17:37:21.636210 139756486557120 pxla.py:1882] Compiling _threefry_split_original for with global shapes and types [ShapedArray(uint32[2])]. Argument mapping: (GSPMDSharding({replicated}),).
W0605 17:37:21.641349 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.000331878662109375 sec
W0605 17:37:21.642361 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003070831298828125 sec
W0605 17:37:21.643281 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0003108978271484375 sec
W0605 17:37:21.644005 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.00032782554626464844 sec
W0605 17:37:21.691162 139756486557120 dispatch.py:272] Finished jaxpr to MLIR module conversion jit(_threefry_split_original) in 0.054818153381347656 sec
W0605 17:37:23.174265 139756486557120 dispatch.py:272] Finished XLA compilation of jit(_threefry_split_original) in 1.4827439785003662 sec
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060003a72a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060003a72a0, semaphore=0x6060001683e0, value=8 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=9, fence=0x6040008b7650 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060003a72a0, from_fence=0x606000328e20 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000329300, semaphore=0x6060001683e0, value=9 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0000ec8f0, f=0, wait_fence=0x6060003a72a0 {0x6060001683e0:8}, signal_fence=0x6040008b7650 {0x6060001683e0:9} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060003a6940 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060003a7900 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060003a6940, semaphore=0x6060001683e0, value=9 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060003a7900, semaphore=0x6060001683e0, value=9 (OK)
W0605 17:37:23.179615 139756486557120 dispatch.py:272] Finished tracing + transforming _unstack for pjit in 0.0019333362579345703 sec
W0605 17:37:23.181048 139756486557120 pxla.py:1882] Compiling _unstack for with global shapes and types [ShapedArray(uint32[4,2])]. Argument mapping: (GSPMDSharding({replicated}),).
W0605 17:37:23.186113 139756486557120 dispatch.py:272] Finished jaxpr to MLIR module conversion jit(_unstack) in 0.004763603210449219 sec
W0605 17:37:23.414857 139756486557120 dispatch.py:272] Finished XLA compilation of jit(_unstack) in 0.22841119766235352 sec
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060005df200 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060005df200, semaphore=0x6060001683e0, value=9 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=10, fence=0x60400025bf90 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060005df200, from_fence=0x6060003a6940 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060003a7900, semaphore=0x6060001683e0, value=10 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0002d79b0, f=0, wait_fence=0x6060005df200 {0x6060001683e0:9}, signal_fence=0x60400025bf90 {0x6060001683e0:10} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000193d60 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060001943c0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000193d60, semaphore=0x6060001683e0, value=10 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060001943c0, semaphore=0x6060001683e0, value=10 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000195620 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000193460 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000195620, semaphore=0x6060001683e0, value=10 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000193460, semaphore=0x6060001683e0, value=10 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060001933a0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000193340 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060001933a0, semaphore=0x6060001683e0, value=10 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000193340, semaphore=0x6060001683e0, value=10 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000193b20 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000193c40 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000193b20, semaphore=0x6060001683e0, value=10 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000193c40, semaphore=0x6060001683e0, value=10 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c0002e7bc0, wait={0x6060001683e0:10}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc13ee0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:10}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc13ee0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:10}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc13ee0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:10}, signal={} (OK)
I0605 17:37:23.416016 139756486557120 trainer_lib.py:378] init_var prng_seed: {'params': Array([1477712937, 1244108694], dtype=uint32), 'random': Array([713085529, 937672790], dtype=uint32), 'dropout': Array([3893856254, 2733895282], dtype=uint32)}
I0605 17:37:23.417782 139756486557120 trainer_lib.py:379] var_weight_hparams: {'params': {'lm': {'final_ln': {'bias': WeightHParams(shape=[2048], init=WeightInit(method='constant', scale=0.0), dtype=<class 'jax.numpy.float32'>, collections=['__lingvo_jax_skip_regularization'], mesh_shape=None, tensor_split_dims_mapping=None, repeat_prefix=None, repeat_prefix_split_dims_mapping=None, repeat_optimizer_dims_mapping=None, fan_in_axes=None, fan_out_axes=None), 'scale': WeightHParams(shape=[2048], init=WeightInit(method='constant', scale=0.0), dtype=<class 'jax.numpy.float32'>, collections=['__lingvo_jax_skip_regularization'], mesh_shape=None, tensor_split_dims_mapping=None, repeat_prefix=None, repeat_prefix_split_dims_mapping=None, repeat_optimizer_dims_mapping=None, fan_in_axes=None, fan_out_axes=None)}, 'position_emb': {'emb_var': WeightHParams(shape=[2048, 2048], init=WeightInit(method='gaussian', scale=0.023), dtype=<class 'jax.numpy.float32'>, collections=[], mesh_shape=None, tensor_split_dims_mapping=None, repeat_prefix=None, repeat_prefix_split_dims_mapping=None, repeat_optimizer_dims_mapping=None, fan_in_axes=None, fan_out_axes=None)}, 'softmax': {'logits_ffn': {'bias': {'b': WeightHParams(shape=[51200], init=WeightInit(method='constant', scale=0.0), dtype=<class 'jax.numpy.float32'>, collections=[], mesh_shape=None, tensor_split_dims_mapping=None, repeat_prefix=None, repeat_prefix_split_dims_mapping=None, repeat_optimizer_dims_mapping=None, fan_in_axes=None, fan_out_axes=None)}, 'linear': {'w': WeightHParams(shape=[2048, 51200], init=WeightInit(method='gaussian', scale=0.022097086912079608), dtype=<class 'jax.numpy.float32'>, collections=[], mesh_shape=None, tensor_split_dims_mapping=None, repeat_prefix=None, repeat_prefix_split_dims_mapping=None, repeat_optimizer_dims_mapping=None, fan_in_axes=None, fan_out_axes=None)}}}, 'transformer': {'repeat': {'sub': {'x_layers_0': {'ff_layer': {'ffn_layer1': {'bias': {'b': WeightHParams(shape=[8192], init=WeightInit(method='constant', scale=0.0), dtype=<class 'jax.numpy.float32'>, collections=[], mesh_shape=None, tensor_split_dims_mapping=None, repeat_prefix=[24], repeat_prefix_split_dims_mapping=(-1,), repeat_optimizer_dims_mapping=(-1,), fan_in_axes=None, fan_out_axes=None)}, 'linear': {'w': WeightHParams(shape=[2048, 8192], init=WeightInit(method='gaussian', scale=0.023), dtype=<class 'jax.numpy.float32'>, collections=[], mesh_shape=None, tensor_split_dims_mapping=None, repeat_prefix=[24], repeat_prefix_split_dims_mapping=(-1,), repeat_optimizer_dims_mapping=(-1,), fan_in_axes=None, fan_out_axes=None)}}, 'ffn_layer2': {'bias': {'b': WeightHParams(shape=[2048], init=WeightInit(method='constant', scale=0.0), dtype=<class 'jax.numpy.float32'>, collections=[], mesh_shape=None, tensor_split_dims_mapping=None, repeat_prefix=[24], repeat_prefix_split_dims_mapping=(-1,), repeat_optimizer_dims_mapping=(-1,), fan_in_axes=None, fan_out_axes=None)}, 'linear': {'w': WeightHParams(shape=[8192, 2048], init=WeightInit(method='gaussian', scale=0.023), dtype=<class 'jax.numpy.float32'>, collections=[], mesh_shape=None, tensor_split_dims_mapping=None, repeat_prefix=[24], repeat_prefix_split_dims_mapping=(-1,), repeat_optimizer_dims_mapping=(-1,), fan_in_axes=None, fan_out_axes=None)}}, 'layer_norm': {'bias': WeightHParams(shape=[2048], init=WeightInit(method='constant', scale=0.0), dtype=<class 'jax.numpy.float32'>, collections=['__lingvo_jax_skip_regularization'], mesh_shape=None, tensor_split_dims_mapping=None, repeat_prefix=[24], repeat_prefix_split_dims_mapping=(-1,), repeat_optimizer_dims_mapping=(-1,), fan_in_axes=None, fan_out_axes=None), 'scale': WeightHParams(shape=[2048], init=WeightInit(method='constant', scale=0.0), dtype=<class 'jax.numpy.float32'>, collections=['__lingvo_jax_skip_regularization'], mesh_shape=None, tensor_split_dims_mapping=None, repeat_prefix=[24], repeat_prefix_split_dims_mapping=(-1,), repeat_optimizer_dims_mapping=(-1,), fan_in_axes=None, fan_out_axes=None)}}, 'layer_norm': {'bias': WeightHParams(shape=[2048], init=WeightInit(method='constant', scale=0.0), dtype=<class 'jax.numpy.float32'>, collections=['__lingvo_jax_skip_regularization'], mesh_shape=None, tensor_split_dims_mapping=None, repeat_prefix=[24], repeat_prefix_split_dims_mapping=(-1,), repeat_optimizer_dims_mapping=(-1,), fan_in_axes=None, fan_out_axes=None), 'scale': WeightHParams(shape=[2048], init=WeightInit(method='constant', scale=0.0), dtype=<class 'jax.numpy.float32'>, collections=['__lingvo_jax_skip_regularization'], mesh_shape=None, tensor_split_dims_mapping=None, repeat_prefix=[24], repeat_prefix_split_dims_mapping=(-1,), repeat_optimizer_dims_mapping=(-1,), fan_in_axes=None, fan_out_axes=None)}, 'self_attention': {'combined_qkv': {'w': WeightHParams(shape=[3, 2048, 32, 64], init=WeightInit(method='gaussian', scale=0.023), dtype=<class 'jax.numpy.float32'>, collections=[], mesh_shape=None, tensor_split_dims_mapping=None, repeat_prefix=[24], repeat_prefix_split_dims_mapping=(-1,), repeat_optimizer_dims_mapping=(-1,), fan_in_axes=None, fan_out_axes=None)}, 'per_dim_scale': {'per_dim_scale': WeightHParams(shape=[64], init=WeightInit(method='constant', scale=0.0), dtype=<class 'jax.numpy.float32'>, collections=[], mesh_shape=None, tensor_split_dims_mapping=None, repeat_prefix=[24], repeat_prefix_split_dims_mapping=(-1,), repeat_optimizer_dims_mapping=(-1,), fan_in_axes=None, fan_out_axes=None)}, 'post': {'w': WeightHParams(shape=[2048, 32, 64], init=WeightInit(method='gaussian', scale=0.023), dtype=<class 'jax.numpy.float32'>, collections=[], mesh_shape=None, tensor_split_dims_mapping=None, repeat_prefix=[24], repeat_prefix_split_dims_mapping=(-1,), repeat_optimizer_dims_mapping=(-1,), fan_in_axes=None, fan_out_axes=None)}}}}}}}}}
I0605 17:37:23.466831 139756486557120 base_layer.py:632] Creating var /lm/softmax/logits_ffn/linear/w with shape=[2048, 51200], dtype=<class 'jax.numpy.float32'>, init method=gaussian and scale=0.022097086912079608
I0605 17:37:23.472266 139756486557120 base_layer.py:632] Creating var /lm/position_emb/emb_var with shape=[2048, 2048], dtype=<class 'jax.numpy.float32'>, init method=gaussian and scale=0.023
I0605 17:37:23.561209 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/layer_norm/scale with shape=[2048], dtype=<class 'jax.numpy.float32'>, init method=constant and scale=0.0
I0605 17:37:23.562514 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/layer_norm/bias with shape=[2048], dtype=<class 'jax.numpy.float32'>, init method=constant and scale=0.0
I0605 17:37:23.578722 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/self_attention/combined_qkv/w with shape=[3, 2048, 32, 64], dtype=<class 'jax.numpy.float32'>, init method=gaussian and scale=0.023
I0605 17:37:23.583390 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/self_attention/per_dim_scale/per_dim_scale with shape=[64], dtype=<class 'jax.numpy.float32'>, init method=constant and scale=0.0
I0605 17:37:23.593972 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/self_attention/post/w with shape=[2048, 32, 64], dtype=<class 'jax.numpy.float32'>, init method=gaussian and scale=0.023
I0605 17:37:23.618214 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/ff_layer/layer_norm/scale with shape=[2048], dtype=<class 'jax.numpy.float32'>, init method=constant and scale=0.0
I0605 17:37:23.619433 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/ff_layer/layer_norm/bias with shape=[2048], dtype=<class 'jax.numpy.float32'>, init method=constant and scale=0.0
I0605 17:37:23.631588 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/ff_layer/ffn_layer1/linear/w with shape=[2048, 8192], dtype=<class 'jax.numpy.float32'>, init method=gaussian and scale=0.023
I0605 17:37:23.635301 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/ff_layer/ffn_layer1/bias/b with shape=[8192], dtype=<class 'jax.numpy.float32'>, init method=constant and scale=0.0
I0605 17:37:23.647808 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/ff_layer/ffn_layer2/linear/w with shape=[8192, 2048], dtype=<class 'jax.numpy.float32'>, init method=gaussian and scale=0.023
I0605 17:37:23.651491 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/ff_layer/ffn_layer2/bias/b with shape=[2048], dtype=<class 'jax.numpy.float32'>, init method=constant and scale=0.0
I0605 17:37:23.706857 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/layer_norm/scale with shape=[2048], dtype=<class 'jax.numpy.float32'>, init method=constant and scale=0.0
I0605 17:37:23.708141 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/layer_norm/bias with shape=[2048], dtype=<class 'jax.numpy.float32'>, init method=constant and scale=0.0
I0605 17:37:23.731529 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/self_attention/combined_qkv/w with shape=[3, 2048, 32, 64], dtype=<class 'jax.numpy.float32'>, init method=gaussian and scale=0.023
I0605 17:37:23.736371 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/self_attention/per_dim_scale/per_dim_scale with shape=[64], dtype=<class 'jax.numpy.float32'>, init method=constant and scale=0.0
I0605 17:37:23.746949 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/self_attention/post/w with shape=[2048, 32, 64], dtype=<class 'jax.numpy.float32'>, init method=gaussian and scale=0.023
I0605 17:37:23.771519 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/ff_layer/layer_norm/scale with shape=[2048], dtype=<class 'jax.numpy.float32'>, init method=constant and scale=0.0
I0605 17:37:23.772733 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/ff_layer/layer_norm/bias with shape=[2048], dtype=<class 'jax.numpy.float32'>, init method=constant and scale=0.0
I0605 17:37:23.785490 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/ff_layer/ffn_layer1/linear/w with shape=[2048, 8192], dtype=<class 'jax.numpy.float32'>, init method=gaussian and scale=0.023
I0605 17:37:23.789337 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/ff_layer/ffn_layer1/bias/b with shape=[8192], dtype=<class 'jax.numpy.float32'>, init method=constant and scale=0.0
I0605 17:37:23.803040 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/ff_layer/ffn_layer2/linear/w with shape=[8192, 2048], dtype=<class 'jax.numpy.float32'>, init method=gaussian and scale=0.023
I0605 17:37:23.806895 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/ff_layer/ffn_layer2/bias/b with shape=[2048], dtype=<class 'jax.numpy.float32'>, init method=constant and scale=0.0
I0605 17:37:23.824109 139756486557120 base_layer.py:632] Creating var /lm/final_ln/scale with shape=[2048], dtype=<class 'jax.numpy.float32'>, init method=constant and scale=0.0
I0605 17:37:23.825375 139756486557120 base_layer.py:632] Creating var /lm/final_ln/bias with shape=[2048], dtype=<class 'jax.numpy.float32'>, init method=constant and scale=0.0
I0605 17:37:23.831791 139756486557120 base_layer.py:632] Creating var /lm/softmax/logits_ffn/bias/b with shape=[51200], dtype=<class 'jax.numpy.float32'>, init method=constant and scale=0.0
W0605 17:37:23.844305 139756486557120 dispatch.py:272] Finished tracing + transforming init_fn for pjit in 0.42580294609069824 sec
W0605 17:37:23.849604 139756486557120 pxla.py:1882] Compiling init_fn for with global shapes and types [ShapedArray(uint32[2])]. Argument mapping: (GSPMDSharding({replicated}),).
W0605 17:37:23.854750 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0003132820129394531 sec
W0605 17:37:23.855348 139756486557120 dispatch.py:272] Finished tracing + transforming _threefry_seed for pjit in 0.0013680458068847656 sec
W0605 17:37:23.856750 139756486557120 dispatch.py:272] Finished tracing + transforming ravel for pjit in 0.0001819133758544922 sec
W0605 17:37:23.857589 139756486557120 dispatch.py:272] Finished tracing + transforming threefry_2x32 for pjit in 0.0015332698822021484 sec
W0605 17:37:23.858543 139756486557120 dispatch.py:272] Finished tracing + transforming _threefry_fold_in for pjit in 0.004767656326293945 sec
W0605 17:37:23.862845 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00031375885009765625 sec
W0605 17:37:23.863823 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003020763397216797 sec
W0605 17:37:23.864712 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.000301361083984375 sec
W0605 17:37:23.865507 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0004038810729980469 sec
W0605 17:37:23.915956 139756486557120 dispatch.py:272] Finished tracing + transforming ravel for pjit in 0.00019168853759765625 sec
W0605 17:37:23.916870 139756486557120 dispatch.py:272] Finished tracing + transforming threefry_2x32 for pjit in 0.0016338825225830078 sec
W0605 17:37:23.917910 139756486557120 dispatch.py:272] Finished tracing + transforming _threefry_random_bits_original for pjit in 0.0029790401458740234 sec
W0605 17:37:23.921130 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00031447410583496094 sec
W0605 17:37:23.922111 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003139972686767578 sec
W0605 17:37:23.923000 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0002968311309814453 sec
W0605 17:37:23.923702 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0003113746643066406 sec
W0605 17:37:24.028707 139756486557120 dispatch.py:272] Finished tracing + transforming ravel for pjit in 0.00019669532775878906 sec
W0605 17:37:24.029680 139756486557120 dispatch.py:272] Finished tracing + transforming threefry_2x32 for pjit in 0.001708984375 sec
W0605 17:37:24.030701 139756486557120 dispatch.py:272] Finished tracing + transforming _threefry_random_bits_original for pjit in 0.0030384063720703125 sec
W0605 17:37:24.033909 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003120899200439453 sec
W0605 17:37:24.034902 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00031876564025878906 sec
W0605 17:37:24.035800 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.00029659271240234375 sec
W0605 17:37:24.036511 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0003082752227783203 sec
W0605 17:37:24.088186 139756486557120 dispatch.py:272] Finished tracing + transforming ravel for pjit in 0.00018739700317382812 sec
W0605 17:37:24.088986 139756486557120 dispatch.py:272] Finished tracing + transforming threefry_2x32 for pjit in 0.001508474349975586 sec
W0605 17:37:24.089989 139756486557120 dispatch.py:272] Finished tracing + transforming _threefry_split_original for pjit in 0.002793550491333008 sec
W0605 17:37:24.093266 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0004112720489501953 sec
W0605 17:37:24.094230 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003039836883544922 sec
W0605 17:37:24.095107 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.00029397010803222656 sec
W0605 17:37:24.095797 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.00030875205993652344 sec
W0605 17:37:24.143922 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00033354759216308594 sec
W0605 17:37:24.145487 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003466606140136719 sec
W0605 17:37:24.157827 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00036215782165527344 sec
W0605 17:37:24.227214 139756486557120 dispatch.py:272] Finished tracing + transforming ravel for pjit in 0.00019693374633789062 sec
W0605 17:37:24.228162 139756486557120 dispatch.py:272] Finished tracing + transforming threefry_2x32 for pjit in 0.0016870498657226562 sec
W0605 17:37:24.229197 139756486557120 dispatch.py:272] Finished tracing + transforming _threefry_random_bits_original for pjit in 0.0030333995819091797 sec
W0605 17:37:24.232452 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.000308990478515625 sec
W0605 17:37:24.233449 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00033164024353027344 sec
W0605 17:37:24.234340 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.00029277801513671875 sec
W0605 17:37:24.235043 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0003159046173095703 sec
W0605 17:37:24.341019 139756486557120 dispatch.py:272] Finished tracing + transforming _threefry_random_bits_original for pjit in 0.0011565685272216797 sec
W0605 17:37:24.451697 139756486557120 dispatch.py:272] Finished tracing + transforming ravel for pjit in 0.00019693374633789062 sec
W0605 17:37:24.452555 139756486557120 dispatch.py:272] Finished tracing + transforming threefry_2x32 for pjit in 0.0015981197357177734 sec
W0605 17:37:24.453699 139756486557120 dispatch.py:272] Finished tracing + transforming _threefry_random_bits_original for pjit in 0.003048419952392578 sec
W0605 17:37:24.456939 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003116130828857422 sec
W0605 17:37:24.457936 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00031113624572753906 sec
W0605 17:37:24.458815 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0002963542938232422 sec
W0605 17:37:24.459515 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0003075599670410156 sec
W0605 17:37:24.566185 139756486557120 dispatch.py:272] Finished tracing + transforming _threefry_random_bits_original for pjit in 0.00116729736328125 sec
W0605 17:37:24.644937 139756486557120 dispatch.py:272] Finished jaxpr to MLIR module conversion jit(init_fn) in 0.7951579093933105 sec
W0605 17:37:39.883617 139756486557120 dispatch.py:272] Finished XLA compilation of jit(init_fn) in 15.23807430267334 sec
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060005ac380 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060005ac380, semaphore=0x6060001683e0, value=10 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=11, fence=0x604000ef0a10 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060005ac380, from_fence=0x606000195620 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000193460, semaphore=0x6060001683e0, value=11 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0007f6d70, f=0, wait_fence=0x6060005ac380 {0x6060001683e0:10}, signal_fence=0x604000ef0a10 {0x6060001683e0:11} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600079a8a0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600079a900 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600079a8a0, semaphore=0x6060001683e0, value=11 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600079a900, semaphore=0x6060001683e0, value=11 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600079a960 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600079a9c0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600079a960, semaphore=0x6060001683e0, value=11 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600079a9c0, semaphore=0x6060001683e0, value=11 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600079aa20 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600075fbc0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600079aa20, semaphore=0x6060001683e0, value=11 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600075fbc0, semaphore=0x6060001683e0, value=11 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000681620 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060006b00c0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000681620, semaphore=0x6060001683e0, value=11 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006b00c0, semaphore=0x6060001683e0, value=11 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060006f03e0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060006811a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006f03e0, semaphore=0x6060001683e0, value=11 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006811a0, semaphore=0x6060001683e0, value=11 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060002b7720 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000681ec0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060002b7720, semaphore=0x6060001683e0, value=11 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000681ec0, semaphore=0x6060001683e0, value=11 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000680de0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000681e60 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000680de0, semaphore=0x6060001683e0, value=11 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000681e60, semaphore=0x6060001683e0, value=11 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060006b0600 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000682940 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006b0600, semaphore=0x6060001683e0, value=11 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000682940, semaphore=0x6060001683e0, value=11 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060006b2040 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060006b25e0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006b2040, semaphore=0x6060001683e0, value=11 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006b25e0, semaphore=0x6060001683e0, value=11 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000681b00 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060006b0540 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000681b00, semaphore=0x6060001683e0, value=11 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006b0540, semaphore=0x6060001683e0, value=11 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060006b2100 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060006824c0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006b2100, semaphore=0x6060001683e0, value=11 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006824c0, semaphore=0x6060001683e0, value=11 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060002b77e0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060006b1c80 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060002b77e0, semaphore=0x6060001683e0, value=11 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006b1c80, semaphore=0x6060001683e0, value=11 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060006b1440 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060006b0180 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006b1440, semaphore=0x6060001683e0, value=11 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006b0180, semaphore=0x6060001683e0, value=11 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060006b1860 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000681680 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006b1860, semaphore=0x6060001683e0, value=11 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000681680, semaphore=0x6060001683e0, value=11 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000682ca0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060006b2ac0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000682ca0, semaphore=0x6060001683e0, value=11 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006b2ac0, semaphore=0x6060001683e0, value=11 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000681a40 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060006b0f00 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000681a40, semaphore=0x6060001683e0, value=11 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006b0f00, semaphore=0x6060001683e0, value=11 (OK)
I0605 17:37:40.306165 139756486557120 trainer_lib.py:398] initial_vars: {'params': {'lm': {'final_ln': {'bias': (2048,), 'scale': (2048,)}, 'position_emb': {'emb_var': (2048, 2048)}, 'softmax': {'logits_ffn': {'bias': {'b': (51200,)}, 'linear': {'w': (2048, 51200)}}}, 'transformer': {'repeat': {'sub': {'x_layers_0': {'ff_layer': {'ffn_layer1': {'bias': {'b': (24, 8192)}, 'linear': {'w': (24, 2048, 8192)}}, 'ffn_layer2': {'bias': {'b': (24, 2048)}, 'linear': {'w': (24, 8192, 2048)}}, 'layer_norm': {'bias': (24, 2048), 'scale': (24, 2048)}}, 'layer_norm': {'bias': (24, 2048), 'scale': (24, 2048)}, 'self_attention': {'combined_qkv': {'w': (24, 3, 2048, 32, 64)}, 'per_dim_scale': {'per_dim_scale': (24, 64)}, 'post': {'w': (24, 2048, 32, 64)}}}}}}}}}
W0605 17:37:40.307132 139756486557120 optimizers.py:1170] DEPRECATION WARNING: p.weight_decay will be deprecated. In future, we will do a migration to remove p.weight_decay and after that, setting it will throw an exception. In future, we will use p.l2_regularizer_weight for coupled weight decay (i.e., weight decays that affect optimizer slots), and use p.decoupled_weight_decay for decoupled weight decay (i.e., weight decays that are added only to the final update).
I0605 17:37:40.307184 139756486557120 optimizers.py:1173] Using sharded_adam.
W0605 17:37:40.307222 139756486557120 optimizers.py:580] DEPRECATION WARNING: p.weight_decay will be deprecated. In future, we will do a migration to remove p.weight_decay and after that, setting it will throw an exception. In future, we will use p.l2_regularizer_weight for coupled weight decay (i.e., weight decays that affect optimizer slots), and use p.decoupled_weight_decay for decoupled weight decay (i.e., weight decays that are added only to the final update).
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c0002adb40 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:12}, signal={0x6060001682c0:13} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:13}, signal={0x6060001682c0:14} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000bbe920 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600081a4c0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000bbe920, semaphore=0x6060001682c0, value=14 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600081a4c0, semaphore=0x6060001682c0, value=14 (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c0002b81c0 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:14}, signal={0x6060001682c0:15} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:15}, signal={0x6060001682c0:16} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000bbea40 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000753b00 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000bbea40, semaphore=0x6060001682c0, value=16 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000753b00, semaphore=0x6060001682c0, value=16 (OK)
W0605 17:37:40.309454 139756486557120 dispatch.py:272] Finished tracing + transforming jit(convert_element_type) in 0.0002548694610595703 sec
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c0002b8700 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:16}, signal={0x6060001682c0:17} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:17}, signal={0x6060001682c0:18} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000a9e080 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000a9e1a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000a9e080, semaphore=0x6060001682c0, value=18 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000a9e1a0, semaphore=0x6060001682c0, value=18 (OK)
W0605 17:37:40.310383 139756486557120 dispatch.py:272] Finished tracing + transforming jit(broadcast_in_dim) in 0.00022101402282714844 sec
W0605 17:37:40.310681 139756486557120 pxla.py:1882] Compiling broadcast_in_dim for with global shapes and types [ShapedArray(float32[])]. Argument mapping: (GSPMDSharding({replicated}),).
W0605 17:37:40.314482 139756486557120 dispatch.py:272] Finished jaxpr to MLIR module conversion jit(broadcast_in_dim) in 0.003664731979370117 sec
W0605 17:37:40.673562 139756486557120 dispatch.py:272] Finished XLA compilation of jit(broadcast_in_dim) in 0.35874080657958984 sec
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060003416c0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060003416c0, semaphore=0x6060001683e0, value=11 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=12, fence=0x604000f1d790 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060003416c0, from_fence=0x606000a9e080 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000a9e1a0, semaphore=0x6060001683e0, value=12 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0004d9af0, f=0, wait_fence=0x6060003416c0 {0x6060001683e0:11, 0x6060001682c0:18}, signal_fence=0x604000f1d790 {0x6060001683e0:12} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060006b29a0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060006b2b20 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006b29a0, semaphore=0x6060001683e0, value=12 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006b2b20, semaphore=0x6060001683e0, value=12 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c0002b87c0, wait={0x6060001682c0:18, 0x6060001683e0:12}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000a55f80 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:18}, signal={0x6060001682c0:19} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:19}, signal={0x6060001682c0:20} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000b07200 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000b07140 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000b07200, semaphore=0x6060001682c0, value=20 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000b07140, semaphore=0x6060001682c0, value=20 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000b070e0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000b070e0, semaphore=0x6060001683e0, value=12 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=13, fence=0x604000aa2c10 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x606000b070e0, from_fence=0x606000b07200 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000b07140, semaphore=0x6060001683e0, value=13 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0004d9af0, f=0, wait_fence=0x606000b070e0 {0x6060001683e0:12, 0x6060001682c0:20}, signal_fence=0x604000aa2c10 {0x6060001683e0:13} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060005b9700 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060005ba060 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060005b9700, semaphore=0x6060001683e0, value=13 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060005ba060, semaphore=0x6060001683e0, value=13 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000a55ec0, wait={0x6060001682c0:20, 0x6060001683e0:13}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000a28f80 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:20}, signal={0x6060001682c0:21} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:21}, signal={0x6060001682c0:22} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000972b00 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060009712a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000972b00, semaphore=0x6060001682c0, value=22 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060009712a0, semaphore=0x6060001682c0, value=22 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600096ffe0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600096ffe0, semaphore=0x6060001683e0, value=13 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=14, fence=0x6040008e9b10 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600096ffe0, from_fence=0x606000972b00 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060009712a0, semaphore=0x6060001683e0, value=14 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0004d9af0, f=0, wait_fence=0x60600096ffe0 {0x6060001683e0:13, 0x6060001682c0:22}, signal_fence=0x6040008e9b10 {0x6060001683e0:14} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000970700 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600034d420 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000970700, semaphore=0x6060001683e0, value=14 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600034d420, semaphore=0x6060001683e0, value=14 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000a29040, wait={0x6060001682c0:22, 0x6060001683e0:14}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000a294c0 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:22}, signal={0x6060001682c0:23} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:23}, signal={0x6060001682c0:24} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000cebe80 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000cebe20 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000cebe80, semaphore=0x6060001682c0, value=24 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000cebe20, semaphore=0x6060001682c0, value=24 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000946460 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000946460, semaphore=0x6060001683e0, value=14 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=15, fence=0x604001474b90 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x606000946460, from_fence=0x606000cebe80 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000cebe20, semaphore=0x6060001683e0, value=15 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0004d9af0, f=0, wait_fence=0x606000946460 {0x6060001683e0:14, 0x6060001682c0:24}, signal_fence=0x604001474b90 {0x6060001683e0:15} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000946700 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060009467c0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000946700, semaphore=0x6060001683e0, value=15 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060009467c0, semaphore=0x6060001683e0, value=15 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000a29580, wait={0x6060001682c0:24, 0x6060001683e0:15}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000a28080 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:24}, signal={0x6060001682c0:25} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:25}, signal={0x6060001682c0:26} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060009472a0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000661b20 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060009472a0, semaphore=0x6060001682c0, value=26 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000661b20, semaphore=0x6060001682c0, value=26 (OK)
W0605 17:37:40.679012 139756486557120 dispatch.py:272] Finished tracing + transforming jit(broadcast_in_dim) in 0.00041937828063964844 sec
W0605 17:37:40.679480 139756486557120 pxla.py:1882] Compiling broadcast_in_dim for with global shapes and types [ShapedArray(float32[])]. Argument mapping: (GSPMDSharding({replicated}),).
W0605 17:37:40.684844 139756486557120 dispatch.py:272] Finished jaxpr to MLIR module conversion jit(broadcast_in_dim) in 0.0051648616790771484 sec
W0605 17:37:41.056386 139756486557120 dispatch.py:272] Finished XLA compilation of jit(broadcast_in_dim) in 0.3710479736328125 sec
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000967100 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000967100, semaphore=0x6060001683e0, value=15 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=16, fence=0x60400162c990 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x606000967100, from_fence=0x6060009472a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000661b20, semaphore=0x6060001683e0, value=16 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b000469ec0, f=0, wait_fence=0x606000967100 {0x6060001683e0:15, 0x6060001682c0:26}, signal_fence=0x60400162c990 {0x6060001683e0:16} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000967e80 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000967e20 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000967e80, semaphore=0x6060001683e0, value=16 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000967e20, semaphore=0x6060001683e0, value=16 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000a28140, wait={0x6060001682c0:26, 0x6060001683e0:16}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c0005de080 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:26}, signal={0x6060001682c0:27} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:27}, signal={0x6060001682c0:28} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000968cc0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000968c60 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000968cc0, semaphore=0x6060001682c0, value=28 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000968c60, semaphore=0x6060001682c0, value=28 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060007e8300 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060007e8300, semaphore=0x6060001683e0, value=16 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=17, fence=0x60400117b350 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060007e8300, from_fence=0x606000968cc0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000968c60, semaphore=0x6060001683e0, value=17 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b000469ec0, f=0, wait_fence=0x6060007e8300 {0x6060001683e0:16, 0x6060001682c0:28}, signal_fence=0x60400117b350 {0x6060001683e0:17} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060007e8240 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000761a80 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060007e8240, semaphore=0x6060001683e0, value=17 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000761a80, semaphore=0x6060001683e0, value=17 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000b9d400, wait={0x6060001682c0:28, 0x6060001683e0:17}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000471e80 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:28}, signal={0x6060001682c0:29} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:29}, signal={0x6060001682c0:30} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600084b120 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600084b0c0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600084b120, semaphore=0x6060001682c0, value=30 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600084b0c0, semaphore=0x6060001682c0, value=30 (OK)
W0605 17:37:41.059996 139756486557120 dispatch.py:272] Finished tracing + transforming jit(broadcast_in_dim) in 0.0002830028533935547 sec
W0605 17:37:41.060308 139756486557120 pxla.py:1882] Compiling broadcast_in_dim for with global shapes and types [ShapedArray(float32[])]. Argument mapping: (GSPMDSharding({replicated}),).
W0605 17:37:41.064166 139756486557120 dispatch.py:272] Finished jaxpr to MLIR module conversion jit(broadcast_in_dim) in 0.003727436065673828 sec
W0605 17:37:41.432264 139756486557120 dispatch.py:272] Finished XLA compilation of jit(broadcast_in_dim) in 0.36777281761169434 sec
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000c9b000 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000c9b000, semaphore=0x6060001683e0, value=17 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=18, fence=0x6040016a0110 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x606000c9b000, from_fence=0x60600084b120 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600084b0c0, semaphore=0x6060001683e0, value=18 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0008a7ea0, f=0, wait_fence=0x606000c9b000 {0x6060001683e0:17, 0x6060001682c0:30}, signal_fence=0x6040016a0110 {0x6060001683e0:18} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000ca0c40 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060006442a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000ca0c40, semaphore=0x6060001683e0, value=18 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006442a0, semaphore=0x6060001683e0, value=18 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000b61e80, wait={0x6060001682c0:30, 0x6060001683e0:18}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000266200 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:30}, signal={0x6060001682c0:31} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:31}, signal={0x6060001682c0:32} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060003a7a20 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000391dc0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060003a7a20, semaphore=0x6060001682c0, value=32 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000391dc0, semaphore=0x6060001682c0, value=32 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060003902c0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060003902c0, semaphore=0x6060001683e0, value=18 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=19, fence=0x60400041ba10 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060003902c0, from_fence=0x6060003a7a20 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000391dc0, semaphore=0x6060001683e0, value=19 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0008a7ea0, f=0, wait_fence=0x6060003902c0 {0x6060001683e0:18, 0x6060001682c0:32}, signal_fence=0x60400041ba10 {0x6060001683e0:19} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060003915e0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060001c5e60 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060003915e0, semaphore=0x6060001683e0, value=19 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060001c5e60, semaphore=0x6060001683e0, value=19 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000352240, wait={0x6060001682c0:32, 0x6060001683e0:19}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000a98e80 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:32}, signal={0x6060001682c0:33} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:33}, signal={0x6060001682c0:34} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000145880 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000bf8ee0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000145880, semaphore=0x6060001682c0, value=34 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000bf8ee0, semaphore=0x6060001682c0, value=34 (OK)
W0605 17:37:41.435681 139756486557120 dispatch.py:272] Finished tracing + transforming jit(broadcast_in_dim) in 0.0003001689910888672 sec
W0605 17:37:41.435994 139756486557120 pxla.py:1882] Compiling broadcast_in_dim for with global shapes and types [ShapedArray(float32[])]. Argument mapping: (GSPMDSharding({replicated}),).
W0605 17:37:41.439823 139756486557120 dispatch.py:272] Finished jaxpr to MLIR module conversion jit(broadcast_in_dim) in 0.003693819046020508 sec
W0605 17:37:41.806956 139756486557120 dispatch.py:272] Finished XLA compilation of jit(broadcast_in_dim) in 0.36679840087890625 sec
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000981c80 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000981c80, semaphore=0x6060001683e0, value=19 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=20, fence=0x6040014c8650 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x606000981c80, from_fence=0x606000145880 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000bf8ee0, semaphore=0x6060001683e0, value=20 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0005eb9c0, f=0, wait_fence=0x606000981c80 {0x6060001683e0:19, 0x6060001682c0:34}, signal_fence=0x6040014c8650 {0x6060001683e0:20} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060005719a0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060007cf6a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060005719a0, semaphore=0x6060001683e0, value=20 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060007cf6a0, semaphore=0x6060001683e0, value=20 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000676c00, wait={0x6060001682c0:34, 0x6060001683e0:20}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000443080 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:34}, signal={0x6060001682c0:35} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:35}, signal={0x6060001682c0:36} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060005da0a0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000456140 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060005da0a0, semaphore=0x6060001682c0, value=36 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000456140, semaphore=0x6060001682c0, value=36 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600082fb20 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600082fb20, semaphore=0x6060001683e0, value=20 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=21, fence=0x60400161db50 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600082fb20, from_fence=0x6060005da0a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000456140, semaphore=0x6060001683e0, value=21 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0005eb9c0, f=0, wait_fence=0x60600082fb20 {0x6060001683e0:20, 0x6060001682c0:36}, signal_fence=0x60400161db50 {0x6060001683e0:21} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600044a9e0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600061e4a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600044a9e0, semaphore=0x6060001683e0, value=21 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600061e4a0, semaphore=0x6060001683e0, value=21 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000ad9ec0, wait={0x6060001682c0:36, 0x6060001683e0:21}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c0009f4300 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:36}, signal={0x6060001682c0:37} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:37}, signal={0x6060001682c0:38} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000a93c40 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000a93820 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000a93c40, semaphore=0x6060001682c0, value=38 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000a93820, semaphore=0x6060001682c0, value=38 (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c0002c6ec0 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:38}, signal={0x6060001682c0:39} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:39}, signal={0x6060001682c0:40} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000927e00 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000315740 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000927e00, semaphore=0x6060001682c0, value=40 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000315740, semaphore=0x6060001682c0, value=40 (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000c33280 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:40}, signal={0x6060001682c0:41} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:41}, signal={0x6060001682c0:42} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000ba93e0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060007beea0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000ba93e0, semaphore=0x6060001682c0, value=42 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060007beea0, semaphore=0x6060001682c0, value=42 (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c00039c700 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:42}, signal={0x6060001682c0:43} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:43}, signal={0x6060001682c0:44} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000b2c220 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000c427a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000b2c220, semaphore=0x6060001682c0, value=44 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000c427a0, semaphore=0x6060001682c0, value=44 (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c0005fba80 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:44}, signal={0x6060001682c0:45} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:45}, signal={0x6060001682c0:46} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000c42b00 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000c422c0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000c42b00, semaphore=0x6060001682c0, value=46 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000c422c0, semaphore=0x6060001682c0, value=46 (OK)
W0605 17:37:41.817803 139756486557120 dispatch.py:272] Finished tracing + transforming jit(broadcast_in_dim) in 0.00026488304138183594 sec
W0605 17:37:41.818148 139756486557120 pxla.py:1882] Compiling broadcast_in_dim for with global shapes and types [ShapedArray(float32[])]. Argument mapping: (GSPMDSharding({replicated}),).
W0605 17:37:41.822030 139756486557120 dispatch.py:272] Finished jaxpr to MLIR module conversion jit(broadcast_in_dim) in 0.0037508010864257812 sec
W0605 17:37:42.176656 139756486557120 dispatch.py:272] Finished XLA compilation of jit(broadcast_in_dim) in 0.3542964458465576 sec
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060008c8880 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060008c8880, semaphore=0x6060001683e0, value=21 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=22, fence=0x60400096c810 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060008c8880, from_fence=0x606000c42b00 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000c422c0, semaphore=0x6060001683e0, value=22 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b000a12c90, f=0, wait_fence=0x6060008c8880 {0x6060001683e0:21, 0x6060001682c0:46}, signal_fence=0x60400096c810 {0x6060001683e0:22} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060003297e0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060008c87c0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060003297e0, semaphore=0x6060001683e0, value=22 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060008c87c0, semaphore=0x6060001683e0, value=22 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c0005fb600, wait={0x6060001682c0:46, 0x6060001683e0:22}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000189c40 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:46}, signal={0x6060001682c0:47} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:47}, signal={0x6060001682c0:48} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060008c8f40 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060008c8c40 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060008c8f40, semaphore=0x6060001682c0, value=48 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060008c8c40, semaphore=0x6060001682c0, value=48 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060008c8fa0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060008c8fa0, semaphore=0x6060001683e0, value=22 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=23, fence=0x604000bb7e10 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060008c8fa0, from_fence=0x6060008c8f40 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060008c8c40, semaphore=0x6060001683e0, value=23 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b000a12c90, f=0, wait_fence=0x6060008c8fa0 {0x6060001683e0:22, 0x6060001682c0:48}, signal_fence=0x604000bb7e10 {0x6060001683e0:23} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060008c8700 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060008c8dc0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060008c8700, semaphore=0x6060001683e0, value=23 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060008c8dc0, semaphore=0x6060001683e0, value=23 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c0006394c0, wait={0x6060001682c0:48, 0x6060001683e0:23}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000b48500 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:48}, signal={0x6060001682c0:49} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:49}, signal={0x6060001682c0:50} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060008c7f80 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060008c8040 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060008c7f80, semaphore=0x6060001682c0, value=50 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060008c8040, semaphore=0x6060001682c0, value=50 (OK)
W0605 17:37:42.180253 139756486557120 dispatch.py:272] Finished tracing + transforming jit(broadcast_in_dim) in 0.00028324127197265625 sec
W0605 17:37:42.180571 139756486557120 pxla.py:1882] Compiling broadcast_in_dim for with global shapes and types [ShapedArray(float32[])]. Argument mapping: (GSPMDSharding({replicated}),).
W0605 17:37:42.184416 139756486557120 dispatch.py:272] Finished jaxpr to MLIR module conversion jit(broadcast_in_dim) in 0.003713846206665039 sec
W0605 17:37:42.550396 139756486557120 dispatch.py:272] Finished XLA compilation of jit(broadcast_in_dim) in 0.365649938583374 sec
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000cf2000 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000cf2000, semaphore=0x6060001683e0, value=23 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=24, fence=0x6040002a0190 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x606000cf2000, from_fence=0x6060008c7f80 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060008c8040, semaphore=0x6060001683e0, value=24 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b000710b30, f=0, wait_fence=0x606000cf2000 {0x6060001683e0:23, 0x6060001682c0:50}, signal_fence=0x6040002a0190 {0x6060001683e0:24} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600095efa0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000342920 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600095efa0, semaphore=0x6060001683e0, value=24 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000342920, semaphore=0x6060001683e0, value=24 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c0006397c0, wait={0x6060001682c0:50, 0x6060001683e0:24}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c00052ef40 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:50}, signal={0x6060001682c0:51} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:51}, signal={0x6060001682c0:52} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000821540 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060009fe960 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000821540, semaphore=0x6060001682c0, value=52 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060009fe960, semaphore=0x6060001682c0, value=52 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000d6e320 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000d6e320, semaphore=0x6060001683e0, value=24 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=25, fence=0x6040009082d0 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x606000d6e320, from_fence=0x606000821540 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060009fe960, semaphore=0x6060001683e0, value=25 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b000710b30, f=0, wait_fence=0x606000d6e320 {0x6060001683e0:24, 0x6060001682c0:52}, signal_fence=0x6040009082d0 {0x6060001683e0:25} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060008c5820 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000d6e5c0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060008c5820, semaphore=0x6060001683e0, value=25 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000d6e5c0, semaphore=0x6060001683e0, value=25 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c00052f000, wait={0x6060001682c0:52, 0x6060001683e0:25}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c00052e880 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:52}, signal={0x6060001682c0:53} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:53}, signal={0x6060001682c0:54} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060008d3320 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000a3da20 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060008d3320, semaphore=0x6060001682c0, value=54 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000a3da20, semaphore=0x6060001682c0, value=54 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000131c00 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000131c00, semaphore=0x6060001683e0, value=25 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=26, fence=0x6040006bcc10 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x606000131c00, from_fence=0x6060008d3320 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000a3da20, semaphore=0x6060001683e0, value=26 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0004d9af0, f=0, wait_fence=0x606000131c00 {0x6060001683e0:25, 0x6060001682c0:54}, signal_fence=0x6040006bcc10 {0x6060001683e0:26} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000c2da40 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060009fe600 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000c2da40, semaphore=0x6060001683e0, value=26 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060009fe600, semaphore=0x6060001683e0, value=26 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c00052e940, wait={0x6060001682c0:54, 0x6060001683e0:26}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000bc80c0 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:54}, signal={0x6060001682c0:55} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:55}, signal={0x6060001682c0:56} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000d6db40 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060009823a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000d6db40, semaphore=0x6060001682c0, value=56 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060009823a0, semaphore=0x6060001682c0, value=56 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000732260 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000732260, semaphore=0x6060001683e0, value=26 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=27, fence=0x604000248d10 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x606000732260, from_fence=0x606000d6db40 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060009823a0, semaphore=0x6060001683e0, value=27 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0004d9af0, f=0, wait_fence=0x606000732260 {0x6060001683e0:26, 0x6060001682c0:56}, signal_fence=0x604000248d10 {0x6060001683e0:27} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000d4b580 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000254d20 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000d4b580, semaphore=0x6060001683e0, value=27 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000254d20, semaphore=0x6060001683e0, value=27 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000bc4880, wait={0x6060001682c0:56, 0x6060001683e0:27}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000bc6800 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:56}, signal={0x6060001682c0:57} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:57}, signal={0x6060001682c0:58} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000b7e240 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000b01b60 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000b7e240, semaphore=0x6060001682c0, value=58 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000b01b60, semaphore=0x6060001682c0, value=58 (OK)
W0605 17:37:42.556579 139756486557120 dispatch.py:272] Finished tracing + transforming jit(broadcast_in_dim) in 0.0002880096435546875 sec
W0605 17:37:42.556906 139756486557120 pxla.py:1882] Compiling broadcast_in_dim for with global shapes and types [ShapedArray(float32[])]. Argument mapping: (GSPMDSharding({replicated}),).
W0605 17:37:42.560795 139756486557120 dispatch.py:272] Finished jaxpr to MLIR module conversion jit(broadcast_in_dim) in 0.003753662109375 sec
W0605 17:37:42.913717 139756486557120 dispatch.py:272] Finished XLA compilation of jit(broadcast_in_dim) in 0.35259294509887695 sec
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600032d3e0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600032d3e0, semaphore=0x6060001683e0, value=27 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=28, fence=0x60400054ed50 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600032d3e0, from_fence=0x606000b7e240 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000b01b60, semaphore=0x6060001683e0, value=28 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b000a0ef60, f=0, wait_fence=0x60600032d3e0 {0x6060001683e0:27, 0x6060001682c0:58}, signal_fence=0x60400054ed50 {0x6060001683e0:28} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600072e9c0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600037f940 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600072e9c0, semaphore=0x6060001683e0, value=28 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600037f940, semaphore=0x6060001683e0, value=28 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000bc5000, wait={0x6060001682c0:58, 0x6060001683e0:28}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000b8b580 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:58}, signal={0x6060001682c0:59} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:59}, signal={0x6060001682c0:60} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600036e900 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600037f700 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600036e900, semaphore=0x6060001682c0, value=60 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600037f700, semaphore=0x6060001682c0, value=60 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600072e540 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600072e540, semaphore=0x6060001683e0, value=28 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=29, fence=0x604001347e50 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600072e540, from_fence=0x60600036e900 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600037f700, semaphore=0x6060001683e0, value=29 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b000a0ef60, f=0, wait_fence=0x60600072e540 {0x6060001683e0:28, 0x6060001682c0:60}, signal_fence=0x604001347e50 {0x6060001683e0:29} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600036e060 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600072eae0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600036e060, semaphore=0x6060001683e0, value=29 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600072eae0, semaphore=0x6060001683e0, value=29 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000b8b1c0, wait={0x6060001682c0:60, 0x6060001683e0:29}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c0005eb1c0 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:60}, signal={0x6060001682c0:61} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:61}, signal={0x6060001682c0:62} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600037f640 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600037f820 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600037f640, semaphore=0x6060001682c0, value=62 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600037f820, semaphore=0x6060001682c0, value=62 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000006680 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000006680, semaphore=0x6060001683e0, value=29 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=30, fence=0x604001319f90 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x606000006680, from_fence=0x60600037f640 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600037f820, semaphore=0x6060001683e0, value=30 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0004d9af0, f=0, wait_fence=0x606000006680 {0x6060001683e0:29, 0x6060001682c0:62}, signal_fence=0x604001319f90 {0x6060001683e0:30} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060003295a0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000329660 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060003295a0, semaphore=0x6060001683e0, value=30 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000329660, semaphore=0x6060001683e0, value=30 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c0005eb7c0, wait={0x6060001682c0:62, 0x6060001683e0:30}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c0002c7340 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:62}, signal={0x6060001682c0:63} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:63}, signal={0x6060001682c0:64} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600037d720 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000129a40 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600037d720, semaphore=0x6060001682c0, value=64 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000129a40, semaphore=0x6060001682c0, value=64 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000129980 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000129980, semaphore=0x6060001683e0, value=30 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=31, fence=0x60400131fb90 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x606000129980, from_fence=0x60600037d720 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000129a40, semaphore=0x6060001683e0, value=31 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0004d9af0, f=0, wait_fence=0x606000129980 {0x6060001683e0:30, 0x6060001682c0:64}, signal_fence=0x60400131fb90 {0x6060001683e0:31} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000129680 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060001cd8a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000129680, semaphore=0x6060001683e0, value=31 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060001cd8a0, semaphore=0x6060001683e0, value=31 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c0002c7400, wait={0x6060001682c0:64, 0x6060001683e0:31}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000054280 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:64}, signal={0x6060001682c0:65} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:65}, signal={0x6060001682c0:66} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060000cf020 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060000c76a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060000cf020, semaphore=0x6060001682c0, value=66 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060000c76a0, semaphore=0x6060001682c0, value=66 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600037d480 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600037d480, semaphore=0x6060001683e0, value=31 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=32, fence=0x604000ab0c10 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600037d480, from_fence=0x6060000cf020 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060000c76a0, semaphore=0x6060001683e0, value=32 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0004d9af0, f=0, wait_fence=0x60600037d480 {0x6060001683e0:31, 0x6060001682c0:66}, signal_fence=0x604000ab0c10 {0x6060001683e0:32} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000006860 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060001a9120 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000006860, semaphore=0x6060001683e0, value=32 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060001a9120, semaphore=0x6060001683e0, value=32 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000332980, wait={0x6060001682c0:66, 0x6060001683e0:32}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000ba52c0 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:66}, signal={0x6060001682c0:67} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:67}, signal={0x6060001682c0:68} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000192440 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600075ffe0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000192440, semaphore=0x6060001682c0, value=68 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600075ffe0, semaphore=0x6060001682c0, value=68 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600052df60 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600052df60, semaphore=0x6060001683e0, value=32 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=33, fence=0x604000aae850 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600052df60, from_fence=0x606000192440 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600075ffe0, semaphore=0x6060001683e0, value=33 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0004d9af0, f=0, wait_fence=0x60600052df60 {0x6060001683e0:32, 0x6060001682c0:68}, signal_fence=0x604000aae850 {0x6060001683e0:33} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060000a1780 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600052e1a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060000a1780, semaphore=0x6060001683e0, value=33 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600052e1a0, semaphore=0x6060001683e0, value=33 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000344200, wait={0x6060001682c0:68, 0x6060001683e0:33}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c0006832c0 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:68}, signal={0x6060001682c0:69} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:69}, signal={0x6060001682c0:70} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000ccb480 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000ccb540 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000ccb480, semaphore=0x6060001682c0, value=70 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000ccb540, semaphore=0x6060001682c0, value=70 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000ccb3c0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000ccb3c0, semaphore=0x6060001683e0, value=33 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=34, fence=0x604000dd5110 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x606000ccb3c0, from_fence=0x606000ccb480 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000ccb540, semaphore=0x6060001683e0, value=34 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0004d9af0, f=0, wait_fence=0x606000ccb3c0 {0x6060001683e0:33, 0x6060001682c0:70}, signal_fence=0x604000dd5110 {0x6060001683e0:34} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000ccbb40 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000ccb720 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000ccbb40, semaphore=0x6060001683e0, value=34 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000ccb720, semaphore=0x6060001683e0, value=34 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c00022d440, wait={0x6060001682c0:70, 0x6060001683e0:34}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000a85800 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:70}, signal={0x6060001682c0:71} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:71}, signal={0x6060001682c0:72} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000ccbf60 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000ccbe40 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000ccbf60, semaphore=0x6060001682c0, value=72 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000ccbe40, semaphore=0x6060001682c0, value=72 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000ccbd80 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000ccbd80, semaphore=0x6060001683e0, value=34 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=35, fence=0x604000727d10 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x606000ccbd80, from_fence=0x606000ccbf60 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000ccbe40, semaphore=0x6060001683e0, value=35 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0004d9af0, f=0, wait_fence=0x606000ccbd80 {0x6060001683e0:34, 0x6060001682c0:72}, signal_fence=0x604000727d10 {0x6060001683e0:35} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000ccbcc0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600072c140 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000ccbcc0, semaphore=0x6060001683e0, value=35 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600072c140, semaphore=0x6060001683e0, value=35 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000079e40, wait={0x6060001682c0:72, 0x6060001683e0:35}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c00006b5c0 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:72}, signal={0x6060001682c0:73} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:73}, signal={0x6060001682c0:74} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060004d3c60 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600025a0c0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060004d3c60, semaphore=0x6060001682c0, value=74 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600025a0c0, semaphore=0x6060001682c0, value=74 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060006fe8a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006fe8a0, semaphore=0x6060001683e0, value=35 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=36, fence=0x6040007284d0 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060006fe8a0, from_fence=0x6060004d3c60 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600025a0c0, semaphore=0x6060001683e0, value=36 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0004d9af0, f=0, wait_fence=0x6060006fe8a0 {0x6060001683e0:35, 0x6060001682c0:74}, signal_fence=0x6040007284d0 {0x6060001683e0:36} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600072c080 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060005f5ee0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600072c080, semaphore=0x6060001683e0, value=36 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060005f5ee0, semaphore=0x6060001683e0, value=36 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c0004a8780, wait={0x6060001682c0:74, 0x6060001683e0:36}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000676900 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:74}, signal={0x6060001682c0:75} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:75}, signal={0x6060001682c0:76} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060004d30c0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060007d5520 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060004d30c0, semaphore=0x6060001682c0, value=76 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060007d5520, semaphore=0x6060001682c0, value=76 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060003996e0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060003996e0, semaphore=0x6060001683e0, value=36 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=37, fence=0x604000231e10 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060003996e0, from_fence=0x6060004d30c0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060007d5520, semaphore=0x6060001683e0, value=37 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0004d9af0, f=0, wait_fence=0x6060003996e0 {0x6060001683e0:36, 0x6060001682c0:76}, signal_fence=0x604000231e10 {0x6060001683e0:37} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600036fbc0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060008daca0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600036fbc0, semaphore=0x6060001683e0, value=37 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060008daca0, semaphore=0x6060001683e0, value=37 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c0009b5cc0, wait={0x6060001682c0:76, 0x6060001683e0:37}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000685000 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:76}, signal={0x6060001682c0:77} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:77}, signal={0x6060001682c0:78} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000978da0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000978440 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000978da0, semaphore=0x6060001682c0, value=78 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000978440, semaphore=0x6060001682c0, value=78 (OK)
W0605 17:37:42.924442 139756486557120 dispatch.py:272] Finished tracing + transforming jit(broadcast_in_dim) in 0.0002827644348144531 sec
W0605 17:37:42.924763 139756486557120 pxla.py:1882] Compiling broadcast_in_dim for with global shapes and types [ShapedArray(float32[])]. Argument mapping: (GSPMDSharding({replicated}),).
W0605 17:37:42.928637 139756486557120 dispatch.py:272] Finished jaxpr to MLIR module conversion jit(broadcast_in_dim) in 0.0037441253662109375 sec
W0605 17:37:43.298731 139756486557120 dispatch.py:272] Finished XLA compilation of jit(broadcast_in_dim) in 0.36975932121276855 sec
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000c3b300 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000c3b300, semaphore=0x6060001683e0, value=37 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=38, fence=0x6040015e6290 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x606000c3b300, from_fence=0x606000978da0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000978440, semaphore=0x6060001683e0, value=38 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0008a2490, f=0, wait_fence=0x606000c3b300 {0x6060001683e0:37, 0x6060001682c0:78}, signal_fence=0x6040015e6290 {0x6060001683e0:38} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000c44960 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600018fb60 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000c44960, semaphore=0x6060001683e0, value=38 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600018fb60, semaphore=0x6060001683e0, value=38 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000bd0f40, wait={0x6060001682c0:78, 0x6060001683e0:38}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c0009b2a80 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:78}, signal={0x6060001682c0:79} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:79}, signal={0x6060001682c0:80} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000628be0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000b6f540 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000628be0, semaphore=0x6060001682c0, value=80 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000b6f540, semaphore=0x6060001682c0, value=80 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000628e80 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000628e80, semaphore=0x6060001683e0, value=38 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=39, fence=0x604000159250 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x606000628e80, from_fence=0x606000628be0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000b6f540, semaphore=0x6060001683e0, value=39 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0008a2490, f=0, wait_fence=0x606000628e80 {0x6060001683e0:38, 0x6060001682c0:80}, signal_fence=0x604000159250 {0x6060001683e0:39} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000b37da0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060006b5400 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000b37da0, semaphore=0x6060001683e0, value=39 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006b5400, semaphore=0x6060001683e0, value=39 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c0009b2b40, wait={0x6060001682c0:80, 0x6060001683e0:39}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c0009b2fc0 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:80}, signal={0x6060001682c0:81} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:81}, signal={0x6060001682c0:82} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000d64480 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000d64360 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000d64480, semaphore=0x6060001682c0, value=82 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000d64360, semaphore=0x6060001682c0, value=82 (OK)
W0605 17:37:43.303542 139756486557120 dispatch.py:272] Finished tracing + transforming jit(broadcast_in_dim) in 0.0002834796905517578 sec
W0605 17:37:43.303862 139756486557120 pxla.py:1882] Compiling broadcast_in_dim for with global shapes and types [ShapedArray(float32[])]. Argument mapping: (GSPMDSharding({replicated}),).
W0605 17:37:43.307848 139756486557120 dispatch.py:272] Finished jaxpr to MLIR module conversion jit(broadcast_in_dim) in 0.0038557052612304688 sec
W0605 17:37:43.702368 139756486557120 dispatch.py:272] Finished XLA compilation of jit(broadcast_in_dim) in 0.3941676616668701 sec
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060006c8960 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006c8960, semaphore=0x6060001683e0, value=39 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=40, fence=0x6040015f1f90 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060006c8960, from_fence=0x606000d64480 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000d64360, semaphore=0x6060001683e0, value=40 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00035ca30, f=0, wait_fence=0x6060006c8960 {0x6060001683e0:39, 0x6060001682c0:82}, signal_fence=0x6040015f1f90 {0x6060001683e0:40} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000bb9ee0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000bb9fa0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000bb9ee0, semaphore=0x6060001683e0, value=40 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000bb9fa0, semaphore=0x6060001683e0, value=40 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c00098f080, wait={0x6060001682c0:82, 0x6060001683e0:40}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000b13100 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:82}, signal={0x6060001682c0:83} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:83}, signal={0x6060001682c0:84} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000bb93a0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000d0f820 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000bb93a0, semaphore=0x6060001682c0, value=84 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000d0f820, semaphore=0x6060001682c0, value=84 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060003d5500 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060003d5500, semaphore=0x6060001683e0, value=40 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=41, fence=0x604001406190 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060003d5500, from_fence=0x606000bb93a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000d0f820, semaphore=0x6060001683e0, value=41 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00035ca30, f=0, wait_fence=0x6060003d5500 {0x6060001683e0:40, 0x6060001682c0:84}, signal_fence=0x604001406190 {0x6060001683e0:41} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000283ac0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060003d5aa0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000283ac0, semaphore=0x6060001683e0, value=41 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060003d5aa0, semaphore=0x6060001683e0, value=41 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000768340, wait={0x6060001682c0:84, 0x6060001683e0:41}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000c4a2c0 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:84}, signal={0x6060001682c0:85} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:85}, signal={0x6060001682c0:86} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600050d080 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600050d0e0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600050d080, semaphore=0x6060001682c0, value=86 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600050d0e0, semaphore=0x6060001682c0, value=86 (OK)
W0605 17:37:43.708622 139756486557120 dispatch.py:272] Finished tracing + transforming jit(broadcast_in_dim) in 0.0005459785461425781 sec
W0605 17:37:43.709253 139756486557120 pxla.py:1882] Compiling broadcast_in_dim for with global shapes and types [ShapedArray(float32[])]. Argument mapping: (GSPMDSharding({replicated}),).
W0605 17:37:43.713417 139756486557120 dispatch.py:272] Finished jaxpr to MLIR module conversion jit(broadcast_in_dim) in 0.0039212703704833984 sec
W0605 17:37:44.077934 139756486557120 dispatch.py:272] Finished XLA compilation of jit(broadcast_in_dim) in 0.3641831874847412 sec
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060002b71e0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060002b71e0, semaphore=0x6060001683e0, value=41 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=42, fence=0x604000450490 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060002b71e0, from_fence=0x60600050d080 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600050d0e0, semaphore=0x6060001683e0, value=42 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0007efaa0, f=0, wait_fence=0x6060002b71e0 {0x6060001683e0:41, 0x6060001682c0:86}, signal_fence=0x604000450490 {0x6060001683e0:42} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060000e9240 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060005bfb20 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060000e9240, semaphore=0x6060001683e0, value=42 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060005bfb20, semaphore=0x6060001683e0, value=42 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000b7c1c0, wait={0x6060001682c0:86, 0x6060001683e0:42}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000b42c80 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:86}, signal={0x6060001682c0:87} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:87}, signal={0x6060001682c0:88} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060005f4e00 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600026dce0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060005f4e00, semaphore=0x6060001682c0, value=88 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600026dce0, semaphore=0x6060001682c0, value=88 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060005821a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060005821a0, semaphore=0x6060001683e0, value=42 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=43, fence=0x6040011e9d10 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060005821a0, from_fence=0x6060005f4e00 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600026dce0, semaphore=0x6060001683e0, value=43 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0007efaa0, f=0, wait_fence=0x6060005821a0 {0x6060001683e0:42, 0x6060001682c0:88}, signal_fence=0x6040011e9d10 {0x6060001683e0:43} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060005b2e60 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600073d120 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060005b2e60, semaphore=0x6060001683e0, value=43 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600073d120, semaphore=0x6060001683e0, value=43 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c0006556c0, wait={0x6060001682c0:88, 0x6060001683e0:43}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000655600 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:88}, signal={0x6060001682c0:89} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:89}, signal={0x6060001682c0:90} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600029f840 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000609f20 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600029f840, semaphore=0x6060001682c0, value=90 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000609f20, semaphore=0x6060001682c0, value=90 (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c00006ce80 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:90}, signal={0x6060001682c0:91} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:91}, signal={0x6060001682c0:92} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000bccc60 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000c7aa20 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000bccc60, semaphore=0x6060001682c0, value=92 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000c7aa20, semaphore=0x6060001682c0, value=92 (OK)
W0605 17:37:44.082766 139756486557120 dispatch.py:272] Finished tracing + transforming jit(broadcast_in_dim) in 0.0002627372741699219 sec
W0605 17:37:44.083084 139756486557120 pxla.py:1882] Compiling broadcast_in_dim for with global shapes and types [ShapedArray(int32[])]. Argument mapping: (GSPMDSharding({replicated}),).
W0605 17:37:44.087130 139756486557120 dispatch.py:272] Finished jaxpr to MLIR module conversion jit(broadcast_in_dim) in 0.003915309906005859 sec
W0605 17:37:44.459510 139756486557120 dispatch.py:272] Finished XLA compilation of jit(broadcast_in_dim) in 0.3720529079437256 sec
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060009638c0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060009638c0, semaphore=0x6060001683e0, value=43 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=44, fence=0x604000ddb850 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060009638c0, from_fence=0x606000ba93e0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060007beea0, semaphore=0x6060001683e0, value=44 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00011f9b0, f=0, wait_fence=0x6060009638c0 {0x6060001683e0:43, 0x6060001682c0:42}, signal_fence=0x604000ddb850 {0x6060001683e0:44} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000965a80 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060006dbb00 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000965a80, semaphore=0x6060001683e0, value=44 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006dbb00, semaphore=0x6060001683e0, value=44 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060006dbe00 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006dbe00, semaphore=0x6060001683e0, value=44 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=45, fence=0x604000b1dc50 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060006dbe00, from_fence=0x606000b2c220 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000c427a0, semaphore=0x6060001683e0, value=45 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00011f9b0, f=0, wait_fence=0x6060006dbe00 {0x6060001683e0:44, 0x6060001682c0:44}, signal_fence=0x604000b1dc50 {0x6060001683e0:45} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060008210c0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600086c7e0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060008210c0, semaphore=0x6060001683e0, value=45 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600086c7e0, semaphore=0x6060001683e0, value=45 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600015abe0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600015abe0, semaphore=0x6060001683e0, value=45 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=46, fence=0x60400013e310 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600015abe0, from_fence=0x60600029f840 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000609f20, semaphore=0x6060001683e0, value=46 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00011f9b0, f=0, wait_fence=0x60600015abe0 {0x6060001683e0:45, 0x6060001682c0:90}, signal_fence=0x60400013e310 {0x6060001683e0:46} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060007a8160 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000503c60 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060007a8160, semaphore=0x6060001683e0, value=46 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000503c60, semaphore=0x6060001683e0, value=46 (OK)
W0605 17:37:44.462649 139756486557120 dispatch.py:272] Finished tracing + transforming jit(broadcast_in_dim) in 0.0002999305725097656 sec
W0605 17:37:44.462978 139756486557120 pxla.py:1882] Compiling broadcast_in_dim for with global shapes and types [ShapedArray(float32[8192])]. Argument mapping: (GSPMDSharding({replicated}),).
W0605 17:37:44.466973 139756486557120 dispatch.py:272] Finished jaxpr to MLIR module conversion jit(broadcast_in_dim) in 0.0038557052612304688 sec
W0605 17:37:44.827181 139756486557120 dispatch.py:272] Finished XLA compilation of jit(broadcast_in_dim) in 0.359877347946167 sec
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060000981e0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060000981e0, semaphore=0x6060001683e0, value=46 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=47, fence=0x604000d97d10 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060000981e0, from_fence=0x6060003297e0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060008c87c0, semaphore=0x6060001683e0, value=47 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0007d79a0, f=0, wait_fence=0x6060000981e0 {0x6060001683e0:46}, signal_fence=0x604000d97d10 {0x6060001683e0:47} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600009c7a0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000779d80 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600009c7a0, semaphore=0x6060001683e0, value=47 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000779d80, semaphore=0x6060001683e0, value=47 (OK)
W0605 17:37:44.829687 139756486557120 dispatch.py:272] Finished tracing + transforming jit(broadcast_in_dim) in 0.0003190040588378906 sec
W0605 17:37:44.830013 139756486557120 pxla.py:1882] Compiling broadcast_in_dim for with global shapes and types [ShapedArray(float32[2048,8192])]. Argument mapping: (GSPMDSharding({replicated}),).
W0605 17:37:44.834094 139756486557120 dispatch.py:272] Finished jaxpr to MLIR module conversion jit(broadcast_in_dim) in 0.003939390182495117 sec
W0605 17:37:45.209967 139756486557120 dispatch.py:272] Finished XLA compilation of jit(broadcast_in_dim) in 0.37554359436035156 sec
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000356660 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000356660, semaphore=0x6060001683e0, value=47 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=48, fence=0x604000ddf9d0 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x606000356660, from_fence=0x60600095efa0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000342920, semaphore=0x6060001683e0, value=48 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0003d73a0, f=0, wait_fence=0x606000356660 {0x6060001683e0:47}, signal_fence=0x604000ddf9d0 {0x6060001683e0:48} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060004af600 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060003eb820 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060004af600, semaphore=0x6060001683e0, value=48 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060003eb820, semaphore=0x6060001683e0, value=48 (OK)
W0605 17:37:45.224115 139756486557120 dispatch.py:272] Finished tracing + transforming jit(broadcast_in_dim) in 0.00031113624572753906 sec
W0605 17:37:45.224486 139756486557120 pxla.py:1882] Compiling broadcast_in_dim for with global shapes and types [ShapedArray(float32[2048])]. Argument mapping: (GSPMDSharding({replicated}),).
W0605 17:37:45.228661 139756486557120 dispatch.py:272] Finished jaxpr to MLIR module conversion jit(broadcast_in_dim) in 0.004036903381347656 sec
W0605 17:37:45.607018 139756486557120 dispatch.py:272] Finished XLA compilation of jit(broadcast_in_dim) in 0.37800025939941406 sec
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060005cca20 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060005cca20, semaphore=0x6060001683e0, value=48 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=49, fence=0x60400124fe10 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060005cca20, from_fence=0x606000c2da40 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060009fe600, semaphore=0x6060001683e0, value=49 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00070d380, f=0, wait_fence=0x6060005cca20 {0x6060001683e0:48}, signal_fence=0x60400124fe10 {0x6060001683e0:49} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600049fc40 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060004e61a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600049fc40, semaphore=0x6060001683e0, value=49 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060004e61a0, semaphore=0x6060001683e0, value=49 (OK)
W0605 17:37:45.610910 139756486557120 dispatch.py:272] Finished tracing + transforming jit(broadcast_in_dim) in 0.0005667209625244141 sec
W0605 17:37:45.611510 139756486557120 pxla.py:1882] Compiling broadcast_in_dim for with global shapes and types [ShapedArray(float32[8192,2048])]. Argument mapping: (GSPMDSharding({replicated}),).
W0605 17:37:45.616619 139756486557120 dispatch.py:272] Finished jaxpr to MLIR module conversion jit(broadcast_in_dim) in 0.00486302375793457 sec
W0605 17:37:45.954178 139756486557120 dispatch.py:272] Finished XLA compilation of jit(broadcast_in_dim) in 0.33722805976867676 sec
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060009c0920 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060009c0920, semaphore=0x6060001683e0, value=49 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=50, fence=0x604000148c10 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060009c0920, from_fence=0x60600072e9c0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600037f940, semaphore=0x6060001683e0, value=50 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b000843b60, f=0, wait_fence=0x6060009c0920 {0x6060001683e0:49}, signal_fence=0x604000148c10 {0x6060001683e0:50} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060004fbaa0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060003a06a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060004fbaa0, semaphore=0x6060001683e0, value=50 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060003a06a0, semaphore=0x6060001683e0, value=50 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000844f40 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000844f40, semaphore=0x6060001683e0, value=50 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=51, fence=0x6040008431d0 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x606000844f40, from_fence=0x6060003295a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000329660, semaphore=0x6060001683e0, value=51 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00070d380, f=0, wait_fence=0x606000844f40 {0x6060001683e0:50}, signal_fence=0x6040008431d0 {0x6060001683e0:51} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060003f5780 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600089cfc0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060003f5780, semaphore=0x6060001683e0, value=51 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600089cfc0, semaphore=0x6060001683e0, value=51 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000c407c0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000c407c0, semaphore=0x6060001683e0, value=51 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=52, fence=0x6040000f5a90 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x606000c407c0, from_fence=0x606000006860 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060001a9120, semaphore=0x6060001683e0, value=52 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00070d380, f=0, wait_fence=0x606000c407c0 {0x6060001683e0:51}, signal_fence=0x6040000f5a90 {0x6060001683e0:52} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060009c0200 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000c41900 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060009c0200, semaphore=0x6060001683e0, value=52 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000c41900, semaphore=0x6060001683e0, value=52 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000c41960 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000c41960, semaphore=0x6060001683e0, value=52 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=53, fence=0x60400165c3d0 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x606000c41960, from_fence=0x606000ccbb40 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000ccb720, semaphore=0x6060001683e0, value=53 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00070d380, f=0, wait_fence=0x606000c41960 {0x6060001683e0:52}, signal_fence=0x60400165c3d0 {0x6060001683e0:53} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060009c08c0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060009e2940 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060009c08c0, semaphore=0x6060001683e0, value=53 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060009e2940, semaphore=0x6060001683e0, value=53 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000bc9d20 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000bc9d20, semaphore=0x6060001683e0, value=53 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=54, fence=0x60400165c5d0 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x606000bc9d20, from_fence=0x60600072c080 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060005f5ee0, semaphore=0x6060001683e0, value=54 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00070d380, f=0, wait_fence=0x606000bc9d20 {0x6060001683e0:53}, signal_fence=0x60400165c5d0 {0x6060001683e0:54} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000bca3e0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000bca5c0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000bca3e0, semaphore=0x6060001683e0, value=54 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000bca5c0, semaphore=0x6060001683e0, value=54 (OK)
W0605 17:37:45.970173 139756486557120 dispatch.py:272] Finished tracing + transforming jit(broadcast_in_dim) in 0.00032019615173339844 sec
W0605 17:37:45.970548 139756486557120 pxla.py:1882] Compiling broadcast_in_dim for with global shapes and types [ShapedArray(float32[3,2048,32,64])]. Argument mapping: (GSPMDSharding({replicated}),).
W0605 17:37:45.974761 139756486557120 dispatch.py:272] Finished jaxpr to MLIR module conversion jit(broadcast_in_dim) in 0.004075050354003906 sec
W0605 17:37:46.345563 139756486557120 dispatch.py:272] Finished XLA compilation of jit(broadcast_in_dim) in 0.370466947555542 sec
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060006592a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006592a0, semaphore=0x6060001683e0, value=54 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=55, fence=0x6040005d00d0 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060006592a0, from_fence=0x606000c44960 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600018fb60, semaphore=0x6060001683e0, value=55 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b000a36100, f=0, wait_fence=0x6060006592a0 {0x6060001683e0:54}, signal_fence=0x6040005d00d0 {0x6060001683e0:55} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600065b940 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000659900 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600065b940, semaphore=0x6060001683e0, value=55 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000659900, semaphore=0x6060001683e0, value=55 (OK)
W0605 17:37:46.357086 139756486557120 dispatch.py:272] Finished tracing + transforming jit(broadcast_in_dim) in 0.00030922889709472656 sec
W0605 17:37:46.357457 139756486557120 pxla.py:1882] Compiling broadcast_in_dim for with global shapes and types [ShapedArray(float32[64])]. Argument mapping: (GSPMDSharding({replicated}),).
W0605 17:37:46.361810 139756486557120 dispatch.py:272] Finished jaxpr to MLIR module conversion jit(broadcast_in_dim) in 0.004218578338623047 sec
W0605 17:37:46.735365 139756486557120 dispatch.py:272] Finished XLA compilation of jit(broadcast_in_dim) in 0.3732173442840576 sec
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060007ea280 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060007ea280, semaphore=0x6060001683e0, value=55 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=56, fence=0x6040014db3d0 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060007ea280, from_fence=0x606000bb9ee0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000bb9fa0, semaphore=0x6060001683e0, value=56 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00036c520, f=0, wait_fence=0x6060007ea280 {0x6060001683e0:55}, signal_fence=0x6040014db3d0 {0x6060001683e0:56} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600023cde0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060009040a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600023cde0, semaphore=0x6060001683e0, value=56 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060009040a0, semaphore=0x6060001683e0, value=56 (OK)
W0605 17:37:46.737891 139756486557120 dispatch.py:272] Finished tracing + transforming jit(broadcast_in_dim) in 0.0003180503845214844 sec
W0605 17:37:46.738218 139756486557120 pxla.py:1882] Compiling broadcast_in_dim for with global shapes and types [ShapedArray(float32[2048,32,64])]. Argument mapping: (GSPMDSharding({replicated}),).
W0605 17:37:46.742366 139756486557120 dispatch.py:272] Finished jaxpr to MLIR module conversion jit(broadcast_in_dim) in 0.004011631011962891 sec
W0605 17:37:47.107496 139756486557120 dispatch.py:272] Finished XLA compilation of jit(broadcast_in_dim) in 0.36479902267456055 sec
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000a78d00 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000a78d00, semaphore=0x6060001683e0, value=56 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=57, fence=0x6040011bac90 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x606000a78d00, from_fence=0x6060000e9240 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060005bfb20, semaphore=0x6060001683e0, value=57 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0003c8250, f=0, wait_fence=0x606000a78d00 {0x6060001683e0:56}, signal_fence=0x6040011bac90 {0x6060001683e0:57} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060001619c0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060002a2c00 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060001619c0, semaphore=0x6060001683e0, value=57 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060002a2c00, semaphore=0x6060001683e0, value=57 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600086a9e0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600086a9e0, semaphore=0x6060001683e0, value=57 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=58, fence=0x604000895650 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600086a9e0, from_fence=0x6060008c8700 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060008c8dc0, semaphore=0x6060001683e0, value=58 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0007d79a0, f=0, wait_fence=0x60600086a9e0 {0x6060001683e0:57}, signal_fence=0x604000895650 {0x6060001683e0:58} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060006d96a0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600035b820 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006d96a0, semaphore=0x6060001683e0, value=58 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600035b820, semaphore=0x6060001683e0, value=58 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600086a680 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600086a680, semaphore=0x6060001683e0, value=58 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=59, fence=0x60400163e550 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600086a680, from_fence=0x6060008c5820 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000d6e5c0, semaphore=0x6060001683e0, value=59 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0003d73a0, f=0, wait_fence=0x60600086a680 {0x6060001683e0:58}, signal_fence=0x60400163e550 {0x6060001683e0:59} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600086a5c0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000cbc960 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600086a5c0, semaphore=0x6060001683e0, value=59 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000cbc960, semaphore=0x6060001683e0, value=59 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600011b4c0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600011b4c0, semaphore=0x6060001683e0, value=59 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=60, fence=0x60400154e050 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600011b4c0, from_fence=0x606000d4b580 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000254d20, semaphore=0x6060001683e0, value=60 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00070d380, f=0, wait_fence=0x60600011b4c0 {0x6060001683e0:59}, signal_fence=0x60400154e050 {0x6060001683e0:60} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060006fdf40 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000452720 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006fdf40, semaphore=0x6060001683e0, value=60 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000452720, semaphore=0x6060001683e0, value=60 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000791540 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000791540, semaphore=0x6060001683e0, value=60 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=61, fence=0x6040000eaa50 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x606000791540, from_fence=0x60600036e060 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600072eae0, semaphore=0x6060001683e0, value=61 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b000843b60, f=0, wait_fence=0x606000791540 {0x6060001683e0:60}, signal_fence=0x6040000eaa50 {0x6060001683e0:61} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000c07160 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060003f8480 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000c07160, semaphore=0x6060001683e0, value=61 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060003f8480, semaphore=0x6060001683e0, value=61 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060003f83c0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060003f83c0, semaphore=0x6060001683e0, value=61 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=62, fence=0x604000e02990 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060003f83c0, from_fence=0x606000129680 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060001cd8a0, semaphore=0x6060001683e0, value=62 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00070d380, f=0, wait_fence=0x6060003f83c0 {0x6060001683e0:61}, signal_fence=0x604000e02990 {0x6060001683e0:62} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060009a7960 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000d16de0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060009a7960, semaphore=0x6060001683e0, value=62 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000d16de0, semaphore=0x6060001683e0, value=62 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060003e7500 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060003e7500, semaphore=0x6060001683e0, value=62 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=63, fence=0x604001627210 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060003e7500, from_fence=0x6060000a1780 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600052e1a0, semaphore=0x6060001683e0, value=63 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00070d380, f=0, wait_fence=0x6060003e7500 {0x6060001683e0:62}, signal_fence=0x604001627210 {0x6060001683e0:63} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060005cd560 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060007c6280 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060005cd560, semaphore=0x6060001683e0, value=63 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060007c6280, semaphore=0x6060001683e0, value=63 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060004c9e20 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060004c9e20, semaphore=0x6060001683e0, value=63 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=64, fence=0x604000d12d50 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060004c9e20, from_fence=0x606000ccbcc0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600072c140, semaphore=0x6060001683e0, value=64 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00070d380, f=0, wait_fence=0x6060004c9e20 {0x6060001683e0:63}, signal_fence=0x604000d12d50 {0x6060001683e0:64} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060002c5ee0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060008c5340 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060002c5ee0, semaphore=0x6060001683e0, value=64 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060008c5340, semaphore=0x6060001683e0, value=64 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000b40ec0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000b40ec0, semaphore=0x6060001683e0, value=64 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=65, fence=0x604001510cd0 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x606000b40ec0, from_fence=0x60600036fbc0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060008daca0, semaphore=0x6060001683e0, value=65 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00070d380, f=0, wait_fence=0x606000b40ec0 {0x6060001683e0:64}, signal_fence=0x604001510cd0 {0x6060001683e0:65} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600038bd00 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000b5d360 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600038bd00, semaphore=0x6060001683e0, value=65 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000b5d360, semaphore=0x6060001683e0, value=65 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000879140 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000879140, semaphore=0x6060001683e0, value=65 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=66, fence=0x604000f3c8d0 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x606000879140, from_fence=0x606000b37da0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006b5400, semaphore=0x6060001683e0, value=66 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b000a36100, f=0, wait_fence=0x606000879140 {0x6060001683e0:65}, signal_fence=0x604000f3c8d0 {0x6060001683e0:66} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060006d6160 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000a045a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006d6160, semaphore=0x6060001683e0, value=66 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000a045a0, semaphore=0x6060001683e0, value=66 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600062edc0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600062edc0, semaphore=0x6060001683e0, value=66 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=67, fence=0x604000533b50 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600062edc0, from_fence=0x606000283ac0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060003d5aa0, semaphore=0x6060001683e0, value=67 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00036c520, f=0, wait_fence=0x60600062edc0 {0x6060001683e0:66}, signal_fence=0x604000533b50 {0x6060001683e0:67} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060005686a0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000b56e20 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060005686a0, semaphore=0x6060001683e0, value=67 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000b56e20, semaphore=0x6060001683e0, value=67 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060005b4d20 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060005b4d20, semaphore=0x6060001683e0, value=67 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=68, fence=0x60400166c610 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060005b4d20, from_fence=0x6060005b2e60 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600073d120, semaphore=0x6060001683e0, value=68 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0003c8250, f=0, wait_fence=0x6060005b4d20 {0x6060001683e0:67}, signal_fence=0x60400166c610 {0x6060001683e0:68} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000a1fae0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000b57240 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000a1fae0, semaphore=0x6060001683e0, value=68 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000b57240, semaphore=0x6060001683e0, value=68 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000a3a780 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000a3a780, semaphore=0x6060001683e0, value=68 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=69, fence=0x604000f0e110 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x606000a3a780, from_fence=0x606000bccc60 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000c7aa20, semaphore=0x6060001683e0, value=69 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00011f9b0, f=0, wait_fence=0x606000a3a780 {0x6060001683e0:68, 0x6060001682c0:92}, signal_fence=0x604000f0e110 {0x6060001683e0:69} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000abce00 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000771f80 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000abce00, semaphore=0x6060001683e0, value=69 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000771f80, semaphore=0x6060001683e0, value=69 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c0006553c0, wait={0x6060001682c0:92, 0x6060001683e0:69}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000b731c0, wait={0x6060001683e0:68}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c00023b6c0, wait={0x6060001683e0:67}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c0009b26c0, wait={0x6060001683e0:66}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000354dc0, wait={0x6060001683e0:65}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000a85ec0, wait={0x6060001683e0:64}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000320b00, wait={0x6060001683e0:63}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c00014f200, wait={0x6060001683e0:62}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000406000, wait={0x6060001683e0:61}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c00054f4c0, wait={0x6060001683e0:60}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c00052e700, wait={0x6060001683e0:59}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000b473c0, wait={0x6060001683e0:58}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c00035e000, wait={0x6060001683e0:57}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000b13280, wait={0x6060001683e0:56}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c0009b2840, wait={0x6060001683e0:55}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000bc53c0, wait={0x6060001683e0:54}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c00041d700, wait={0x6060001683e0:53}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000319540, wait={0x6060001683e0:52}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000b2f480, wait={0x6060001683e0:51}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c0005eb640, wait={0x6060001683e0:50}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000bc7e80, wait={0x6060001683e0:49}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c00052ed00, wait={0x6060001683e0:48}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000639880, wait={0x6060001683e0:47}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000b72c80, wait={0x6060001682c0:90, 0x6060001683e0:46}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c0000a2c40, wait={0x6060001682c0:44, 0x6060001683e0:45}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c00034f540, wait={0x6060001682c0:42, 0x6060001683e0:44}, signal={} (OK)
W0605 17:37:47.177995 139756486557120 dispatch.py:272] Finished tracing + transforming jit(convert_element_type) in 0.00024437904357910156 sec
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000243a00 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:92}, signal={0x6060001682c0:93} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:93}, signal={0x6060001682c0:94} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000a17c20 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060004cfdc0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000a17c20, semaphore=0x6060001682c0, value=94 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060004cfdc0, semaphore=0x6060001682c0, value=94 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x6080025003a0, wait={0x6060001683e0:10}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x608002500320, wait={0x6060001683e0:11}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x6080025002a0, wait={0x6060001683e0:10}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x608002500220, wait={0x6060001683e0:10}, signal={} (OK)
I0605 17:37:47.196380 139756486557120 partitioning.py:631] train state shapes: TrainState(step=(), mdl_vars={'params': {'lm': {'final_ln': {'bias': (2048,), 'scale': (2048,)}, 'position_emb': {'emb_var': (2048, 2048)}, 'softmax': {'logits_ffn': {'bias': {'b': (51200,)}, 'linear': {'w': (2048, 51200)}}}, 'transformer': {'repeat': {'sub': {'x_layers_0': {'ff_layer': {'ffn_layer1': {'bias': {'b': (24, 8192)}, 'linear': {'w': (24, 2048, 8192)}}, 'ffn_layer2': {'bias': {'b': (24, 2048)}, 'linear': {'w': (24, 8192, 2048)}}, 'layer_norm': {'bias': (24, 2048), 'scale': (24, 2048)}}, 'layer_norm': {'bias': (24, 2048), 'scale': (24, 2048)}, 'self_attention': {'combined_qkv': {'w': (24, 3, 2048, 32, 64)}, 'per_dim_scale': {'per_dim_scale': (24, 64)}, 'post': {'w': (24, 2048, 32, 64)}}}}}}}}}, opt_states=[{'no_prefix': ({'count': ()}, {'count': ()}, {'count': (), 'm': {'params': {'lm': {'final_ln': {'bias': (2048,), 'scale': (2048,)}, 'position_emb': {'emb_var': (2048, 2048)}, 'softmax': {'logits_ffn': {'bias': {'b': (51200,)}, 'linear': {'w': (2048, 51200)}}}, 'transformer': {'repeat': {'sub': {'x_layers_0': {'ff_layer': {'ffn_layer1': {'bias': {'b': MaskedNode()}, 'linear': {'w': MaskedNode()}}, 'ffn_layer2': {'bias': {'b': MaskedNode()}, 'linear': {'w': MaskedNode()}}, 'layer_norm': {'bias': MaskedNode(), 'scale': MaskedNode()}}, 'layer_norm': {'bias': MaskedNode(), 'scale': MaskedNode()}, 'self_attention': {'combined_qkv': {'w': MaskedNode()}, 'per_dim_scale': {'per_dim_scale': MaskedNode()}, 'post': {'w': MaskedNode()}}}}}}}}}, 'v': {'params': {'lm': {'final_ln': {'bias': (2048,), 'scale': (2048,)}, 'position_emb': {'emb_var': (2048, 2048)}, 'softmax': {'logits_ffn': {'bias': {'b': (51200,)}, 'linear': {'w': (2048, 51200)}}}, 'transformer': {'repeat': {'sub': {'x_layers_0': {'ff_layer': {'ffn_layer1': {'bias': {'b': MaskedNode()}, 'linear': {'w': MaskedNode()}}, 'ffn_layer2': {'bias': {'b': MaskedNode()}, 'linear': {'w': MaskedNode()}}, 'layer_norm': {'bias': MaskedNode(), 'scale': MaskedNode()}}, 'layer_norm': {'bias': MaskedNode(), 'scale': MaskedNode()}, 'self_attention': {'combined_qkv': {'w': MaskedNode()}, 'per_dim_scale': {'per_dim_scale': MaskedNode()}, 'post': {'w': MaskedNode()}}}}}}}}}}, {'count': ()}), 'p#24#i-1': ({'count': (24,)}, {'count': (24,)}, {'count': (24,), 'm': {'params': {'lm': {'final_ln': {'bias': MaskedNode(), 'scale': MaskedNode()}, 'position_emb': {'emb_var': MaskedNode()}, 'softmax': {'logits_ffn': {'bias': {'b': MaskedNode()}, 'linear': {'w': MaskedNode()}}}, 'transformer': {'repeat': {'sub': {'x_layers_0': {'ff_layer': {'ffn_layer1': {'bias': {'b': (24, 8192)}, 'linear': {'w': (24, 2048, 8192)}}, 'ffn_layer2': {'bias': {'b': (24, 2048)}, 'linear': {'w': (24, 8192, 2048)}}, 'layer_norm': {'bias': (24, 2048), 'scale': (24, 2048)}}, 'layer_norm': {'bias': (24, 2048), 'scale': (24, 2048)}, 'self_attention': {'combined_qkv': {'w': (24, 3, 2048, 32, 64)}, 'per_dim_scale': {'per_dim_scale': (24, 64)}, 'post': {'w': (24, 2048, 32, 64)}}}}}}}}}, 'v': {'params': {'lm': {'final_ln': {'bias': MaskedNode(), 'scale': MaskedNode()}, 'position_emb': {'emb_var': MaskedNode()}, 'softmax': {'logits_ffn': {'bias': {'b': MaskedNode()}, 'linear': {'w': MaskedNode()}}}, 'transformer': {'repeat': {'sub': {'x_layers_0': {'ff_layer': {'ffn_layer1': {'bias': {'b': (24, 8192)}, 'linear': {'w': (24, 2048, 8192)}}, 'ffn_layer2': {'bias': {'b': (24, 2048)}, 'linear': {'w': (24, 8192, 2048)}}, 'layer_norm': {'bias': (24, 2048), 'scale': (24, 2048)}}, 'layer_norm': {'bias': (24, 2048), 'scale': (24, 2048)}, 'self_attention': {'combined_qkv': {'w': (24, 3, 2048, 32, 64)}, 'per_dim_scale': {'per_dim_scale': (24, 64)}, 'post': {'w': (24, 2048, 32, 64)}}}}}}}}}}, {'count': (24,)})}])
I0605 17:37:47.209682 139756486557120 partitioning.py:637] replicated train state shapes: TrainState(step=(1,), mdl_vars={'params': {'lm': {'final_ln': {'bias': (1, 2048), 'scale': (1, 2048)}, 'position_emb': {'emb_var': (1, 2048, 2048)}, 'softmax': {'logits_ffn': {'bias': {'b': (1, 51200)}, 'linear': {'w': (1, 2048, 51200)}}}, 'transformer': {'repeat': {'sub': {'x_layers_0': {'ff_layer': {'ffn_layer1': {'bias': {'b': (1, 24, 8192)}, 'linear': {'w': (1, 24, 2048, 8192)}}, 'ffn_layer2': {'bias': {'b': (1, 24, 2048)}, 'linear': {'w': (1, 24, 8192, 2048)}}, 'layer_norm': {'bias': (1, 24, 2048), 'scale': (1, 24, 2048)}}, 'layer_norm': {'bias': (1, 24, 2048), 'scale': (1, 24, 2048)}, 'self_attention': {'combined_qkv': {'w': (1, 24, 3, 2048, 32, 64)}, 'per_dim_scale': {'per_dim_scale': (1, 24, 64)}, 'post': {'w': (1, 24, 2048, 32, 64)}}}}}}}}}, opt_states=[{'no_prefix': ({'count': (1,)}, {'count': (1,)}, {'count': (1,), 'm': {'params': {'lm': {'final_ln': {'bias': (1, 2048), 'scale': (1, 2048)}, 'position_emb': {'emb_var': (1, 2048, 2048)}, 'softmax': {'logits_ffn': {'bias': {'b': (1, 51200)}, 'linear': {'w': (1, 2048, 51200)}}}, 'transformer': {'repeat': {'sub': {'x_layers_0': {'ff_layer': {'ffn_layer1': {'bias': {'b': MaskedNode()}, 'linear': {'w': MaskedNode()}}, 'ffn_layer2': {'bias': {'b': MaskedNode()}, 'linear': {'w': MaskedNode()}}, 'layer_norm': {'bias': MaskedNode(), 'scale': MaskedNode()}}, 'layer_norm': {'bias': MaskedNode(), 'scale': MaskedNode()}, 'self_attention': {'combined_qkv': {'w': MaskedNode()}, 'per_dim_scale': {'per_dim_scale': MaskedNode()}, 'post': {'w': MaskedNode()}}}}}}}}}, 'v': {'params': {'lm': {'final_ln': {'bias': (1, 2048), 'scale': (1, 2048)}, 'position_emb': {'emb_var': (1, 2048, 2048)}, 'softmax': {'logits_ffn': {'bias': {'b': (1, 51200)}, 'linear': {'w': (1, 2048, 51200)}}}, 'transformer': {'repeat': {'sub': {'x_layers_0': {'ff_layer': {'ffn_layer1': {'bias': {'b': MaskedNode()}, 'linear': {'w': MaskedNode()}}, 'ffn_layer2': {'bias': {'b': MaskedNode()}, 'linear': {'w': MaskedNode()}}, 'layer_norm': {'bias': MaskedNode(), 'scale': MaskedNode()}}, 'layer_norm': {'bias': MaskedNode(), 'scale': MaskedNode()}, 'self_attention': {'combined_qkv': {'w': MaskedNode()}, 'per_dim_scale': {'per_dim_scale': MaskedNode()}, 'post': {'w': MaskedNode()}}}}}}}}}}, {'count': (1,)}), 'p#24#i-1': ({'count': (1, 24)}, {'count': (1, 24)}, {'count': (1, 24), 'm': {'params': {'lm': {'final_ln': {'bias': MaskedNode(), 'scale': MaskedNode()}, 'position_emb': {'emb_var': MaskedNode()}, 'softmax': {'logits_ffn': {'bias': {'b': MaskedNode()}, 'linear': {'w': MaskedNode()}}}, 'transformer': {'repeat': {'sub': {'x_layers_0': {'ff_layer': {'ffn_layer1': {'bias': {'b': (1, 24, 8192)}, 'linear': {'w': (1, 24, 2048, 8192)}}, 'ffn_layer2': {'bias': {'b': (1, 24, 2048)}, 'linear': {'w': (1, 24, 8192, 2048)}}, 'layer_norm': {'bias': (1, 24, 2048), 'scale': (1, 24, 2048)}}, 'layer_norm': {'bias': (1, 24, 2048), 'scale': (1, 24, 2048)}, 'self_attention': {'combined_qkv': {'w': (1, 24, 3, 2048, 32, 64)}, 'per_dim_scale': {'per_dim_scale': (1, 24, 64)}, 'post': {'w': (1, 24, 2048, 32, 64)}}}}}}}}}, 'v': {'params': {'lm': {'final_ln': {'bias': MaskedNode(), 'scale': MaskedNode()}, 'position_emb': {'emb_var': MaskedNode()}, 'softmax': {'logits_ffn': {'bias': {'b': MaskedNode()}, 'linear': {'w': MaskedNode()}}}, 'transformer': {'repeat': {'sub': {'x_layers_0': {'ff_layer': {'ffn_layer1': {'bias': {'b': (1, 24, 8192)}, 'linear': {'w': (1, 24, 2048, 8192)}}, 'ffn_layer2': {'bias': {'b': (1, 24, 2048)}, 'linear': {'w': (1, 24, 8192, 2048)}}, 'layer_norm': {'bias': (1, 24, 2048), 'scale': (1, 24, 2048)}}, 'layer_norm': {'bias': (1, 24, 2048), 'scale': (1, 24, 2048)}, 'self_attention': {'combined_qkv': {'w': (1, 24, 3, 2048, 32, 64)}, 'per_dim_scale': {'per_dim_scale': (1, 24, 64)}, 'post': {'w': (1, 24, 2048, 32, 64)}}}}}}}}}}, {'count': (1, 24)})}])
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000c53080 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:94}, signal={0x6060001682c0:95} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:95}, signal={0x6060001682c0:96} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000882020 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000c3f200 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000882020, semaphore=0x6060001682c0, value=96 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000c3f200, semaphore=0x6060001682c0, value=96 (OK)
W0605 17:37:47.211523 139756486557120 pxla.py:1882] Compiling _threefry_fold_in for with global shapes and types [ShapedArray(uint32[2]), ShapedArray(uint32[])]. Argument mapping: (GSPMDSharding({replicated}), GSPMDSharding({replicated})).
W0605 17:37:47.269204 139756486557120 dispatch.py:272] Finished jaxpr to MLIR module conversion jit(_threefry_fold_in) in 0.05749940872192383 sec
W0605 17:37:48.786752 139756486557120 dispatch.py:272] Finished XLA compilation of jit(_threefry_fold_in) in 1.5170984268188477 sec
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600042b8a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600042b8a0, semaphore=0x6060001683e0, value=69 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=70, fence=0x60400042c5d0 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600042b8a0, from_fence=0x606000328ee0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060003292a0, semaphore=0x6060001683e0, value=70 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600042b8a0, from_fence=0x606000882020 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000c3f200, semaphore=0x6060001683e0, value=70 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0008228a0, f=0, wait_fence=0x60600042b8a0 {0x6060001683e0:69, 0x6060001682c0:96}, signal_fence=0x60400042c5d0 {0x6060001683e0:70} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060006155c0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600014e580 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006155c0, semaphore=0x6060001683e0, value=70 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600014e580, semaphore=0x6060001683e0, value=70 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000c54580, wait={0x6060001682c0:96, 0x6060001683e0:70}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x608001c844a0, wait={0x6060001683e0:70}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc14180 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:70}, signal={} (OK)
I0605 17:37:48.789292 139756486557120 partitioning.py:647] root prng key: [3199903509 2250625448]
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x608001c84420, wait={0x6060001683e0:9}, signal={} (OK)
W0605 17:37:48.797018 139756486557120 dispatch.py:272] Finished tracing + transforming ravel for pjit in 0.00022339820861816406 sec
W0605 17:37:48.797891 139756486557120 dispatch.py:272] Finished tracing + transforming threefry_2x32 for pjit in 0.0017151832580566406 sec
W0605 17:37:48.798967 139756486557120 dispatch.py:272] Finished tracing + transforming _threefry_split_original for pjit in 0.003200054168701172 sec
W0605 17:37:48.799690 139756486557120 pxla.py:1882] Compiling _threefry_split_original for with global shapes and types [ShapedArray(uint32[2])]. Argument mapping: (GSPMDSharding({replicated}),).
W0605 17:37:48.992391 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003879070281982422 sec
W0605 17:37:48.993552 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003216266632080078 sec
W0605 17:37:48.994482 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0003116130828857422 sec
W0605 17:37:48.995203 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0003108978271484375 sec
W0605 17:37:49.045017 139756486557120 dispatch.py:272] Finished jaxpr to MLIR module conversion jit(_threefry_split_original) in 0.24518394470214844 sec
W0605 17:37:51.420535 139756486557120 dispatch.py:272] Finished XLA compilation of jit(_threefry_split_original) in 2.3751134872436523 sec
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000a76ba0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000a76ba0, semaphore=0x6060001683e0, value=70 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=71, fence=0x604000dd4f50 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x606000a76ba0, from_fence=0x6060006155c0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600014e580, semaphore=0x6060001683e0, value=71 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b000a10140, f=0, wait_fence=0x606000a76ba0 {0x6060001683e0:70}, signal_fence=0x604000dd4f50 {0x6060001683e0:71} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060006b5160 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000b37980 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006b5160, semaphore=0x6060001683e0, value=71 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000b37980, semaphore=0x6060001683e0, value=71 (OK)
W0605 17:37:51.424489 139756486557120 dispatch.py:272] Finished tracing + transforming _unstack for pjit in 0.001565694808959961 sec
W0605 17:37:51.425680 139756486557120 pxla.py:1882] Compiling _unstack for with global shapes and types [ShapedArray(uint32[3,2])]. Argument mapping: (GSPMDSharding({replicated}),).
W0605 17:37:51.432903 139756486557120 dispatch.py:272] Finished jaxpr to MLIR module conversion jit(_unstack) in 0.006999969482421875 sec
W0605 17:37:51.687622 139756486557120 dispatch.py:272] Finished XLA compilation of jit(_unstack) in 0.25420069694519043 sec
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060000a7300 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060000a7300, semaphore=0x6060001683e0, value=71 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=72, fence=0x60400156c090 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060000a7300, from_fence=0x6060006b5160 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000b37980, semaphore=0x6060001683e0, value=72 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b000842820, f=0, wait_fence=0x6060000a7300 {0x6060001683e0:71}, signal_fence=0x60400156c090 {0x6060001683e0:72} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000af6700 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060003b2ca0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000af6700, semaphore=0x6060001683e0, value=72 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060003b2ca0, semaphore=0x6060001683e0, value=72 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000c003e0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060009390e0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000c003e0, semaphore=0x6060001683e0, value=72 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060009390e0, semaphore=0x6060001683e0, value=72 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600037fd00 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600038e700 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600037fd00, semaphore=0x6060001683e0, value=72 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600038e700, semaphore=0x6060001683e0, value=72 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000387e80, wait={0x6060001683e0:72}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc14520 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:72}, signal={} (OK)
I0605 17:37:51.688758 139756486557120 executors.py:260] train prng seed: [3373580220 3771856083]
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc14520 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:72}, signal={} (OK)
I0605 17:37:51.689568 139756486557120 executors.py:261] eval prng seed: [3893388808 331134876]
W0605 17:37:51.691859 139756486557120 dispatch.py:272] Finished tracing + transforming _threefry_split_original for pjit in 0.0013885498046875 sec
W0605 17:37:51.692612 139756486557120 pxla.py:1882] Compiling _threefry_split_original for with global shapes and types [ShapedArray(uint32[2])]. Argument mapping: (GSPMDSharding({replicated}),).
W0605 17:37:51.749565 139756486557120 dispatch.py:272] Finished jaxpr to MLIR module conversion jit(_threefry_split_original) in 0.05680084228515625 sec
W0605 17:37:53.214284 139756486557120 dispatch.py:272] Finished XLA compilation of jit(_threefry_split_original) in 1.4642627239227295 sec
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000c8c3c0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000c8c3c0, semaphore=0x6060001683e0, value=72 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=73, fence=0x604001124f50 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x606000c8c3c0, from_fence=0x606000c003e0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060009390e0, semaphore=0x6060001683e0, value=73 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0005ea050, f=0, wait_fence=0x606000c8c3c0 {0x6060001683e0:72}, signal_fence=0x604001124f50 {0x6060001683e0:73} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000359e40 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060008a3080 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000359e40, semaphore=0x6060001683e0, value=73 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060008a3080, semaphore=0x6060001683e0, value=73 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x608002259820, wait={0x6060001683e0:73}, signal={} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060008c2160 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060008c2160, semaphore=0x6060001683e0, value=73 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=74, fence=0x604001125cd0 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060008c2160, from_fence=0x60600037fd00 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600038e700, semaphore=0x6060001683e0, value=74 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0005ea050, f=0, wait_fence=0x6060008c2160 {0x6060001683e0:73}, signal_fence=0x604001125cd0 {0x6060001683e0:74} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060008a31a0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000abf920 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060008a31a0, semaphore=0x6060001683e0, value=74 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000abf920, semaphore=0x6060001683e0, value=74 (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60800397ffa0, wait={0x6060001683e0:74}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c00029f140, wait={0x6060001683e0:71}, signal={} (OK)
I0605 17:37:53.216853 139756486557120 executors.py:295] Starting executor.
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc160c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:94}, signal={} (OK)
I0605 17:37:53.217641 139756486557120 executors.py:454] Model initial global_step=0
I0605 17:37:53.217703 139756486557120 executors.py:461] [PAX STATUS]: Starting training loop.
I0605 17:37:53.217766 139756486557120 programs.py:210] [PAX STATUS]: Setting up BaseTrainProgram.
I0605 17:37:53.217862 139756486557120 summary_utils.py:281] Opening SummaryWriter `log_NVIDIA1_3BPmap/summaries/train`...
I0605 17:37:53.219248 139756486557120 summary_utils.py:281] Opening SummaryWriter `log_NVIDIA1_3BPmap/summaries/eval_train`...
I0605 17:37:53.226810 139756486557120 py_utils.py:338] Starting sync_global_devices Start training loop from step: 0 across 1 devices globally
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c00084f880 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:96}, signal={0x6060001682c0:97} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:97}, signal={0x6060001682c0:98} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600052c760 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000189920 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600052c760, semaphore=0x6060001682c0, value=98 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000189920, semaphore=0x6060001682c0, value=98 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000a11320 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000a11320, semaphore=0x6060001683e0, value=74 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=75, fence=0x60400056c590 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x606000a11320, from_fence=0x60600052c760 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000189920, semaphore=0x6060001683e0, value=75 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00038c290, f=0, wait_fence=0x606000a11320 {0x6060001683e0:74, 0x6060001682c0:98}, signal_fence=0x60400056c590 {0x6060001683e0:75} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060004e58a0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060008859e0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060004e58a0, semaphore=0x6060001683e0, value=75 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060008859e0, semaphore=0x6060001683e0, value=75 (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc149e0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:75}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c00084f7c0, wait={0x6060001682c0:98, 0x6060001683e0:75}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c00084f7c0, wait={0x6060001683e0:75}, signal={} (OK)
I0605 17:37:53.229556 139756486557120 py_utils.py:341] Finished sync_global_devices Start training loop from step: 0 across 1 devices globally
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000b2a080 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:98}, signal={0x6060001682c0:99} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:99}, signal={0x6060001682c0:100} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060003aa000 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060003a9ee0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060003aa000, semaphore=0x6060001682c0, value=100 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060003a9ee0, semaphore=0x6060001682c0, value=100 (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c0006aa380 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:100}, signal={0x6060001682c0:101} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:101}, signal={0x6060001682c0:102} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600047ea60 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060008a2780 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600047ea60, semaphore=0x6060001682c0, value=102 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060008a2780, semaphore=0x6060001682c0, value=102 (OK)
W0605 17:37:53.425011 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.0005414485931396484 sec
W0605 17:37:53.425846 139756486557120 dispatch.py:272] Finished tracing + transforming _psum for pjit in 0.0017685890197753906 sec
W0605 17:37:53.426665 139756486557120 pxla.py:1882] Compiling _psum for with global shapes and types [ShapedArray(int32[1]), ShapedArray(int32[1])]. Argument mapping: (GSPMDSharding({maximal device=0}), GSPMDSharding({maximal device=0})).
W0605 17:37:53.432121 139756486557120 dispatch.py:272] Finished jaxpr to MLIR module conversion jit(_psum) in 0.0052835941314697266 sec
W0605 17:37:53.736919 139756486557120 dispatch.py:272] Finished XLA compilation of jit(_psum) in 0.3044319152832031 sec
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060002cf660 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060002cf660, semaphore=0x6060001683e0, value=75 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=76, fence=0x6040016b70d0 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060002cf660, from_fence=0x6060003aa000 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060003a9ee0, semaphore=0x6060001683e0, value=76 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060002cf660, from_fence=0x60600047ea60 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060008a2780, semaphore=0x6060001683e0, value=76 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0004abaa0, f=0, wait_fence=0x6060002cf660 {0x6060001683e0:75, 0x6060001682c0:102}, signal_fence=0x6040016b70d0 {0x6060001683e0:76} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000ad7740 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000c95720 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000ad7740, semaphore=0x6060001683e0, value=76 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000c95720, semaphore=0x6060001683e0, value=76 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600031bf20 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060004fb560 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600031bf20, semaphore=0x6060001683e0, value=76 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060004fb560, semaphore=0x6060001683e0, value=76 (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc13bc0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:76}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc13c00 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:76}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000912dc0, wait={0x6060001682c0:102, 0x6060001683e0:76}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000ca0d80, wait={0x6060001682c0:100, 0x6060001683e0:76}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000912dc0, wait={0x6060001683e0:76}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000ca0d80, wait={0x6060001683e0:76}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000285640 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:102}, signal={0x6060001682c0:103} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:103}, signal={0x6060001682c0:104} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000a3b380 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060005b2260 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000a3b380, semaphore=0x6060001682c0, value=104 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060005b2260, semaphore=0x6060001682c0, value=104 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060004084a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060004084a0, semaphore=0x6060001683e0, value=76 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=77, fence=0x604000e0f4d0 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060004084a0, from_fence=0x606000a3b380 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060005b2260, semaphore=0x6060001683e0, value=77 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00038c290, f=0, wait_fence=0x6060004084a0 {0x6060001683e0:76, 0x6060001682c0:104}, signal_fence=0x604000e0f4d0 {0x6060001683e0:77} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000845b40 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000b04c20 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000845b40, semaphore=0x6060001683e0, value=77 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000b04c20, semaphore=0x6060001683e0, value=77 (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc139e0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:77}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000285400, wait={0x6060001682c0:104, 0x6060001683e0:77}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000285400, wait={0x6060001683e0:77}, signal={} (OK)
I0605 17:37:53.741610 139756486557120 checkpointer.py:67] Saving item to log_NVIDIA1_3BPmap/checkpoints/checkpoint_0.orbax-checkpoint-tmp-1685986673422257/state.
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c00032d880 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:104}, signal={0x6060001682c0:105} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:105}, signal={0x6060001682c0:106} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060006963e0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000a81220 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006963e0, semaphore=0x6060001682c0, value=106 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000a81220, semaphore=0x6060001682c0, value=106 (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000aab3c0 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:106}, signal={0x6060001682c0:107} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:107}, signal={0x6060001682c0:108} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060005cf000 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060008a9e00 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060005cf000, semaphore=0x6060001682c0, value=108 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060008a9e00, semaphore=0x6060001682c0, value=108 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000b9d800 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000b9d800, semaphore=0x6060001683e0, value=77 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=78, fence=0x6040013f2a90 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x606000b9d800, from_fence=0x6060006963e0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000a81220, semaphore=0x6060001683e0, value=78 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x606000b9d800, from_fence=0x6060005cf000 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060008a9e00, semaphore=0x6060001683e0, value=78 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0004abaa0, f=0, wait_fence=0x606000b9d800 {0x6060001683e0:77, 0x6060001682c0:108}, signal_fence=0x6040013f2a90 {0x6060001683e0:78} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000603080 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000603560 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000603080, semaphore=0x6060001683e0, value=78 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000603560, semaphore=0x6060001683e0, value=78 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060009b2d60 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060005f2c40 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060009b2d60, semaphore=0x6060001683e0, value=78 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060005f2c40, semaphore=0x6060001683e0, value=78 (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc13d40 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:78}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc13d80 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:78}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000116500, wait={0x6060001682c0:108, 0x6060001683e0:78}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c0003b5240, wait={0x6060001682c0:106, 0x6060001683e0:78}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000116500, wait={0x6060001683e0:78}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c0003b5240, wait={0x6060001683e0:78}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000843040 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:108}, signal={0x6060001682c0:109} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:109}, signal={0x6060001682c0:110} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000a94000 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060007b61a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000a94000, semaphore=0x6060001682c0, value=110 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060007b61a0, semaphore=0x6060001682c0, value=110 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600092a740 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600092a740, semaphore=0x6060001683e0, value=78 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=79, fence=0x6040014e0250 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600092a740, from_fence=0x606000a94000 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060007b61a0, semaphore=0x6060001683e0, value=79 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00038c290, f=0, wait_fence=0x60600092a740 {0x6060001683e0:78, 0x6060001682c0:110}, signal_fence=0x6040014e0250 {0x6060001683e0:79} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060003d4060 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000910dc0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060003d4060, semaphore=0x6060001683e0, value=79 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000910dc0, semaphore=0x6060001683e0, value=79 (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc13b60 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:79}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000842ec0, wait={0x6060001682c0:110, 0x6060001683e0:79}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000842ec0, wait={0x6060001683e0:79}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:94}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:11}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:11}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:11}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:11}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:11}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:11}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:11}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:11}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:11}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:11}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:11}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:11}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:11}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:11}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:11}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:11}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:14}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:16}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:38}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:12}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:14}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:16}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:18}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:20}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:13}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:15}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:17}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:19}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:21}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:40}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:44}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:45}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:46}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:47}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:48}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:49}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:50}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:51}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:52}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:53}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:54}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:55}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:56}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:57}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:58}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:59}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:60}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:61}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:62}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:63}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:64}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:65}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:66}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:67}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:68}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc148c0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:69}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c0005a61c0 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:110}, signal={0x6060001682c0:111} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:111}, signal={0x6060001682c0:112} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060004a1260 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600049cee0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060004a1260, semaphore=0x6060001682c0, value=112 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600049cee0, semaphore=0x6060001682c0, value=112 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600008f2a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600008f2a0, semaphore=0x6060001683e0, value=79 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=80, fence=0x6040014e0fd0 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600008f2a0, from_fence=0x6060004a1260 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600049cee0, semaphore=0x6060001683e0, value=80 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00038c290, f=0, wait_fence=0x60600008f2a0 {0x6060001683e0:79, 0x6060001682c0:112}, signal_fence=0x6040014e0fd0 {0x6060001683e0:80} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000715580 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060008ae960 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000715580, semaphore=0x6060001683e0, value=80 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060008ae960, semaphore=0x6060001683e0, value=80 (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc12a60 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:80}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000840040, wait={0x6060001682c0:112, 0x6060001683e0:80}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000840040, wait={0x6060001683e0:80}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c0008c2180 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:112}, signal={0x6060001682c0:113} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:113}, signal={0x6060001682c0:114} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060001ad4a0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060004fb020 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060001ad4a0, semaphore=0x6060001682c0, value=114 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060004fb020, semaphore=0x6060001682c0, value=114 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000c65fc0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000c65fc0, semaphore=0x6060001683e0, value=80 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=81, fence=0x6040014e0fd0 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x606000c65fc0, from_fence=0x6060001ad4a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060004fb020, semaphore=0x6060001683e0, value=81 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00038c290, f=0, wait_fence=0x606000c65fc0 {0x6060001683e0:80, 0x6060001682c0:114}, signal_fence=0x6040014e0fd0 {0x6060001683e0:81} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000b320a0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000b32760 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000b320a0, semaphore=0x6060001683e0, value=81 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000b32760, semaphore=0x6060001683e0, value=81 (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc13ac0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:81}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c0008c2240, wait={0x6060001682c0:114, 0x6060001683e0:81}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c0008c2240, wait={0x6060001683e0:81}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000887140 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:114}, signal={0x6060001682c0:115} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:115}, signal={0x6060001682c0:116} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000971c00 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000971f00 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000971c00, semaphore=0x6060001682c0, value=116 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000971f00, semaphore=0x6060001682c0, value=116 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000673be0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000673be0, semaphore=0x6060001683e0, value=81 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=82, fence=0x6040003e5f10 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x606000673be0, from_fence=0x606000971c00 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000971f00, semaphore=0x6060001683e0, value=82 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00038c290, f=0, wait_fence=0x606000673be0 {0x6060001683e0:81, 0x6060001682c0:116}, signal_fence=0x6040003e5f10 {0x6060001683e0:82} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060004c0b20 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000604880 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060004c0b20, semaphore=0x6060001683e0, value=82 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000604880, semaphore=0x6060001683e0, value=82 (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc13d40 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:82}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000887080, wait={0x6060001682c0:116, 0x6060001683e0:82}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000887080, wait={0x6060001683e0:82}, signal={} (OK)
I0605 17:38:48.673480 139756486557120 utils.py:465] Renaming log_NVIDIA1_3BPmap/checkpoints/checkpoint_0.orbax-checkpoint-tmp-1685986673422257/state.orbax-checkpoint-tmp-1685986673741760 to log_NVIDIA1_3BPmap/checkpoints/checkpoint_0.orbax-checkpoint-tmp-1685986673422257/state
I0605 17:38:48.673784 139756486557120 utils.py:509] Finished saving checkpoint to `log_NVIDIA1_3BPmap/checkpoints/checkpoint_0.orbax-checkpoint-tmp-1685986673422257/state`.
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000b785c0 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:116}, signal={0x6060001682c0:117} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:117}, signal={0x6060001682c0:118} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600096d6a0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000875f00 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600096d6a0, semaphore=0x6060001682c0, value=118 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000875f00, semaphore=0x6060001682c0, value=118 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060006c6980 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006c6980, semaphore=0x6060001683e0, value=82 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=83, fence=0x60400042e250 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060006c6980, from_fence=0x60600096d6a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000875f00, semaphore=0x6060001683e0, value=83 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00038c290, f=0, wait_fence=0x6060006c6980 {0x6060001683e0:82, 0x6060001682c0:118}, signal_fence=0x60400042e250 {0x6060001683e0:83} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600062b280 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060007c6460 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600062b280, semaphore=0x6060001683e0, value=83 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060007c6460, semaphore=0x6060001683e0, value=83 (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc13d40 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:83}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000390580, wait={0x6060001682c0:118, 0x6060001683e0:83}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000390580, wait={0x6060001683e0:83}, signal={} (OK)
I0605 17:38:48.676459 139756486557120 checkpointer.py:67] Saving item to log_NVIDIA1_3BPmap/checkpoints/checkpoint_0.orbax-checkpoint-tmp-1685986673422257/metadata.
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c00054f100 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:118}, signal={0x6060001682c0:119} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:119}, signal={0x6060001682c0:120} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060004b38c0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000375680 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060004b38c0, semaphore=0x6060001682c0, value=120 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000375680, semaphore=0x6060001682c0, value=120 (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c0008cff80 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:120}, signal={0x6060001682c0:121} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:121}, signal={0x6060001682c0:122} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000256ca0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600065abc0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000256ca0, semaphore=0x6060001682c0, value=122 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600065abc0, semaphore=0x6060001682c0, value=122 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600059b460 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600059b460, semaphore=0x6060001683e0, value=83 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=84, fence=0x604000858590 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600059b460, from_fence=0x6060004b38c0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000375680, semaphore=0x6060001683e0, value=84 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600059b460, from_fence=0x606000256ca0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600065abc0, semaphore=0x6060001683e0, value=84 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b0004abaa0, f=0, wait_fence=0x60600059b460 {0x6060001683e0:83, 0x6060001682c0:122}, signal_fence=0x604000858590 {0x6060001683e0:84} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000d3b0e0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060002890a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000d3b0e0, semaphore=0x6060001683e0, value=84 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060002890a0, semaphore=0x6060001683e0, value=84 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000a588a0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000788960 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000a588a0, semaphore=0x6060001683e0, value=84 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000788960, semaphore=0x6060001683e0, value=84 (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc13d80 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:84}, signal={} (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc13dc0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:84}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c0008cfec0, wait={0x6060001682c0:122, 0x6060001683e0:84}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c0007d9140, wait={0x6060001682c0:120, 0x6060001683e0:84}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c0008cfec0, wait={0x6060001683e0:84}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c0007d9140, wait={0x6060001683e0:84}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c00025b640 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:122}, signal={0x6060001682c0:123} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:123}, signal={0x6060001682c0:124} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060005eb320 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060004427c0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060005eb320, semaphore=0x6060001682c0, value=124 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060004427c0, semaphore=0x6060001682c0, value=124 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060005237c0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060005237c0, semaphore=0x6060001683e0, value=84 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=85, fence=0x6040005b3310 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060005237c0, from_fence=0x6060005eb320 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060004427c0, semaphore=0x6060001683e0, value=85 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00038c290, f=0, wait_fence=0x6060005237c0 {0x6060001683e0:84, 0x6060001682c0:124}, signal_fence=0x6040005b3310 {0x6060001683e0:85} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060003a0820 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000b66720 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060003a0820, semaphore=0x6060001683e0, value=85 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000b66720, semaphore=0x6060001683e0, value=85 (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc13bc0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:85}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c00025bb80, wait={0x6060001682c0:124, 0x6060001683e0:85}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c00025bb80, wait={0x6060001683e0:85}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c00098fe00 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:124}, signal={0x6060001682c0:125} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:125}, signal={0x6060001682c0:126} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600070c5e0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060006cc260 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600070c5e0, semaphore=0x6060001682c0, value=126 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006cc260, semaphore=0x6060001682c0, value=126 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600078b900 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600078b900, semaphore=0x6060001683e0, value=85 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=86, fence=0x604001016790 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600078b900, from_fence=0x60600070c5e0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006cc260, semaphore=0x6060001683e0, value=86 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00038c290, f=0, wait_fence=0x60600078b900 {0x6060001683e0:85, 0x6060001682c0:126}, signal_fence=0x604001016790 {0x6060001683e0:86} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000736e80 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000536780 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000736e80, semaphore=0x6060001683e0, value=86 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000536780, semaphore=0x6060001683e0, value=86 (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc13b60 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:86}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c00098fec0, wait={0x6060001682c0:126, 0x6060001683e0:86}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c00098fec0, wait={0x6060001683e0:86}, signal={} (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000769780 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:126}, signal={0x6060001682c0:127} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:127}, signal={0x6060001682c0:128} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060007da560 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000cf62c0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060007da560, semaphore=0x6060001682c0, value=128 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000cf62c0, semaphore=0x6060001682c0, value=128 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600059a1a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600059a1a0, semaphore=0x6060001683e0, value=86 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=87, fence=0x604001515d10 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600059a1a0, from_fence=0x6060007da560 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000cf62c0, semaphore=0x6060001683e0, value=87 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00038c290, f=0, wait_fence=0x60600059a1a0 {0x6060001683e0:86, 0x6060001682c0:128}, signal_fence=0x604001515d10 {0x6060001683e0:87} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060007b2e40 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060007c4420 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060007b2e40, semaphore=0x6060001683e0, value=87 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060007c4420, semaphore=0x6060001683e0, value=87 (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc13d80 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:87}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000769600, wait={0x6060001682c0:128, 0x6060001683e0:87}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c000769600, wait={0x6060001683e0:87}, signal={} (OK)
I0605 17:38:48.687450 139756486557120 utils.py:465] Renaming log_NVIDIA1_3BPmap/checkpoints/checkpoint_0.orbax-checkpoint-tmp-1685986673422257/metadata.orbax-checkpoint-tmp-1685986728676589 to log_NVIDIA1_3BPmap/checkpoints/checkpoint_0.orbax-checkpoint-tmp-1685986673422257/metadata
I0605 17:38:48.687598 139756486557120 utils.py:509] Finished saving checkpoint to `log_NVIDIA1_3BPmap/checkpoints/checkpoint_0.orbax-checkpoint-tmp-1685986673422257/metadata`.
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000b5b340 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:128}, signal={0x6060001682c0:129} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:129}, signal={0x6060001682c0:130} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000330e60 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000a94000 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000330e60, semaphore=0x6060001682c0, value=130 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000a94000, semaphore=0x6060001682c0, value=130 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060008dd7c0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060008dd7c0, semaphore=0x6060001683e0, value=87 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=88, fence=0x604000f7d010 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060008dd7c0, from_fence=0x606000330e60 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000a94000, semaphore=0x6060001683e0, value=88 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00038c290, f=0, wait_fence=0x6060008dd7c0 {0x6060001683e0:87, 0x6060001682c0:130}, signal_fence=0x604000f7d010 {0x6060001683e0:88} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600087c7a0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000ccc140 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600087c7a0, semaphore=0x6060001683e0, value=88 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000ccc140, semaphore=0x6060001683e0, value=88 (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc13d80 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:88}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c0002a1b40, wait={0x6060001682c0:130, 0x6060001683e0:88}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c0002a1b40, wait={0x6060001683e0:88}, signal={} (OK)
I0605 17:38:48.689893 139756486557120 utils.py:465] Renaming log_NVIDIA1_3BPmap/checkpoints/checkpoint_0.orbax-checkpoint-tmp-1685986673422257 to log_NVIDIA1_3BPmap/checkpoints/checkpoint_0
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c000282d00 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:130}, signal={0x6060001682c0:131} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:131}, signal={0x6060001682c0:132} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000a0a4e0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000486c20 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000a0a4e0, semaphore=0x6060001682c0, value=132 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000486c20, semaphore=0x6060001682c0, value=132 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060006c8ba0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006c8ba0, semaphore=0x6060001683e0, value=88 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=89, fence=0x604000600650 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x6060006c8ba0, from_fence=0x606000a0a4e0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000486c20, semaphore=0x6060001683e0, value=89 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b00038c290, f=0, wait_fence=0x6060006c8ba0 {0x6060001683e0:88, 0x6060001682c0:132}, signal_fence=0x604000600650 {0x6060001683e0:89} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000b9d8c0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000b9db00 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000b9d8c0, semaphore=0x6060001683e0, value=89 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000b9db00, semaphore=0x6060001683e0, value=89 (OK)
:: IREE INVOKE (hal_allocator_import_buffer): external_buffer=0x7ffdadc13fc0 (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001683e0:89}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c0009bfbc0, wait={0x6060001682c0:132, 0x6060001683e0:89}, signal={} (OK)
:: IREE INVOKE (hal_device_queue_dealloca): device=0x6110011f9800, buffer=0x60c0009bfbc0, wait={0x6060001683e0:89}, signal={} (OK)
W0605 17:38:50.088575 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.00046539306640625 sec
W0605 17:38:50.089799 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.0004601478576660156 sec
W0605 17:38:50.090771 139756486557120 dispatch.py:272] Finished tracing + transforming true_divide for pjit in 0.0003387928009033203 sec
I0605 17:38:50.104525 139756486557120 base_layer.py:632] Creating var /lm/softmax/logits_ffn/linear/w with shape=[2048, 51200], dtype=<class 'jax.numpy.float32'>, init method=gaussian and scale=0.022097086912079608
W0605 17:38:50.106836 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0004742145538330078 sec
W0605 17:38:50.107717 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0003573894500732422 sec
W0605 17:38:50.108556 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003871917724609375 sec
W0605 17:38:50.109402 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00038361549377441406 sec
W0605 17:38:50.110023 139756486557120 dispatch.py:272] Finished tracing + transforming _uniform for pjit in 0.004520893096923828 sec
W0605 17:38:50.110689 139756486557120 dispatch.py:272] Finished tracing + transforming _normal_real for pjit in 0.005500078201293945 sec
W0605 17:38:50.111018 139756486557120 dispatch.py:272] Finished tracing + transforming _normal for pjit in 0.006124973297119141 sec
W0605 17:38:50.111812 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0004010200500488281 sec
W0605 17:38:50.115243 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00039196014404296875 sec
W0605 17:38:50.116189 139756486557120 dispatch.py:272] Finished tracing + transforming _broadcast_arrays for pjit in 0.00017595291137695312 sec
W0605 17:38:50.117643 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00039315223693847656 sec
I0605 17:38:50.120159 139756486557120 base_layer.py:632] Creating var /lm/position_emb/emb_var with shape=[2048, 2048], dtype=<class 'jax.numpy.float32'>, init method=gaussian and scale=0.023
W0605 17:38:50.122282 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.000461578369140625 sec
W0605 17:38:50.123463 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003762245178222656 sec
W0605 17:38:50.124331 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00038623809814453125 sec
W0605 17:38:50.124928 139756486557120 dispatch.py:272] Finished tracing + transforming _uniform for pjit in 0.0038928985595703125 sec
W0605 17:38:50.125589 139756486557120 dispatch.py:272] Finished tracing + transforming _normal_real for pjit in 0.004845380783081055 sec
W0605 17:38:50.125927 139756486557120 dispatch.py:272] Finished tracing + transforming _normal for pjit in 0.005454301834106445 sec
W0605 17:38:50.126752 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00040984153747558594 sec
W0605 17:38:50.131844 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.00037741661071777344 sec
W0605 17:38:50.132391 139756486557120 dispatch.py:272] Finished tracing + transforming _one_hot for pjit in 0.0014386177062988281 sec
W0605 17:38:50.133401 139756486557120 dispatch.py:272] Finished tracing + transforming matmul for pjit in 0.0005931854248046875 sec
W0605 17:38:50.136929 139756486557120 dispatch.py:272] Finished tracing + transforming _einsum for pjit in 0.0006892681121826172 sec
W0605 17:38:50.139818 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003864765167236328 sec
W0605 17:38:50.142343 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0003662109375 sec
W0605 17:38:50.143554 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00036787986755371094 sec
W0605 17:38:50.146310 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003838539123535156 sec
W0605 17:38:50.147240 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003628730773925781 sec
W0605 17:38:50.148331 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0003795623779296875 sec
W0605 17:38:50.226294 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0004551410675048828 sec
W0605 17:38:50.227395 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00037407875061035156 sec
W0605 17:38:50.228480 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.0004832744598388672 sec
W0605 17:38:50.230334 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0004317760467529297 sec
W0605 17:38:50.231549 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.00034928321838378906 sec
W0605 17:38:50.232604 139756486557120 dispatch.py:272] Finished tracing + transforming square for pjit in 0.00027489662170410156 sec
W0605 17:38:50.234390 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.00034546852111816406 sec
W0605 17:38:50.235389 139756486557120 dispatch.py:272] Finished tracing + transforming absolute for pjit in 0.0002620220184326172 sec
W0605 17:38:50.236262 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_max for pjit in 0.00040340423583984375 sec
W0605 17:38:50.237334 139756486557120 dispatch.py:272] Finished tracing + transforming _power for pjit in 0.0004417896270751953 sec
W0605 17:38:50.238777 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.00045418739318847656 sec
W0605 17:38:50.239361 139756486557120 dispatch.py:272] Finished tracing + transforming _mean for pjit in 0.001392364501953125 sec
W0605 17:38:50.240437 139756486557120 dispatch.py:272] Finished tracing + transforming _power for pjit in 0.00044536590576171875 sec
I0605 17:38:50.241358 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/layer_norm/scale with shape=[2048], dtype=<class 'jax.numpy.float32'>, init method=constant and scale=0.0
W0605 17:38:50.242370 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0005207061767578125 sec
I0605 17:38:50.243339 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/layer_norm/bias with shape=[2048], dtype=<class 'jax.numpy.float32'>, init method=constant and scale=0.0
W0605 17:38:50.245457 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.0005881786346435547 sec
W0605 17:38:50.246164 139756486557120 dispatch.py:272] Finished tracing + transforming _mean for pjit in 0.0016107559204101562 sec
W0605 17:38:50.247194 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.00035190582275390625 sec
W0605 17:38:50.249184 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00042319297790527344 sec
W0605 17:38:50.250619 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0004248619079589844 sec
W0605 17:38:50.251869 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0006654262542724609 sec
W0605 17:38:50.253045 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0005459785461425781 sec
I0605 17:38:50.265692 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/self_attention/combined_qkv/w with shape=[3, 2048, 32, 64], dtype=<class 'jax.numpy.float32'>, init method=gaussian and scale=0.023
W0605 17:38:50.268131 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0005102157592773438 sec
W0605 17:38:50.269049 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0003743171691894531 sec
W0605 17:38:50.269925 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00040221214294433594 sec
W0605 17:38:50.270813 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003933906555175781 sec
W0605 17:38:50.271438 139756486557120 dispatch.py:272] Finished tracing + transforming _uniform for pjit in 0.004752159118652344 sec
W0605 17:38:50.272136 139756486557120 dispatch.py:272] Finished tracing + transforming _normal_real for pjit in 0.005776882171630859 sec
W0605 17:38:50.272483 139756486557120 dispatch.py:272] Finished tracing + transforming _normal for pjit in 0.006403684616088867 sec
W0605 17:38:50.273355 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00045037269592285156 sec
W0605 17:38:50.276827 139756486557120 dispatch.py:272] Finished tracing + transforming _einsum for pjit in 0.0007164478302001953 sec
I0605 17:38:50.278644 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/self_attention/per_dim_scale/per_dim_scale with shape=[64], dtype=<class 'jax.numpy.float32'>, init method=constant and scale=0.0
W0605 17:38:50.279554 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00043129920959472656 sec
W0605 17:38:50.282353 139756486557120 dispatch.py:272] Finished tracing + transforming logaddexp for pjit in 0.0012598037719726562 sec
W0605 17:38:50.282917 139756486557120 dispatch.py:272] Finished tracing + transforming softplus for pjit in 0.002315044403076172 sec
W0605 17:38:50.283740 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00038123130798339844 sec
W0605 17:38:50.284803 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0006012916564941406 sec
W0605 17:38:50.286297 139756486557120 dispatch.py:272] Finished tracing + transforming _einsum for pjit in 0.00058746337890625 sec
W0605 17:38:50.287535 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00041556358337402344 sec
W0605 17:38:50.288473 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00038361549377441406 sec
W0605 17:38:50.290702 139756486557120 dispatch.py:272] Finished tracing + transforming _broadcast_arrays for pjit in 0.0014166831970214844 sec
W0605 17:38:50.291424 139756486557120 dispatch.py:272] Finished tracing + transforming _where for pjit in 0.0024847984313964844 sec
W0605 17:38:50.292345 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_max for pjit in 0.0004284381866455078 sec
W0605 17:38:50.293292 139756486557120 dispatch.py:272] Finished tracing + transforming _power for pjit in 0.0004420280456542969 sec
W0605 17:38:50.294705 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.00046181678771972656 sec
W0605 17:38:50.295297 139756486557120 dispatch.py:272] Finished tracing + transforming _mean for pjit in 0.00140380859375 sec
W0605 17:38:50.296829 139756486557120 dispatch.py:272] Finished tracing + transforming true_divide for pjit in 0.00034689903259277344 sec
W0605 17:38:50.297558 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0002791881561279297 sec
W0605 17:38:50.298322 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003561973571777344 sec
W0605 17:38:50.300544 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_max for pjit in 0.0005462169647216797 sec
W0605 17:38:50.301525 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0003635883331298828 sec
W0605 17:38:50.302242 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0002760887145996094 sec
W0605 17:38:50.303315 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.0006041526794433594 sec
W0605 17:38:50.304219 139756486557120 dispatch.py:272] Finished tracing + transforming true_divide for pjit in 0.0003628730773925781 sec
W0605 17:38:50.305894 139756486557120 dispatch.py:272] Finished tracing + transforming _einsum for pjit in 0.0006916522979736328 sec
I0605 17:38:50.306929 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/self_attention/post/w with shape=[2048, 32, 64], dtype=<class 'jax.numpy.float32'>, init method=gaussian and scale=0.023
W0605 17:38:50.309180 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0003814697265625 sec
W0605 17:38:50.310077 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.00038695335388183594 sec
W0605 17:38:50.310939 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003902912139892578 sec
W0605 17:38:50.311874 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0004658699035644531 sec
W0605 17:38:50.312483 139756486557120 dispatch.py:272] Finished tracing + transforming _uniform for pjit in 0.004551410675048828 sec
W0605 17:38:50.313171 139756486557120 dispatch.py:272] Finished tracing + transforming _normal_real for pjit in 0.005565643310546875 sec
W0605 17:38:50.313507 139756486557120 dispatch.py:272] Finished tracing + transforming _normal for pjit in 0.006251096725463867 sec
W0605 17:38:50.314344 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0004253387451171875 sec
W0605 17:38:50.317789 139756486557120 dispatch.py:272] Finished tracing + transforming _einsum for pjit in 0.0006120204925537109 sec
W0605 17:38:50.321920 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00035834312438964844 sec
W0605 17:38:50.322905 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.0004630088806152344 sec
W0605 17:38:50.325140 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0004410743713378906 sec
W0605 17:38:50.326235 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00044727325439453125 sec
W0605 17:38:50.328002 139756486557120 dispatch.py:272] Finished tracing + transforming _power for pjit in 0.0004172325134277344 sec
W0605 17:38:50.328908 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003478527069091797 sec
W0605 17:38:50.330873 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.000431060791015625 sec
W0605 17:38:50.331428 139756486557120 dispatch.py:272] Finished tracing + transforming _mean for pjit in 0.001316070556640625 sec
I0605 17:38:50.345351 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/ff_layer/layer_norm/scale with shape=[2048], dtype=<class 'jax.numpy.float32'>, init method=constant and scale=0.0
I0605 17:38:50.346637 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/ff_layer/layer_norm/bias with shape=[2048], dtype=<class 'jax.numpy.float32'>, init method=constant and scale=0.0
I0605 17:38:50.359563 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/ff_layer/ffn_layer1/linear/w with shape=[2048, 8192], dtype=<class 'jax.numpy.float32'>, init method=gaussian and scale=0.023
W0605 17:38:50.361818 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.00038051605224609375 sec
W0605 17:38:50.363132 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00048160552978515625 sec
W0605 17:38:50.363966 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003731250762939453 sec
W0605 17:38:50.364569 139756486557120 dispatch.py:272] Finished tracing + transforming _uniform for pjit in 0.0040357112884521484 sec
W0605 17:38:50.365290 139756486557120 dispatch.py:272] Finished tracing + transforming _normal_real for pjit in 0.005079507827758789 sec
W0605 17:38:50.365629 139756486557120 dispatch.py:272] Finished tracing + transforming _normal for pjit in 0.005695819854736328 sec
W0605 17:38:50.366461 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0004315376281738281 sec
W0605 17:38:50.369798 139756486557120 dispatch.py:272] Finished tracing + transforming _einsum for pjit in 0.0006268024444580078 sec
I0605 17:38:50.370664 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/ff_layer/ffn_layer1/bias/b with shape=[8192], dtype=<class 'jax.numpy.float32'>, init method=constant and scale=0.0
W0605 17:38:50.371565 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00043845176696777344 sec
W0605 17:38:50.373037 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0005903244018554688 sec
W0605 17:38:50.374403 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0004355907440185547 sec
W0605 17:38:50.375318 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003592967987060547 sec
W0605 17:38:50.376131 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003597736358642578 sec
W0605 17:38:50.376911 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0002892017364501953 sec
W0605 17:38:50.377897 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0005381107330322266 sec
W0605 17:38:50.379199 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003714561462402344 sec
W0605 17:38:50.380480 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003561973571777344 sec
W0605 17:38:50.381428 139756486557120 dispatch.py:272] Finished tracing + transforming _power for pjit in 0.00043392181396484375 sec
W0605 17:38:50.382789 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.00045108795166015625 sec
W0605 17:38:50.383347 139756486557120 dispatch.py:272] Finished tracing + transforming _mean for pjit in 0.0013604164123535156 sec
I0605 17:38:50.391109 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/ff_layer/ffn_layer2/linear/w with shape=[8192, 2048], dtype=<class 'jax.numpy.float32'>, init method=gaussian and scale=0.023
W0605 17:38:50.393342 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.00039124488830566406 sec
W0605 17:38:50.394674 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0004923343658447266 sec
W0605 17:38:50.395514 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00037169456481933594 sec
W0605 17:38:50.396109 139756486557120 dispatch.py:272] Finished tracing + transforming _uniform for pjit in 0.004067897796630859 sec
W0605 17:38:50.396795 139756486557120 dispatch.py:272] Finished tracing + transforming _normal_real for pjit in 0.005061626434326172 sec
W0605 17:38:50.397140 139756486557120 dispatch.py:272] Finished tracing + transforming _normal for pjit in 0.0056803226470947266 sec
W0605 17:38:50.397976 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0004203319549560547 sec
W0605 17:38:50.401306 139756486557120 dispatch.py:272] Finished tracing + transforming _einsum for pjit in 0.0006275177001953125 sec
I0605 17:38:50.402132 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/ff_layer/ffn_layer2/bias/b with shape=[2048], dtype=<class 'jax.numpy.float32'>, init method=constant and scale=0.0
I0605 17:38:50.458132 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/layer_norm/scale with shape=[2048], dtype=<class 'jax.numpy.float32'>, init method=constant and scale=0.0
I0605 17:38:50.459451 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/layer_norm/bias with shape=[2048], dtype=<class 'jax.numpy.float32'>, init method=constant and scale=0.0
I0605 17:38:50.476639 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/self_attention/combined_qkv/w with shape=[3, 2048, 32, 64], dtype=<class 'jax.numpy.float32'>, init method=gaussian and scale=0.023
I0605 17:38:50.481575 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/self_attention/per_dim_scale/per_dim_scale with shape=[64], dtype=<class 'jax.numpy.float32'>, init method=constant and scale=0.0
I0605 17:38:50.493132 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/self_attention/post/w with shape=[2048, 32, 64], dtype=<class 'jax.numpy.float32'>, init method=gaussian and scale=0.023
I0605 17:38:50.520059 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/ff_layer/layer_norm/scale with shape=[2048], dtype=<class 'jax.numpy.float32'>, init method=constant and scale=0.0
I0605 17:38:50.521409 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/ff_layer/layer_norm/bias with shape=[2048], dtype=<class 'jax.numpy.float32'>, init method=constant and scale=0.0
I0605 17:38:50.535170 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/ff_layer/ffn_layer1/linear/w with shape=[2048, 8192], dtype=<class 'jax.numpy.float32'>, init method=gaussian and scale=0.023
I0605 17:38:50.539281 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/ff_layer/ffn_layer1/bias/b with shape=[8192], dtype=<class 'jax.numpy.float32'>, init method=constant and scale=0.0
I0605 17:38:50.553280 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/ff_layer/ffn_layer2/linear/w with shape=[8192, 2048], dtype=<class 'jax.numpy.float32'>, init method=gaussian and scale=0.023
I0605 17:38:50.557278 139756486557120 base_layer.py:632] Creating var /lm/transformer/repeat/remat(scan(map_variables(map_variables(map_variables(sub)))))/x_layers_0/ff_layer/ffn_layer2/bias/b with shape=[2048], dtype=<class 'jax.numpy.float32'>, init method=constant and scale=0.0
W0605 17:38:50.586362 139756486557120 dispatch.py:272] Finished tracing + transforming logaddexp for pjit in 0.0009326934814453125 sec
W0605 17:38:50.587100 139756486557120 dispatch.py:272] Finished tracing + transforming real for pjit in 0.00015854835510253906 sec
W0605 17:38:50.588630 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0002460479736328125 sec
W0605 17:38:50.589258 139756486557120 dispatch.py:272] Finished tracing + transforming real for pjit in 0.00016546249389648438 sec
I0605 17:38:50.685316 139756486557120 base_layer.py:632] Creating var /lm/final_ln/scale with shape=[2048], dtype=<class 'jax.numpy.float32'>, init method=constant and scale=0.0
I0605 17:38:50.686667 139756486557120 base_layer.py:632] Creating var /lm/final_ln/bias with shape=[2048], dtype=<class 'jax.numpy.float32'>, init method=constant and scale=0.0
W0605 17:38:50.707986 139756486557120 dispatch.py:272] Finished tracing + transforming _einsum for pjit in 0.0006191730499267578 sec
I0605 17:38:50.711300 139756486557120 base_layer.py:632] Creating var /lm/softmax/logits_ffn/bias/b with shape=[51200], dtype=<class 'jax.numpy.float32'>, init method=constant and scale=0.0
W0605 17:38:50.712251 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00042057037353515625 sec
W0605 17:38:50.713695 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0005605220794677734 sec
W0605 17:38:50.716571 139756486557120 dispatch.py:272] Finished tracing + transforming true_divide for pjit in 0.00040793418884277344 sec
W0605 17:38:50.718845 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.00027251243591308594 sec
W0605 17:38:50.721838 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0004107952117919922 sec
W0605 17:38:50.724678 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_max for pjit in 0.0005333423614501953 sec
W0605 17:38:50.725696 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.00043582916259765625 sec
W0605 17:38:50.726395 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0002675056457519531 sec
W0605 17:38:50.727391 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.0005466938018798828 sec
W0605 17:38:50.728161 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.00026035308837890625 sec
W0605 17:38:50.728855 139756486557120 dispatch.py:272] Finished tracing + transforming log_softmax for pjit in 0.004996776580810547 sec
W0605 17:38:50.734375 139756486557120 dispatch.py:272] Finished tracing + transforming _squeeze for pjit in 0.0002372264862060547 sec
W0605 17:38:50.735692 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.00036454200744628906 sec
W0605 17:38:50.736228 139756486557120 dispatch.py:272] Finished tracing + transforming _one_hot for pjit in 0.0014050006866455078 sec
W0605 17:38:50.737056 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003540515899658203 sec
W0605 17:38:50.739387 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.00043201446533203125 sec
W0605 17:38:50.741913 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.00024437904357910156 sec
W0605 17:38:50.743786 139756486557120 dispatch.py:272] Finished tracing + transforming _argmax for pjit in 0.00027298927307128906 sec
W0605 17:38:50.745871 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00036263465881347656 sec
W0605 17:38:50.748253 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.00042724609375 sec
W0605 17:38:50.750594 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00039267539978027344 sec
W0605 17:38:50.755076 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.00043892860412597656 sec
W0605 17:38:50.757273 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003552436828613281 sec
W0605 17:38:50.759236 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0003609657287597656 sec
W0605 17:38:50.760184 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0004966259002685547 sec
W0605 17:38:50.761510 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.00043082237243652344 sec
W0605 17:38:50.762891 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.00024199485778808594 sec
W0605 17:38:50.767537 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003418922424316406 sec
W0605 17:38:50.782197 139756486557120 dispatch.py:272] Finished tracing + transforming true_divide for pjit in 0.0004863739013671875 sec
W0605 17:38:50.786458 139756486557120 dispatch.py:272] Finished tracing + transforming true_divide for pjit in 0.0004134178161621094 sec
W0605 17:38:50.908352 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0004239082336425781 sec
W0605 17:38:50.910371 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00039196014404296875 sec
W0605 17:38:50.911374 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0004189014434814453 sec
W0605 17:38:50.912329 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00042819976806640625 sec
W0605 17:38:50.913593 139756486557120 dispatch.py:272] Finished tracing + transforming _broadcast_arrays for pjit in 0.0003299713134765625 sec
W0605 17:38:50.913979 139756486557120 dispatch.py:272] Finished tracing + transforming _where for pjit in 0.0010998249053955078 sec
W0605 17:38:50.915056 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0004069805145263672 sec
W0605 17:38:50.915954 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0003898143768310547 sec
W0605 17:38:50.918364 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0004146099090576172 sec
W0605 17:38:50.920581 139756486557120 dispatch.py:272] Finished tracing + transforming _where for pjit in 0.0006053447723388672 sec
W0605 17:38:50.921390 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0003533363342285156 sec
W0605 17:38:50.922576 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003867149353027344 sec
W0605 17:38:50.924040 139756486557120 dispatch.py:272] Finished tracing + transforming _where for pjit in 0.0005407333374023438 sec
W0605 17:38:50.925589 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00033211708068847656 sec
W0605 17:38:50.926552 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.00044608116149902344 sec
W0605 17:38:50.927937 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00034737586975097656 sec
W0605 17:38:50.928853 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.0004279613494873047 sec
W0605 17:38:50.929699 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0004150867462158203 sec
W0605 17:38:50.930606 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.0004134178161621094 sec
W0605 17:38:50.931371 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003495216369628906 sec
W0605 17:38:50.932270 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.00041294097900390625 sec
W0605 17:38:50.933025 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00034499168395996094 sec
W0605 17:38:50.933944 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.0004177093505859375 sec
W0605 17:38:50.934698 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00034165382385253906 sec
W0605 17:38:50.935618 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.0004241466522216797 sec
W0605 17:38:50.936379 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003333091735839844 sec
W0605 17:38:50.937386 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.0005240440368652344 sec
W0605 17:38:50.938133 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003476142883300781 sec
W0605 17:38:50.939027 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.0004146099090576172 sec
W0605 17:38:50.942208 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00035643577575683594 sec
W0605 17:38:50.943121 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.00042128562927246094 sec
W0605 17:38:50.943864 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003342628479003906 sec
W0605 17:38:50.944764 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.00042128562927246094 sec
W0605 17:38:50.946032 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0008378028869628906 sec
W0605 17:38:50.946952 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.00042057037353515625 sec
W0605 17:38:50.949638 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.00041675567626953125 sec
W0605 17:38:50.950620 139756486557120 dispatch.py:272] Finished tracing + transforming isfinite for pjit in 0.00022411346435546875 sec
W0605 17:38:50.951457 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_all for pjit in 0.0004432201385498047 sec
W0605 17:38:50.952910 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0004074573516845703 sec
W0605 17:38:50.953750 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00035572052001953125 sec
W0605 17:38:50.955095 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003368854522705078 sec
W0605 17:38:50.955853 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00032639503479003906 sec
W0605 17:38:50.956612 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003349781036376953 sec
W0605 17:38:50.957367 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00033402442932128906 sec
W0605 17:38:50.958131 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00034165382385253906 sec
W0605 17:38:50.958882 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003368854522705078 sec
W0605 17:38:50.960838 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003952980041503906 sec
W0605 17:38:50.961642 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003399848937988281 sec
W0605 17:38:50.962403 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003294944763183594 sec
W0605 17:38:50.974797 139756486557120 optimizers.py:1170] DEPRECATION WARNING: p.weight_decay will be deprecated. In future, we will do a migration to remove p.weight_decay and after that, setting it will throw an exception. In future, we will use p.l2_regularizer_weight for coupled weight decay (i.e., weight decays that affect optimizer slots), and use p.decoupled_weight_decay for decoupled weight decay (i.e., weight decays that are added only to the final update).
I0605 17:38:50.974858 139756486557120 optimizers.py:1173] Using sharded_adam.
W0605 17:38:50.974896 139756486557120 optimizers.py:580] DEPRECATION WARNING: p.weight_decay will be deprecated. In future, we will do a migration to remove p.weight_decay and after that, setting it will throw an exception. In future, we will use p.l2_regularizer_weight for coupled weight decay (i.e., weight decays that affect optimizer slots), and use p.decoupled_weight_decay for decoupled weight decay (i.e., weight decays that are added only to the final update).
W0605 17:38:50.976279 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003859996795654297 sec
W0605 17:38:50.977996 139756486557120 dispatch.py:272] Finished tracing + transforming isnan for pjit in 0.0002586841583251953 sec
W0605 17:38:50.979636 139756486557120 dispatch.py:272] Finished tracing + transforming _broadcast_arrays for pjit in 0.0008521080017089844 sec
W0605 17:38:50.980117 139756486557120 dispatch.py:272] Finished tracing + transforming _where for pjit in 0.0016627311706542969 sec
W0605 17:38:50.981424 139756486557120 dispatch.py:272] Finished tracing + transforming nan_to_num for pjit in 0.004053592681884766 sec
W0605 17:38:50.982840 139756486557120 dispatch.py:272] Finished tracing + transforming isnan for pjit in 0.0002465248107910156 sec
W0605 17:38:50.984083 139756486557120 dispatch.py:272] Finished tracing + transforming _broadcast_arrays for pjit in 0.00047087669372558594 sec
W0605 17:38:50.984557 139756486557120 dispatch.py:272] Finished tracing + transforming _where for pjit in 0.0012707710266113281 sec
W0605 17:38:50.985852 139756486557120 dispatch.py:272] Finished tracing + transforming nan_to_num for pjit in 0.0035409927368164062 sec
W0605 17:38:50.986913 139756486557120 dispatch.py:272] Finished tracing + transforming isnan for pjit in 0.00024271011352539062 sec
W0605 17:38:50.988064 139756486557120 dispatch.py:272] Finished tracing + transforming _broadcast_arrays for pjit in 0.0003974437713623047 sec
W0605 17:38:50.988533 139756486557120 dispatch.py:272] Finished tracing + transforming _where for pjit in 0.0011773109436035156 sec
W0605 17:38:50.989825 139756486557120 dispatch.py:272] Finished tracing + transforming nan_to_num for pjit in 0.0034170150756835938 sec
W0605 17:38:50.990873 139756486557120 dispatch.py:272] Finished tracing + transforming isnan for pjit in 0.00024271011352539062 sec
W0605 17:38:50.992126 139756486557120 dispatch.py:272] Finished tracing + transforming _broadcast_arrays for pjit in 0.0003910064697265625 sec
W0605 17:38:50.992585 139756486557120 dispatch.py:272] Finished tracing + transforming _where for pjit in 0.0012733936309814453 sec
W0605 17:38:50.993876 139756486557120 dispatch.py:272] Finished tracing + transforming nan_to_num for pjit in 0.0034990310668945312 sec
W0605 17:38:50.995401 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.00038313865661621094 sec
W0605 17:38:50.996306 139756486557120 dispatch.py:272] Finished tracing + transforming _power for pjit in 0.00038886070251464844 sec
W0605 17:38:50.997263 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.00040459632873535156 sec
W0605 17:38:51.002679 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003349781036376953 sec
W0605 17:38:51.003729 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003273487091064453 sec
W0605 17:38:51.019937 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00034332275390625 sec
W0605 17:38:51.021034 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00036072731018066406 sec
W0605 17:38:51.029161 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003571510314941406 sec
W0605 17:38:51.030206 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003421306610107422 sec
W0605 17:38:51.038254 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00034236907958984375 sec
W0605 17:38:51.039307 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003333091735839844 sec
W0605 17:38:51.041758 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.000400543212890625 sec
W0605 17:38:51.042524 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0002651214599609375 sec
W0605 17:38:51.044231 139756486557120 dispatch.py:272] Finished tracing + transforming true_divide for pjit in 0.0003452301025390625 sec
W0605 17:38:51.046305 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00038170814514160156 sec
W0605 17:38:51.047033 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0002536773681640625 sec
W0605 17:38:51.048109 139756486557120 dispatch.py:272] Finished tracing + transforming true_divide for pjit in 0.00033783912658691406 sec
W0605 17:38:51.048935 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00038361549377441406 sec
W0605 17:38:51.049658 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0002503395080566406 sec
W0605 17:38:51.050806 139756486557120 dispatch.py:272] Finished tracing + transforming true_divide for pjit in 0.00040531158447265625 sec
W0605 17:38:51.051653 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003936290740966797 sec
W0605 17:38:51.052373 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.00025343894958496094 sec
W0605 17:38:51.053460 139756486557120 dispatch.py:272] Finished tracing + transforming true_divide for pjit in 0.0003292560577392578 sec
W0605 17:38:51.054191 139756486557120 dispatch.py:272] Finished tracing + transforming square for pjit in 0.0002491474151611328 sec
W0605 17:38:51.055417 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.0004432201385498047 sec
W0605 17:38:51.055958 139756486557120 dispatch.py:272] Finished tracing + transforming _mean for pjit in 0.001308441162109375 sec
W0605 17:38:51.057440 139756486557120 dispatch.py:272] Finished tracing + transforming isnan for pjit in 0.0002446174621582031 sec
W0605 17:38:51.059051 139756486557120 dispatch.py:272] Finished tracing + transforming nan_to_num for pjit in 0.0022339820861816406 sec
W0605 17:38:51.060664 139756486557120 dispatch.py:272] Finished tracing + transforming true_divide for pjit in 0.0003437995910644531 sec
W0605 17:38:51.063558 139756486557120 dispatch.py:272] Finished tracing + transforming square for pjit in 0.00025272369384765625 sec
W0605 17:38:51.064804 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.00048232078552246094 sec
W0605 17:38:51.065351 139756486557120 dispatch.py:272] Finished tracing + transforming _mean for pjit in 0.0013427734375 sec
W0605 17:38:51.067524 139756486557120 dispatch.py:272] Finished tracing + transforming true_divide for pjit in 0.0003342628479003906 sec
W0605 17:38:51.068215 139756486557120 dispatch.py:272] Finished tracing + transforming square for pjit in 0.0002677440643310547 sec
W0605 17:38:51.069389 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.00043010711669921875 sec
W0605 17:38:51.069930 139756486557120 dispatch.py:272] Finished tracing + transforming _mean for pjit in 0.0012748241424560547 sec
W0605 17:38:51.072082 139756486557120 dispatch.py:272] Finished tracing + transforming true_divide for pjit in 0.00033545494079589844 sec
W0605 17:38:51.072823 139756486557120 dispatch.py:272] Finished tracing + transforming square for pjit in 0.00032258033752441406 sec
W0605 17:38:51.073729 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.0004303455352783203 sec
W0605 17:38:51.076472 139756486557120 dispatch.py:272] Finished tracing + transforming true_divide for pjit in 0.0003464221954345703 sec
W0605 17:38:51.077437 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00040221214294433594 sec
W0605 17:38:51.080010 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003974437713623047 sec
W0605 17:38:51.084977 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003871917724609375 sec
W0605 17:38:51.085875 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0003821849822998047 sec
W0605 17:38:51.106950 139756486557120 dispatch.py:272] Finished tracing + transforming isnan for pjit in 0.00027370452880859375 sec
W0605 17:38:51.108178 139756486557120 dispatch.py:272] Finished tracing + transforming _broadcast_arrays for pjit in 0.0004146099090576172 sec
W0605 17:38:51.108645 139756486557120 dispatch.py:272] Finished tracing + transforming _where for pjit in 0.001222848892211914 sec
W0605 17:38:51.109961 139756486557120 dispatch.py:272] Finished tracing + transforming nan_to_num for pjit in 0.0036516189575195312 sec
W0605 17:38:51.113869 139756486557120 dispatch.py:272] Finished tracing + transforming isnan for pjit in 0.00024700164794921875 sec
W0605 17:38:51.115013 139756486557120 dispatch.py:272] Finished tracing + transforming _broadcast_arrays for pjit in 0.0003838539123535156 sec
W0605 17:38:51.115498 139756486557120 dispatch.py:272] Finished tracing + transforming _where for pjit in 0.00118255615234375 sec
W0605 17:38:51.116794 139756486557120 dispatch.py:272] Finished tracing + transforming nan_to_num for pjit in 0.0034394264221191406 sec
W0605 17:38:51.123821 139756486557120 dispatch.py:272] Finished tracing + transforming isnan for pjit in 0.00025463104248046875 sec
W0605 17:38:51.125084 139756486557120 dispatch.py:272] Finished tracing + transforming _broadcast_arrays for pjit in 0.0004799365997314453 sec
W0605 17:38:51.125572 139756486557120 dispatch.py:272] Finished tracing + transforming _where for pjit in 0.0012826919555664062 sec
W0605 17:38:51.126914 139756486557120 dispatch.py:272] Finished tracing + transforming nan_to_num for pjit in 0.003610372543334961 sec
W0605 17:38:51.132493 139756486557120 dispatch.py:272] Finished tracing + transforming isnan for pjit in 0.00025391578674316406 sec
W0605 17:38:51.134341 139756486557120 dispatch.py:272] Finished tracing + transforming _broadcast_arrays for pjit in 0.0010538101196289062 sec
W0605 17:38:51.134835 139756486557120 dispatch.py:272] Finished tracing + transforming _where for pjit in 0.0018856525421142578 sec
W0605 17:38:51.136130 139756486557120 dispatch.py:272] Finished tracing + transforming nan_to_num for pjit in 0.004152536392211914 sec
W0605 17:38:51.140041 139756486557120 dispatch.py:272] Finished tracing + transforming isnan for pjit in 0.0002532005310058594 sec
W0605 17:38:51.141291 139756486557120 dispatch.py:272] Finished tracing + transforming _broadcast_arrays for pjit in 0.0004820823669433594 sec
W0605 17:38:51.141758 139756486557120 dispatch.py:272] Finished tracing + transforming _where for pjit in 0.0012583732604980469 sec
W0605 17:38:51.143049 139756486557120 dispatch.py:272] Finished tracing + transforming nan_to_num for pjit in 0.003523588180541992 sec
W0605 17:38:51.146885 139756486557120 dispatch.py:272] Finished tracing + transforming isnan for pjit in 0.0002493858337402344 sec
W0605 17:38:51.148137 139756486557120 dispatch.py:272] Finished tracing + transforming _broadcast_arrays for pjit in 0.0004699230194091797 sec
W0605 17:38:51.148612 139756486557120 dispatch.py:272] Finished tracing + transforming _where for pjit in 0.0012650489807128906 sec
W0605 17:38:51.149911 139756486557120 dispatch.py:272] Finished tracing + transforming nan_to_num for pjit in 0.003537416458129883 sec
W0605 17:38:51.164990 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003509521484375 sec
W0605 17:38:51.167197 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00033211708068847656 sec
W0605 17:38:51.179063 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00034117698669433594 sec
W0605 17:38:51.181202 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00036454200744628906 sec
W0605 17:38:51.205911 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00033593177795410156 sec
W0605 17:38:51.208079 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003936290740966797 sec
W0605 17:38:51.267738 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00036597251892089844 sec
W0605 17:38:51.269978 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003571510314941406 sec
W0605 17:38:51.283516 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003502368927001953 sec
W0605 17:38:51.295324 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003483295440673828 sec
W0605 17:38:51.297498 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00034880638122558594 sec
W0605 17:38:51.301339 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00040984153747558594 sec
W0605 17:38:51.302777 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0002617835998535156 sec
W0605 17:38:51.304450 139756486557120 dispatch.py:272] Finished tracing + transforming true_divide for pjit in 0.00033354759216308594 sec
W0605 17:38:51.305875 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003902912139892578 sec
W0605 17:38:51.307199 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.00026226043701171875 sec
W0605 17:38:51.308934 139756486557120 dispatch.py:272] Finished tracing + transforming true_divide for pjit in 0.0004074573516845703 sec
W0605 17:38:51.313358 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00040531158447265625 sec
W0605 17:38:51.314678 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0002655982971191406 sec
W0605 17:38:51.316417 139756486557120 dispatch.py:272] Finished tracing + transforming true_divide for pjit in 0.0003962516784667969 sec
W0605 17:38:51.324674 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003845691680908203 sec
W0605 17:38:51.326095 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0003135204315185547 sec
W0605 17:38:51.327770 139756486557120 dispatch.py:272] Finished tracing + transforming true_divide for pjit in 0.000331878662109375 sec
W0605 17:38:51.329179 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003993511199951172 sec
W0605 17:38:51.330489 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0002593994140625 sec
W0605 17:38:51.332172 139756486557120 dispatch.py:272] Finished tracing + transforming true_divide for pjit in 0.000331878662109375 sec
W0605 17:38:51.333650 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00038313865661621094 sec
W0605 17:38:51.334960 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0002560615539550781 sec
W0605 17:38:51.336631 139756486557120 dispatch.py:272] Finished tracing + transforming true_divide for pjit in 0.00034332275390625 sec
W0605 17:38:51.337970 139756486557120 dispatch.py:272] Finished tracing + transforming square for pjit in 0.000255584716796875 sec
W0605 17:38:51.339765 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.0005276203155517578 sec
W0605 17:38:51.340311 139756486557120 dispatch.py:272] Finished tracing + transforming _mean for pjit in 0.0014069080352783203 sec
W0605 17:38:51.347506 139756486557120 dispatch.py:272] Finished tracing + transforming true_divide for pjit in 0.0003342628479003906 sec
W0605 17:38:51.349030 139756486557120 dispatch.py:272] Finished tracing + transforming square for pjit in 0.00026154518127441406 sec
W0605 17:38:51.350681 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.0004153251647949219 sec
W0605 17:38:51.351207 139756486557120 dispatch.py:272] Finished tracing + transforming _mean for pjit in 0.001252889633178711 sec
W0605 17:38:51.354593 139756486557120 dispatch.py:272] Finished tracing + transforming true_divide for pjit in 0.0003325939178466797 sec
W0605 17:38:51.360800 139756486557120 dispatch.py:272] Finished tracing + transforming square for pjit in 0.00024509429931640625 sec
W0605 17:38:51.362443 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.00042128562927246094 sec
W0605 17:38:51.362977 139756486557120 dispatch.py:272] Finished tracing + transforming _mean for pjit in 0.0012655258178710938 sec
W0605 17:38:51.366878 139756486557120 dispatch.py:272] Finished tracing + transforming true_divide for pjit in 0.0003380775451660156 sec
W0605 17:38:51.381033 139756486557120 dispatch.py:272] Finished tracing + transforming square for pjit in 0.0002589225769042969 sec
W0605 17:38:51.382707 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.00042128562927246094 sec
W0605 17:38:51.383244 139756486557120 dispatch.py:272] Finished tracing + transforming _mean for pjit in 0.0012717247009277344 sec
W0605 17:38:51.386665 139756486557120 dispatch.py:272] Finished tracing + transforming true_divide for pjit in 0.0004096031188964844 sec
W0605 17:38:51.388154 139756486557120 dispatch.py:272] Finished tracing + transforming square for pjit in 0.0002455711364746094 sec
W0605 17:38:51.389823 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.0004107952117919922 sec
W0605 17:38:51.390361 139756486557120 dispatch.py:272] Finished tracing + transforming _mean for pjit in 0.0012784004211425781 sec
W0605 17:38:51.393694 139756486557120 dispatch.py:272] Finished tracing + transforming true_divide for pjit in 0.0003459453582763672 sec
W0605 17:38:51.395206 139756486557120 dispatch.py:272] Finished tracing + transforming square for pjit in 0.00025153160095214844 sec
W0605 17:38:51.396828 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_sum for pjit in 0.00041174888610839844 sec
W0605 17:38:51.397366 139756486557120 dispatch.py:272] Finished tracing + transforming _mean for pjit in 0.0012562274932861328 sec
W0605 17:38:51.400723 139756486557120 dispatch.py:272] Finished tracing + transforming true_divide for pjit in 0.00036334991455078125 sec
W0605 17:38:51.402534 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00038313865661621094 sec
W0605 17:38:51.413817 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003960132598876953 sec
W0605 17:38:51.458365 139756486557120 dispatch.py:272] Finished tracing + transforming _broadcast_arrays for pjit in 0.00040435791015625 sec
W0605 17:38:51.458882 139756486557120 dispatch.py:272] Finished tracing + transforming _where for pjit in 0.0012650489807128906 sec
W0605 17:38:51.460613 139756486557120 dispatch.py:272] Finished tracing + transforming _broadcast_arrays for pjit in 0.0003859996795654297 sec
W0605 17:38:51.461089 139756486557120 dispatch.py:272] Finished tracing + transforming _where for pjit in 0.001180410385131836 sec
W0605 17:38:51.462345 139756486557120 dispatch.py:272] Finished tracing + transforming _broadcast_arrays for pjit in 0.0003674030303955078 sec
W0605 17:38:51.462806 139756486557120 dispatch.py:272] Finished tracing + transforming _where for pjit in 0.0011293888092041016 sec
W0605 17:38:51.464207 139756486557120 dispatch.py:272] Finished tracing + transforming _broadcast_arrays for pjit in 0.0003802776336669922 sec
W0605 17:38:51.464680 139756486557120 dispatch.py:272] Finished tracing + transforming _where for pjit in 0.0012753009796142578 sec
W0605 17:38:51.465978 139756486557120 dispatch.py:272] Finished tracing + transforming _broadcast_arrays for pjit in 0.0003809928894042969 sec
W0605 17:38:51.466452 139756486557120 dispatch.py:272] Finished tracing + transforming _where for pjit in 0.0011639595031738281 sec
W0605 17:38:51.467733 139756486557120 dispatch.py:272] Finished tracing + transforming _broadcast_arrays for pjit in 0.00038886070251464844 sec
W0605 17:38:51.468206 139756486557120 dispatch.py:272] Finished tracing + transforming _where for pjit in 0.0011620521545410156 sec
W0605 17:38:51.469493 139756486557120 dispatch.py:272] Finished tracing + transforming _broadcast_arrays for pjit in 0.0004017353057861328 sec
W0605 17:38:51.469960 139756486557120 dispatch.py:272] Finished tracing + transforming _where for pjit in 0.0011703968048095703 sec
W0605 17:38:51.471318 139756486557120 dispatch.py:272] Finished tracing + transforming _broadcast_arrays for pjit in 0.0003769397735595703 sec
W0605 17:38:51.471788 139756486557120 dispatch.py:272] Finished tracing + transforming _where for pjit in 0.0012331008911132812 sec
W0605 17:38:51.474819 139756486557120 dispatch.py:272] Finished tracing + transforming _broadcast_arrays for pjit in 0.0003876686096191406 sec
W0605 17:38:51.475300 139756486557120 dispatch.py:272] Finished tracing + transforming _where for pjit in 0.0011796951293945312 sec
W0605 17:38:51.476653 139756486557120 dispatch.py:272] Finished tracing + transforming _broadcast_arrays for pjit in 0.00045108795166015625 sec
W0605 17:38:51.477125 139756486557120 dispatch.py:272] Finished tracing + transforming _where for pjit in 0.0012183189392089844 sec
W0605 17:38:51.478438 139756486557120 dispatch.py:272] Finished tracing + transforming _broadcast_arrays for pjit in 0.000385284423828125 sec
W0605 17:38:51.478904 139756486557120 dispatch.py:272] Finished tracing + transforming _where for pjit in 0.0011568069458007812 sec
W0605 17:38:51.480214 139756486557120 dispatch.py:272] Finished tracing + transforming _broadcast_arrays for pjit in 0.00021004676818847656 sec
W0605 17:38:51.480560 139756486557120 dispatch.py:272] Finished tracing + transforming _where for pjit in 0.0008604526519775391 sec
W0605 17:38:51.484501 139756486557120 dispatch.py:272] Finished tracing + transforming _broadcast_arrays for pjit in 0.0003781318664550781 sec
W0605 17:38:51.484953 139756486557120 dispatch.py:272] Finished tracing + transforming _where for pjit in 0.0011394023895263672 sec
W0605 17:38:51.518642 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00034880638122558594 sec
W0605 17:38:51.519460 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003371238708496094 sec
W0605 17:38:51.520232 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003368854522705078 sec
W0605 17:38:51.520990 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003304481506347656 sec
W0605 17:38:51.522938 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0004017353057861328 sec
W0605 17:38:51.523720 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003342628479003906 sec
W0605 17:38:51.524483 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003447532653808594 sec
W0605 17:38:51.526116 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0004146099090576172 sec
W0605 17:38:51.526935 139756486557120 dispatch.py:272] Finished tracing + transforming absolute for pjit in 0.0002446174621582031 sec
W0605 17:38:51.527758 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_max for pjit in 0.0003864765167236328 sec
W0605 17:38:51.528560 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0003807544708251953 sec
W0605 17:38:51.533638 139756486557120 dispatch.py:272] Finished tracing + transforming absolute for pjit in 0.0002372264862060547 sec
W0605 17:38:51.534479 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_max for pjit in 0.00039267539978027344 sec
W0605 17:38:51.537130 139756486557120 dispatch.py:272] Finished tracing + transforming absolute for pjit in 0.0002434253692626953 sec
W0605 17:38:51.538020 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_max for pjit in 0.00043773651123046875 sec
W0605 17:38:51.540642 139756486557120 dispatch.py:272] Finished tracing + transforming absolute for pjit in 0.0002391338348388672 sec
W0605 17:38:51.541481 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_max for pjit in 0.00040149688720703125 sec
W0605 17:38:51.544121 139756486557120 dispatch.py:272] Finished tracing + transforming absolute for pjit in 0.00024962425231933594 sec
W0605 17:38:51.545009 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_max for pjit in 0.00044608116149902344 sec
W0605 17:38:51.546122 139756486557120 dispatch.py:272] Finished tracing + transforming true_divide for pjit in 0.00033855438232421875 sec
W0605 17:38:51.548141 139756486557120 dispatch.py:272] Finished tracing + transforming absolute for pjit in 0.00024080276489257812 sec
W0605 17:38:51.548953 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_max for pjit in 0.00038123130798339844 sec
W0605 17:38:51.550048 139756486557120 dispatch.py:272] Finished tracing + transforming true_divide for pjit in 0.0003368854522705078 sec
W0605 17:38:51.552147 139756486557120 dispatch.py:272] Finished tracing + transforming absolute for pjit in 0.0002999305725097656 sec
W0605 17:38:51.552977 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_max for pjit in 0.0003833770751953125 sec
W0605 17:38:51.554072 139756486557120 dispatch.py:272] Finished tracing + transforming true_divide for pjit in 0.0003323554992675781 sec
W0605 17:38:51.556094 139756486557120 dispatch.py:272] Finished tracing + transforming absolute for pjit in 0.000232696533203125 sec
W0605 17:38:51.556920 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_max for pjit in 0.0003819465637207031 sec
W0605 17:38:51.558031 139756486557120 dispatch.py:272] Finished tracing + transforming true_divide for pjit in 0.00033664703369140625 sec
W0605 17:38:51.570347 139756486557120 dispatch.py:272] Finished tracing + transforming absolute for pjit in 0.00024271011352539062 sec
W0605 17:38:51.571178 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_max for pjit in 0.0003898143768310547 sec
W0605 17:38:51.572269 139756486557120 dispatch.py:272] Finished tracing + transforming true_divide for pjit in 0.00032901763916015625 sec
W0605 17:38:51.574327 139756486557120 dispatch.py:272] Finished tracing + transforming absolute for pjit in 0.0002353191375732422 sec
W0605 17:38:51.575155 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_max for pjit in 0.00037741661071777344 sec
W0605 17:38:51.576297 139756486557120 dispatch.py:272] Finished tracing + transforming true_divide for pjit in 0.00038695335388183594 sec
W0605 17:38:51.578324 139756486557120 dispatch.py:272] Finished tracing + transforming absolute for pjit in 0.0002453327178955078 sec
W0605 17:38:51.579146 139756486557120 dispatch.py:272] Finished tracing + transforming _reduce_max for pjit in 0.00038051605224609375 sec
W0605 17:38:51.580233 139756486557120 dispatch.py:272] Finished tracing + transforming true_divide for pjit in 0.00032901763916015625 sec
W0605 17:38:51.625336 139756486557120 dispatch.py:272] Finished tracing + transforming _wrapped_step_fn for pmap in 1.566312313079834 sec
W0605 17:38:51.626110 139756486557120 pxla.py:859] Compiling _wrapped_step_fn (139741169123712) for 1 devices with args (ShapedArray(uint32[1]), ShapedArray(float32[1,2048]), ShapedArray(float32[1,2048]), ShapedArray(float32[1,2048,2048]), ShapedArray(float32[1,51200]), ShapedArray(float32[1,2048,51200]), ShapedArray(float32[1,24,8192]), ShapedArray(float32[1,24,2048,8192]), ShapedArray(float32[1,24,2048]), ShapedArray(float32[1,24,8192,2048]), ShapedArray(float32[1,24,2048]), ShapedArray(float32[1,24,2048]), ShapedArray(float32[1,24,2048]), ShapedArray(float32[1,24,2048]), ShapedArray(float32[1,24,3,2048,32,64]), ShapedArray(float32[1,24,64]), ShapedArray(float32[1,24,2048,32,64]), ShapedArray(int32[1]), ShapedArray(int32[1]), ShapedArray(int32[1]), ShapedArray(float32[1,2048]), ShapedArray(float32[1,2048]), ShapedArray(float32[1,2048,2048]), ShapedArray(float32[1,51200]), ShapedArray(float32[1,2048,51200]), ShapedArray(float32[1,2048]), ShapedArray(float32[1,2048]), ShapedArray(float32[1,2048,2048]), ShapedArray(float32[1,51200]), ShapedArray(float32[1,2048,51200]), ShapedArray(int32[1]), ShapedArray(int32[1,24]), ShapedArray(int32[1,24]), ShapedArray(int32[1,24]), ShapedArray(float32[1,24,8192]), ShapedArray(float32[1,24,2048,8192]), ShapedArray(float32[1,24,2048]), ShapedArray(float32[1,24,8192,2048]), ShapedArray(float32[1,24,2048]), ShapedArray(float32[1,24,2048]), ShapedArray(float32[1,24,2048]), ShapedArray(float32[1,24,2048]), ShapedArray(float32[1,24,3,2048,32,64]), ShapedArray(float32[1,24,64]), ShapedArray(float32[1,24,2048,32,64]), ShapedArray(float32[1,24,8192]), ShapedArray(float32[1,24,2048,8192]), ShapedArray(float32[1,24,2048]), ShapedArray(float32[1,24,8192,2048]), ShapedArray(float32[1,24,2048]), ShapedArray(float32[1,24,2048]), ShapedArray(float32[1,24,2048]), ShapedArray(float32[1,24,2048]), ShapedArray(float32[1,24,3,2048,32,64]), ShapedArray(float32[1,24,64]), ShapedArray(float32[1,24,2048,32,64]), ShapedArray(int32[1,24]), ShapedArray(uint32[1,2]), ShapedArray(float32[1,1]), ShapedArray(int32[1,1,2048]), ShapedArray(int32[1,1,2048]), ShapedArray(float32[1,1,2048]), ShapedArray(int32[1,1,2048]), ShapedArray(int32[1,1,2048]), ShapedArray(float32[1,1,2048])). (num_replicas=1)
/workspace/jax/jax/_src/interpreters/mlir.py:618: UserWarning: Some donated buffers were not usable: ShapedArray(uint32[]), ShapedArray(float32[2048]), ShapedArray(float32[2048]), ShapedArray(float32[2048,2048]), ShapedArray(float32[51200]), ShapedArray(float32[2048,51200]), ShapedArray(float32[24,8192]), ShapedArray(float32[24,2048,8192]), ShapedArray(float32[24,2048]), ShapedArray(float32[24,8192,2048]), ShapedArray(float32[24,2048]), ShapedArray(float32[24,2048]), ShapedArray(float32[24,2048]), ShapedArray(float32[24,2048]), ShapedArray(float32[24,3,2048,32,64]), ShapedArray(float32[24,64]), ShapedArray(float32[24,2048,32,64]), ShapedArray(int32[]), ShapedArray(int32[]), ShapedArray(int32[]), ShapedArray(float32[2048]), ShapedArray(float32[2048]), ShapedArray(float32[2048,2048]), ShapedArray(float32[51200]), ShapedArray(float32[2048,51200]), ShapedArray(float32[2048]), ShapedArray(float32[2048]), ShapedArray(float32[2048,2048]), ShapedArray(float32[51200]), ShapedArray(float32[2048,51200]), ShapedArray(int32[]), ShapedArray(int32[24]), ShapedArray(int32[24]), ShapedArray(int32[24]), ShapedArray(float32[24,8192]), ShapedArray(float32[24,2048,8192]), ShapedArray(float32[24,2048]), ShapedArray(float32[24,8192,2048]), ShapedArray(float32[24,2048]), ShapedArray(float32[24,2048]), ShapedArray(float32[24,2048]), ShapedArray(float32[24,2048]), ShapedArray(float32[24,3,2048,32,64]), ShapedArray(float32[24,64]), ShapedArray(float32[24,2048,32,64]), ShapedArray(float32[24,8192]), ShapedArray(float32[24,2048,8192]), ShapedArray(float32[24,2048]), ShapedArray(float32[24,8192,2048]), ShapedArray(float32[24,2048]), ShapedArray(float32[24,2048]), ShapedArray(float32[24,2048]), ShapedArray(float32[24,2048]), ShapedArray(float32[24,3,2048,32,64]), ShapedArray(float32[24,64]), ShapedArray(float32[24,2048,32,64]), ShapedArray(int32[24]).
Donation is not implemented for iree_cuda.
See an explanation at https://jax.readthedocs.io/en/latest/faq.html#buffer-donation.
warnings.warn(f"Some donated buffers were not usable: {', '.join(unused_donations)}.\n{msg}")
W0605 17:38:51.637015 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.00033736228942871094 sec
W0605 17:38:51.637705 139756486557120 dispatch.py:272] Finished tracing + transforming _threefry_seed for pjit in 0.0017371177673339844 sec
W0605 17:38:51.639181 139756486557120 dispatch.py:272] Finished tracing + transforming ravel for pjit in 0.0001888275146484375 sec
W0605 17:38:51.639944 139756486557120 dispatch.py:272] Finished tracing + transforming threefry_2x32 for pjit in 0.0015056133270263672 sec
W0605 17:38:51.640912 139756486557120 dispatch.py:272] Finished tracing + transforming _threefry_fold_in for pjit in 0.0051746368408203125 sec
W0605 17:38:51.645074 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0004105567932128906 sec
W0605 17:38:51.646182 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003085136413574219 sec
W0605 17:38:51.647176 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003046989440917969 sec
W0605 17:38:51.648066 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0002963542938232422 sec
W0605 17:38:51.648782 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.00030303001403808594 sec
W0605 17:38:51.700799 139756486557120 dispatch.py:272] Finished tracing + transforming ravel for pjit in 0.00019788742065429688 sec
W0605 17:38:51.701670 139756486557120 dispatch.py:272] Finished tracing + transforming threefry_2x32 for pjit in 0.0016200542449951172 sec
W0605 17:38:51.702680 139756486557120 dispatch.py:272] Finished tracing + transforming _threefry_split_original for pjit in 0.0029451847076416016 sec
W0605 17:38:51.706055 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003178119659423828 sec
W0605 17:38:51.707761 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0010182857513427734 sec
W0605 17:38:51.708686 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0003020763397216797 sec
W0605 17:38:51.709419 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0003161430358886719 sec
W0605 17:38:51.762471 139756486557120 dispatch.py:272] Finished tracing + transforming ravel for pjit in 0.00020575523376464844 sec
W0605 17:38:51.763350 139756486557120 dispatch.py:272] Finished tracing + transforming threefry_2x32 for pjit in 0.0016407966613769531 sec
W0605 17:38:51.764356 139756486557120 dispatch.py:272] Finished tracing + transforming _threefry_split_original for pjit in 0.002961874008178711 sec
W0605 17:38:51.767745 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003142356872558594 sec
W0605 17:38:51.768834 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0004010200500488281 sec
W0605 17:38:51.769746 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0003020763397216797 sec
W0605 17:38:51.770464 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0003070831298828125 sec
W0605 17:38:51.834810 139756486557120 dispatch.py:272] Finished tracing + transforming ravel for pjit in 0.0002052783966064453 sec
W0605 17:38:51.835697 139756486557120 dispatch.py:272] Finished tracing + transforming threefry_2x32 for pjit in 0.001672983169555664 sec
W0605 17:38:51.836712 139756486557120 dispatch.py:272] Finished tracing + transforming _threefry_split_original for pjit in 0.003003358840942383 sec
W0605 17:38:51.840270 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00044655799865722656 sec
W0605 17:38:51.841293 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003247261047363281 sec
W0605 17:38:51.842204 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.00030112266540527344 sec
W0605 17:38:51.842923 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0003039836883544922 sec
W0605 17:38:51.904477 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003521442413330078 sec
W0605 17:38:51.906139 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003561973571777344 sec
W0605 17:38:51.930905 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.000316619873046875 sec
W0605 17:38:52.100980 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003898143768310547 sec
W0605 17:38:52.102146 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003616809844970703 sec
W0605 17:38:52.102983 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.00030803680419921875 sec
W0605 17:38:52.103872 139756486557120 dispatch.py:272] Finished tracing + transforming fn for pjit in 0.0003063678741455078 sec
W0605 17:38:52.131105 139756486557120 dispatch.py:272] Finished tracing + transforming <lambda> for pjit in 0.0003311634063720703 sec
W0605 17:38:52.923904 139756486557120 dispatch.py:272] Finished jaxpr to MLIR module conversion pmap(_wrapped_step_fn) in 1.2972416877746582 sec
W0605 17:39:41.445357 139756486557120 dispatch.py:272] Finished XLA compilation of _wrapped_step_fn in 48.50671124458313 sec
W0605 17:39:41.464271 139756486557120 dispatch.py:272] Finished tracing + transforming _multi_slice for pjit in 0.0005140304565429688 sec
W0605 17:39:41.465060 139756486557120 pxla.py:1882] Compiling _multi_slice for with global shapes and types [ShapedArray(uint32[1,2])]. Argument mapping: (GSPMDSharding({replicated}),).
W0605 17:39:41.470349 139756486557120 dispatch.py:272] Finished jaxpr to MLIR module conversion jit(_multi_slice) in 0.005116462707519531 sec
W0605 17:39:41.740634 139756486557120 dispatch.py:272] Finished XLA compilation of jit(_multi_slice) in 0.2699155807495117 sec
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600142a3c0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600142a3c0, semaphore=0x6060001683e0, value=89 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=90, fence=0x6040003e4450 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600142a3c0, from_fence=0x606000359e40 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060008a3080, semaphore=0x6060001683e0, value=90 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b000476ba0, f=0, wait_fence=0x60600142a3c0 {0x6060001683e0:89}, signal_fence=0x6040003e4450 {0x6060001683e0:90} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600142aa80 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600142ab40 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600142aa80, semaphore=0x6060001683e0, value=90 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600142ab40, semaphore=0x6060001683e0, value=90 (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=4, buffer=0x60c003e70dc0 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=4, wait={0x6060001682c0:132}, signal={0x6060001682c0:133} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:133}, signal={0x6060001682c0:134} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606002537d80 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606002537cc0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606002537d80, semaphore=0x6060001682c0, value=134 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606002537cc0, semaphore=0x6060001682c0, value=134 (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=8192, buffer=0x60c003e70ac0 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=8192, wait={0x6060001682c0:134}, signal={0x6060001682c0:135} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:135}, signal={0x6060001682c0:136} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000280f40 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000542240 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000280f40, semaphore=0x6060001682c0, value=136 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000542240, semaphore=0x6060001682c0, value=136 (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=8192, buffer=0x60c003e707c0 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=8192, wait={0x6060001682c0:136}, signal={0x6060001682c0:137} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:137}, signal={0x6060001682c0:138} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606001b12fe0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600072f0e0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606001b12fe0, semaphore=0x6060001682c0, value=138 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600072f0e0, semaphore=0x6060001682c0, value=138 (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=8192, buffer=0x60c003e704c0 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=8192, wait={0x6060001682c0:138}, signal={0x6060001682c0:139} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:139}, signal={0x6060001682c0:140} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060014ae2a0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606001773980 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060014ae2a0, semaphore=0x6060001682c0, value=140 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606001773980, semaphore=0x6060001682c0, value=140 (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=8192, buffer=0x60c003e701c0 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=8192, wait={0x6060001682c0:140}, signal={0x6060001682c0:141} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:141}, signal={0x6060001682c0:142} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606001061900 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000bd0620 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606001061900, semaphore=0x6060001682c0, value=142 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000bd0620, semaphore=0x6060001682c0, value=142 (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=8192, buffer=0x60c003e0fec0 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=8192, wait={0x6060001682c0:142}, signal={0x6060001682c0:143} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:143}, signal={0x6060001682c0:144} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606001013480 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x6060014ae480 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606001013480, semaphore=0x6060001682c0, value=144 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060014ae480, semaphore=0x6060001682c0, value=144 (OK)
:: IREE INVOKE (hal_allocator_allocate_buffer): allocator=0x60b000158cc0, size=8192, buffer=0x60c003e0fbc0 (OK)
:: IREE INVOKE (hal_device_queue_alloca): device=0x6110011f9800, size=8192, wait={0x6060001682c0:144}, signal={0x6060001682c0:145} (OK)
:: IREE INVOKE (hal_device_queue_execute): device=0x6110011f9800, wait={0x6060001682c0:145}, signal={0x6060001682c0:146} (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606002c0c3c0 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x606000c9f680 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606002c0c3c0, semaphore=0x6060001682c0, value=146 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000c9f680, semaphore=0x6060001682c0, value=146 (OK)
:: IREE INVOKE (hal_fence_create): capacity=2, fence=0x60600089edc0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600089edc0, semaphore=0x6060001683e0, value=90 (OK)
:: IREE INVOKE (hal_fence_create_at): semaphore=0x6060001683e0, value=91, fence=0x60400144bb50 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x606000a17c20 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060004cfdc0, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x60600079a8a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600079a900, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x60600079a960 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600079a9c0, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x60600079aa20 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600075fbc0, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x606000681620 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006b00c0, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x6060006f03e0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006811a0, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x6060002b7720 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000681ec0, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x606000680de0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000681e60, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x6060006b0600 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000682940, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x6060006b2040 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006b25e0, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x606000681b00 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006b0540, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x6060006b2100 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006824c0, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x6060002b77e0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006b1c80, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x6060006b1440 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006b0180, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x6060006b1860 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000681680, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x606000682ca0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006b2ac0, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x606000681a40 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006b0f00, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x606000bbe920 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600081a4c0, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x606000bbea40 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000753b00, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x606000a93c40 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000a93820, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x6060006b29a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006b2b20, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x606000970700 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600034d420, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x606000967e80 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000967e20, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x606000ca0c40 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006442a0, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x6060005719a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060007cf6a0, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x6060005b9700 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060005ba060, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x606000946700 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060009467c0, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x6060007e8240 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000761a80, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x6060003915e0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060001c5e60, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x60600044a9e0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600061e4a0, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x606000927e00 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000315740, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x606000965a80 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060006dbb00, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x6060008210c0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600086c7e0, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x6060007a8160 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000503c60, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x60600009c7a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000779d80, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x6060004af600 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060003eb820, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x60600049fc40 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060004e61a0, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x6060004fbaa0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060003a06a0, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x6060003f5780 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600089cfc0, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x6060009c0200 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000c41900, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x6060009c08c0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060009e2940, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x606000bca3e0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000bca5c0, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x60600065b940 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000659900, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x60600023cde0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060009040a0, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x6060001619c0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060002a2c00, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x6060006d96a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600035b820, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x60600086a5c0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000cbc960, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x6060006fdf40 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000452720, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x606000c07160 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060003f8480, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x6060009a7960 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000d16de0, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x6060005cd560 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060007c6280, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x6060002c5ee0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060008c5340, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x60600038bd00 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000b5d360, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x6060006d6160 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000a045a0, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x6060005686a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000b56e20, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x606000a1fae0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000b57240, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x606000abce00 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000771f80, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x60600142aa80 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600142ab40, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x606002537d80 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606002537cc0, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x606000280f40 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000542240, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x606001b12fe0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x60600072f0e0, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x6060014ae2a0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606001773980, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x606001061900 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000bd0620, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x606001013480 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x6060014ae480, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (hal_fence_extend): into_fence=0x60600089edc0, from_fence=0x606002c0c3c0 (OK)
:: IREE INVOKE (hal_fence_insert): fence=0x606000c9f680, semaphore=0x6060001683e0, value=91 (OK)
:: IREE INVOKE (vm_invoke[async]): context=0x60b000906510, f=0, wait_fence=0x60600089edc0 {0x6060001683e0:90, 0x6060001682c0:146}, signal_fence=0x60400144bb50 {0x6060001683e0:91}=================================================================
==12037==ERROR: AddressSanitizer: use-after-poison on address 0x62d00047e400 at pc 0x7f1b9823db9a bp 0x7ffdadc04c50 sp 0x7ffdadc04420
WRITE of size 72 at 0x62d00047e400 thread T0
#0 0x7f1b9823db99 in __asan_memcpy (/usr/lib/llvm-14/lib/clang/14.0.0/lib/linux/libclang_rt.asan-x86_64.so+0xccb99) (BuildId: 0fc20d2022c0d572c45850b6d559d9ccfabd5443)
#1 0x7f19603f099f in iree_hal_collective_batch_append /proc/self/cwd/external/iree_core/runtime/src/iree/hal/utils/collective_batch.c:102:36
#2 0x7f19603d7aab in iree_hal_cuda_stream_command_buffer_collective /proc/self/cwd/external/iree_core/runtime/src/iree/hal/drivers/cuda/stream_command_buffer.c:389:10
#3 0x7f196041aa42 in iree_hal_command_buffer_collective /proc/self/cwd/external/iree_core/runtime/src/iree/hal/command_buffer.c:494:26
#4 0x7f19603f3727 in iree_hal_deferred_command_buffer_apply_collective /proc/self/cwd/external/iree_core/runtime/src/iree/hal/utils/deferred_command_buffer.c:645:10
#5 0x7f19603f18c0 in iree_hal_deferred_command_buffer_apply /proc/self/cwd/external/iree_core/runtime/src/iree/hal/utils/deferred_command_buffer.c:923:16
#6 0x7f19603b01cd in iree_hal_cuda_device_queue_execute /proc/self/cwd/external/iree_core/runtime/src/iree/hal/drivers/cuda/cuda_device.c:527:7
#7 0x7f19604254e4 in iree_hal_device_queue_execute /proc/self/cwd/external/iree_core/runtime/src/iree/hal/device.c:249:26
#8 0x7f195e9d5dfe in iree_hal_module_device_queue_execute /proc/self/cwd/external/iree_core/runtime/src/iree/modules/hal/module.c:1014:10
#9 0x7f195eb4fd95 in iree_vm_shim_rIrrCrD_v /proc/self/cwd/external/iree_core/runtime/src/iree/vm/shims.c:68:1
#10 0x7f195eb31d97 in iree_vm_native_module_issue_call /proc/self/cwd/external/iree_core/runtime/src/iree/vm/native_module.c:338:7
#11 0x7f195eb309e9 in iree_vm_native_module_begin_call /proc/self/cwd/external/iree_core/runtime/src/iree/vm/native_module.c:392:10
#12 0x7f195ea4632c in iree_vm_bytecode_issue_import_call /proc/self/cwd/external/iree_core/runtime/src/iree/vm/bytecode/dispatch.c:452:7
#13 0x7f195ea41d2e in iree_vm_bytecode_call_import_variadic /proc/self/cwd/external/iree_core/runtime/src/iree/vm/bytecode/dispatch.c:609:10
#14 0x7f195ea29d9d in iree_vm_bytecode_dispatch /proc/self/cwd/external/iree_core/runtime/src/iree/vm/bytecode/dispatch.c:1667:5
#15 0x7f195e9ff564 in iree_vm_bytecode_dispatch_begin /proc/self/cwd/external/iree_core/runtime/src/iree/vm/bytecode/dispatch.c:636:10
#16 0x7f195e9f3fd0 in iree_vm_bytecode_module_begin_call /proc/self/cwd/external/iree_core/runtime/src/iree/vm/bytecode/module.c:779:10
#17 0x7f195eb10c2e in iree_vm_begin_invoke /proc/self/cwd/external/iree_core/runtime/src/iree/vm/invocation.c:504:7
#18 0x7f195eb0e64a in iree_vm_invoke /proc/self/cwd/external/iree_core/runtime/src/iree/vm/invocation.c:302:26
#19 0x7f195e95abb8 in iree::pjrt::LoadedExecutableInstance::BatchExecute(PJRT_LoadedExecutable_Execute_Args*) /proc/self/cwd/iree/integrations/pjrt/common/api_impl.cc:1797:9
#20 0x7f195e964d8d in iree::pjrt::LoadedExecutableInstance::BindApi(PJRT_Api*)::$_54::operator()(PJRT_LoadedExecutable_Execute_Args*) const /proc/self/cwd/iree/integrations/pjrt/common/api_impl.cc:1590:61
#21 0x7f195e964d34 in iree::pjrt::LoadedExecutableInstance::BindApi(PJRT_Api*)::$_54::__invoke(PJRT_LoadedExecutable_Execute_Args*) /proc/self/cwd/iree/integrations/pjrt/common/api_impl.cc:1587:8
#22 0x7f1b8e11a00a in xla::PjRtCApiLoadedExecutable::Execute(absl::lts_20230125::Span<std::vector<xla::PjRtBuffer*, std::allocator<xla::PjRtBuffer*> > const>, xla::ExecuteOptions const&, std::optional<std::vector<xla::PjRtFuture<absl::lts_20230125::Status>, std::allocator<xla::PjRtFuture<absl::lts_20230125::Status> > > >&) (/usr/local/lib/python3.10/dist-packages/jaxlib/xla_extension.so+0xcaf00a) (BuildId: 8d2c019e2cf0e2ad84df858ea45f0237)
#23 0x7f1b906deca6 in xla::ifrt::PjRtLoadedExecutable::Execute(absl::lts_20230125::Span<tsl::RCReference<xla::ifrt::Array> >, xla::ExecuteOptions const&, std::optional<xla::ifrt::DeviceList>) (/usr/local/lib/python3.10/dist-packages/jaxlib/xla_extension.so+0x3273ca6) (BuildId: 8d2c019e2cf0e2ad84df858ea45f0237)
#24 0x7f1b8e0c8836 in absl::lts_20230125::StatusOr<xla::PyExecuteResults> xla::(anonymous namespace)::ExecuteShardedOnLocalDevicesInternal<std::variant<xla::PyArray, std::vector<xla::PyArray, std::allocator<xla::PyArray> > >, xla::(anonymous namespace)::ShardedBufferAdapter<std::variant<xla::PyArray, std::vector<xla::PyArray, std::allocator<xla::PyArray> > > > >(xla::ExecuteOptions const&, std::shared_ptr<xla::PyClient> const&, xla::ifrt::LoadedExecutable*, absl::lts_20230125::Span<std::variant<xla::PyArray, std::vector<xla::PyArray, std::allocator<xla::PyArray> > > const>, std::optional<std::vector<xla::PjRtFuture<absl::lts_20230125::Status>, std::allocator<xla::PjRtFuture<absl::lts_20230125::Status> > > >&) py_executable.cc
#25 0x7f1b8e0c9b8d in xla::PyLoadedExecutable::ExecuteSharded(std::vector<std::variant<xla::PyArray, std::vector<xla::PyArray, std::allocator<xla::PyArray> > >, std::allocator<std::variant<xla::PyArray, std::vector<xla::PyArray, std::allocator<xla::PyArray> > > > >, bool) (/usr/local/lib/python3.10/dist-packages/jaxlib/xla_extension.so+0xc5eb8d) (BuildId: 8d2c019e2cf0e2ad84df858ea45f0237)
#26 0x7f1b8dd9b1d3 in void pybind11::cpp_function::initialize<xla::ValueOrThrowWrapper<absl::lts_20230125::StatusOr<xla::PyExecuteResults> (std::vector<std::variant<xla::PyArray, std::vector<xla::PyArray, std::allocator<xla::PyArray> > >, std::allocator<std::variant<xla::PyArray, std::vector<xla::PyArray, std::allocator<xla::PyArray> > > > >, bool), xla::PyLoadedExecutable>, xla::PyExecuteResults, xla::PyLoadedExecutable&, std::vector<std::variant<xla::PyArray, std::vector<xla::PyArray, std::allocator<xla::PyArray> > >, std::allocator<std::variant<xla::PyArray, std::vector<xla::PyArray, std::allocator<xla::PyArray> > > > >, bool, pybind11::name, pybind11::is_method, pybind11::sibling, pybind11::arg, pybind11::arg_v>(xla::ValueOrThrowWrapper<absl::lts_20230125::StatusOr<xla::PyExecuteResults> (std::vector<std::variant<xla::PyArray, std::vector<xla::PyArray, std::allocator<xla::PyArray> > >, std::allocator<std::variant<xla::PyArray, std::vector<xla::PyArray, std::allocator<xla::PyArray> > > > >, bool), xla::PyLoadedExecutable>&&, xla::PyExecuteResults (*)(xla::PyLoadedExecutable&, std::vector<std::variant<xla::PyArray, std::vector<xla::PyArray, std::allocator<xla::PyArray> > >, std::allocator<std::variant<xla::PyArray, std::vector<xla::PyArray, std::allocator<xla::PyArray> > > > >, bool), pybind11::name const&, pybind11::is_method const&, pybind11::sibling const&, pybind11::arg const&, pybind11::arg_v const&)::'lambda1'(pybind11::detail::function_call&)::operator()(pybind11::detail::function_call&) const (/usr/local/lib/python3.10/dist-packages/jaxlib/xla_extension.so+0x9301d3) (BuildId: 8d2c019e2cf0e2ad84df858ea45f0237)
#27 0x7f1b8dd6fed0 in pybind11::cpp_function::dispatcher(_object*, _object*, _object*) (/usr/local/lib/python3.10/dist-packages/jaxlib/xla_extension.so+0x904ed0) (BuildId: 8d2c019e2cf0e2ad84df858ea45f0237)
#28 0x55e95b1b499d (/usr/bin/python3.10+0x15c99d) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#29 0x55e95b1ab4aa in _PyObject_MakeTpCall (/usr/bin/python3.10+0x1534aa) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#30 0x55e95b1c2f0a (/usr/bin/python3.10+0x16af0a) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#31 0x55e95b1a3461 in _PyEval_EvalFrameDefault (/usr/bin/python3.10+0x14b461) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#32 0x55e95b1b51eb in _PyFunction_Vectorcall (/usr/bin/python3.10+0x15d1eb) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#33 0x55e95b19faef in _PyEval_EvalFrameDefault (/usr/bin/python3.10+0x147aef) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#34 0x55e95b1aa633 in _PyObject_FastCallDictTstate (/usr/bin/python3.10+0x152633) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#35 0x55e95b1bfd10 in _PyObject_Call_Prepend (/usr/bin/python3.10+0x167d10) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#36 0x55e95b2dd60f (/usr/bin/python3.10+0x28560f) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#37 0x55e95b1c387a in PyObject_Call (/usr/bin/python3.10+0x16b87a) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#38 0x55e95b19faef in _PyEval_EvalFrameDefault (/usr/bin/python3.10+0x147aef) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#39 0x55e95b1b51eb in _PyFunction_Vectorcall (/usr/bin/python3.10+0x15d1eb) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#40 0x55e95b19faef in _PyEval_EvalFrameDefault (/usr/bin/python3.10+0x147aef) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#41 0x55e95b1b51eb in _PyFunction_Vectorcall (/usr/bin/python3.10+0x15d1eb) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#42 0x7f1b8de43a4c in jax::PmapFunction::Call(pybind11::handle, _object* const*, unsigned long, _object*) (/usr/local/lib/python3.10/dist-packages/jaxlib/xla_extension.so+0x9d8a4c) (BuildId: 8d2c019e2cf0e2ad84df858ea45f0237)
#43 0x7f1b8de4424a in JaxPmapFunction_tp_vectorcall (/usr/local/lib/python3.10/dist-packages/jaxlib/xla_extension.so+0x9d924a) (BuildId: 8d2c019e2cf0e2ad84df858ea45f0237)
#44 0x55e95b19d784 in _PyEval_EvalFrameDefault (/usr/bin/python3.10+0x145784) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#45 0x55e95b1b51eb in _PyFunction_Vectorcall (/usr/bin/python3.10+0x15d1eb) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#46 0x55e95b19d784 in _PyEval_EvalFrameDefault (/usr/bin/python3.10+0x145784) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#47 0x55e95b1b51eb in _PyFunction_Vectorcall (/usr/bin/python3.10+0x15d1eb) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#48 0x55e95b19d8ca in _PyEval_EvalFrameDefault (/usr/bin/python3.10+0x1458ca) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#49 0x55e95b1b51eb in _PyFunction_Vectorcall (/usr/bin/python3.10+0x15d1eb) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#50 0x55e95b19d8ca in _PyEval_EvalFrameDefault (/usr/bin/python3.10+0x1458ca) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#51 0x55e95b1b51eb in _PyFunction_Vectorcall (/usr/bin/python3.10+0x15d1eb) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#52 0x55e95b1c38e1 in PyObject_Call (/usr/bin/python3.10+0x16b8e1) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#53 0x55e95b19faef in _PyEval_EvalFrameDefault (/usr/bin/python3.10+0x147aef) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#54 0x55e95b1b51eb in _PyFunction_Vectorcall (/usr/bin/python3.10+0x15d1eb) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#55 0x55e95b19d8ca in _PyEval_EvalFrameDefault (/usr/bin/python3.10+0x1458ca) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#56 0x55e95b1b51eb in _PyFunction_Vectorcall (/usr/bin/python3.10+0x15d1eb) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#57 0x55e95b19eade in _PyEval_EvalFrameDefault (/usr/bin/python3.10+0x146ade) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#58 0x55e95b1b51eb in _PyFunction_Vectorcall (/usr/bin/python3.10+0x15d1eb) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#59 0x55e95b19eade in _PyEval_EvalFrameDefault (/usr/bin/python3.10+0x146ade) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#60 0x55e95b1b51eb in _PyFunction_Vectorcall (/usr/bin/python3.10+0x15d1eb) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#61 0x55e95b19eade in _PyEval_EvalFrameDefault (/usr/bin/python3.10+0x146ade) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#62 0x55e95b1b51eb in _PyFunction_Vectorcall (/usr/bin/python3.10+0x15d1eb) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#63 0x55e95b19d784 in _PyEval_EvalFrameDefault (/usr/bin/python3.10+0x145784) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#64 0x55e95b1b51eb in _PyFunction_Vectorcall (/usr/bin/python3.10+0x15d1eb) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#65 0x55e95b19d784 in _PyEval_EvalFrameDefault (/usr/bin/python3.10+0x145784) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#66 0x55e95b1b51eb in _PyFunction_Vectorcall (/usr/bin/python3.10+0x15d1eb) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#67 0x55e95b19eade in _PyEval_EvalFrameDefault (/usr/bin/python3.10+0x146ade) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#68 0x55e95b199ed5 (/usr/bin/python3.10+0x141ed5) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#69 0x55e95b290365 in PyEval_EvalCode (/usr/bin/python3.10+0x238365) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#70 0x55e95b2bd107 (/usr/bin/python3.10+0x265107) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#71 0x55e95b2b5f5a (/usr/bin/python3.10+0x25df5a) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#72 0x55e95b2bce54 (/usr/bin/python3.10+0x264e54) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#73 0x55e95b2bc337 in _PyRun_SimpleFileObject (/usr/bin/python3.10+0x264337) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#74 0x55e95b2bc032 in _PyRun_AnyFileObject (/usr/bin/python3.10+0x264032) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#75 0x55e95b2ad2dd in Py_RunMain (/usr/bin/python3.10+0x2552dd) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#76 0x55e95b28332c in Py_BytesMain (/usr/bin/python3.10+0x22b32c) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
#77 0x7f1b97e3ed8f (/usr/lib/x86_64-linux-gnu/libc.so.6+0x29d8f) (BuildId: 69389d485a9793dbe873f0ea2c93e02efaa9aa3d)
#78 0x7f1b97e3ee3f in __libc_start_main (/usr/lib/x86_64-linux-gnu/libc.so.6+0x29e3f) (BuildId: 69389d485a9793dbe873f0ea2c93e02efaa9aa3d)
#79 0x55e95b283224 in _start (/usr/bin/python3.10+0x22b224) (BuildId: 148e086667839ef13939196984d6f717c331bd76)
0x62d00047e400 is located 0 bytes inside of 32768-byte region [0x62d00047e400,0x62d000486400)
allocated by thread T0 here:
#0 0x7f1b9823e7ee in __interceptor_malloc (/usr/lib/llvm-14/lib/clang/14.0.0/lib/linux/libclang_rt.asan-x86_64.so+0xcd7ee) (BuildId: 0fc20d2022c0d572c45850b6d559d9ccfabd5443)
#1 0x7f1960444f70 in iree_allocator_system_alloc /proc/self/cwd/external/iree_core/runtime/src/iree/base/allocator.c:88:17
#2 0x7f1960444a39 in iree_allocator_system_ctl /proc/self/cwd/external/iree_core/runtime/src/iree/base/allocator.c:126:14
#3 0x7f1960443ca8 in iree_allocator_issue_alloc /proc/self/cwd/external/iree_core/runtime/src/iree/base/allocator.c:27:10
#4 0x7f1960443f44 in iree_allocator_malloc_uninitialized /proc/self/cwd/external/iree_core/runtime/src/iree/base/allocator.c:38:10
#5 0x7f19603fc567 in iree_arena_block_pool_acquire /proc/self/cwd/external/iree_core/runtime/src/iree/base/internal/arena.c:77:5
#6 0x7f19603fd455 in iree_arena_allocate /proc/self/cwd/external/iree_core/runtime/src/iree/base/internal/arena.c:187:5
#7 0x7f19603f9cc6 in iree_hal_cmd_list_append_command /proc/self/cwd/external/iree_core/runtime/src/iree/hal/utils/deferred_command_buffer.c:113:3
#8 0x7f19603f7eb3 in iree_hal_deferred_command_buffer_push_constants /proc/self/cwd/external/iree_core/runtime/src/iree/hal/utils/deferred_command_buffer.c:672:3
#9 0x7f196041ad49 in iree_hal_command_buffer_push_constants /proc/self/cwd/external/iree_core/runtime/src/iree/hal/command_buffer.c:518:26
#10 0x7f195e9d058c in iree_hal_module_command_buffer_push_constants /proc/self/cwd/external/iree_core/runtime/src/iree/modules/hal/module.c:779:10
#11 0x7f195eb49c15 in iree_vm_shim_rriCiD_v /proc/self/cwd/external/iree_core/runtime/src/iree/vm/shims.c:56:1
#12 0x7f195eb31d97 in iree_vm_native_module_issue_call /proc/self/cwd/external/iree_core/runtime/src/iree/vm/native_module.c:338:7
#13 0x7f195eb309e9 in iree_vm_native_module_begin_call /proc/self/cwd/external/iree_core/runtime/src/iree/vm/native_module.c:392:10
#14 0x7f195ea4632c in iree_vm_bytecode_issue_import_call /proc/self/cwd/external/iree_core/runtime/src/iree/vm/bytecode/dispatch.c:452:7
#15 0x7f195ea41d2e in iree_vm_bytecode_call_import_variadic /proc/self/cwd/external/iree_core/runtime/src/iree/vm/bytecode/dispatch.c:609:10
#16 0x7f195ea29d9d in iree_vm_bytecode_dispatch /proc/self/cwd/external/iree_core/runtime/src/iree/vm/bytecode/dispatch.c:1667:5
#17 0x7f195e9ff564 in iree_vm_bytecode_dispatch_begin /proc/self/cwd/external/iree_core/runtime/src/iree/vm/bytecode/dispatch.c:636:10
#18 0x7f195e9f3fd0 in iree_vm_bytecode_module_begin_call /proc/self/cwd/external/iree_core/runtime/src/iree/vm/bytecode/module.c:779:10
#19 0x7f195eb10c2e in iree_vm_begin_invoke /proc/self/cwd/external/iree_core/runtime/src/iree/vm/invocation.c:504:7
#20 0x7f195eb0e64a in iree_vm_invoke /proc/self/cwd/external/iree_core/runtime/src/iree/vm/invocation.c:302:26
#21 0x7f195e95abb8 in iree::pjrt::LoadedExecutableInstance::BatchExecute(PJRT_LoadedExecutable_Execute_Args*) /proc/self/cwd/iree/integrations/pjrt/common/api_impl.cc:1797:9
#22 0x7f195e964d8d in iree::pjrt::LoadedExecutableInstance::BindApi(PJRT_Api*)::$_54::operator()(PJRT_LoadedExecutable_Execute_Args*) const /proc/self/cwd/iree/integrations/pjrt/common/api_impl.cc:1590:61
#23 0x7f195e964d34 in iree::pjrt::LoadedExecutableInstance::BindApi(PJRT_Api*)::$_54::__invoke(PJRT_LoadedExecutable_Execute_Args*) /proc/self/cwd/iree/integrations/pjrt/common/api_impl.cc:1587:8
#24 0x7f1b8e11a00a in xla::PjRtCApiLoadedExecutable::Execute(absl::lts_20230125::Span<std::vector<xla::PjRtBuffer*, std::allocator<xla::PjRtBuffer*> > const>, xla::ExecuteOptions const&, std::optional<std::vector<xla::PjRtFuture<absl::lts_20230125::Status>, std::allocator<xla::PjRtFuture<absl::lts_20230125::Status> > > >&) (/usr/local/lib/python3.10/dist-packages/jaxlib/xla_extension.so+0xcaf00a) (BuildId: 8d2c019e2cf0e2ad84df858ea45f0237)
#25 0x7f1b906deca6 in xla::ifrt::PjRtLoadedExecutable::Execute(absl::lts_20230125::Span<tsl::RCReference<xla::ifrt::Array> >, xla::ExecuteOptions const&, std::optional<xla::ifrt::DeviceList>) (/usr/local/lib/python3.10/dist-packages/jaxlib/xla_extension.so+0x3273ca6) (BuildId: 8d2c019e2cf0e2ad84df858ea45f0237)
#26 0x7f1b8e0c8836 in absl::lts_20230125::StatusOr<xla::PyExecuteResults> xla::(anonymous namespace)::ExecuteShardedOnLocalDevicesInternal<std::variant<xla::PyArray, std::vector<xla::PyArray, std::allocator<xla::PyArray> > >, xla::(anonymous namespace)::ShardedBufferAdapter<std::variant<xla::PyArray, std::vector<xla::PyArray, std::allocator<xla::PyArray> > > > >(xla::ExecuteOptions const&, std::shared_ptr<xla::PyClient> const&, xla::ifrt::LoadedExecutable*, absl::lts_20230125::Span<std::variant<xla::PyArray, std::vector<xla::PyArray, std::allocator<xla::PyArray> > > const>, std::optional<std::vector<xla::PjRtFuture<absl::lts_20230125::Status>, std::allocator<xla::PjRtFuture<absl::lts_20230125::Status> > > >&) py_executable.cc
#27 0x7f1b8e0c9b8d in xla::PyLoadedExecutable::ExecuteSharded(std::vector<std::variant<xla::PyArray, std::vector<xla::PyArray, std::allocator<xla::PyArray> > >, std::allocator<std::variant<xla::PyArray, std::vector<xla::PyArray, std::allocator<xla::PyArray> > > > >, bool) (/usr/local/lib/python3.10/dist-packages/jaxlib/xla_extension.so+0xc5eb8d) (BuildId: 8d2c019e2cf0e2ad84df858ea45f0237)
#28 0x7f1b8dd9b1d3 in void pybind11::cpp_function::initialize<xla::ValueOrThrowWrapper<absl::lts_20230125::StatusOr<xla::PyExecuteResults> (std::vector<std::variant<xla::PyArray, std::vector<xla::PyArray, std::allocator<xla::PyArray> > >, std::allocator<std::variant<xla::PyArray, std::vector<xla::PyArray, std::allocator<xla::PyArray> > > > >, bool), xla::PyLoadedExecutable>, xla::PyExecuteResults, xla::PyLoadedExecutable&, std::vector<std::variant<xla::PyArray, std::vector<xla::PyArray, std::allocator<xla::PyArray> > >, std::allocator<std::variant<xla::PyArray, std::vector<xla::PyArray, std::allocator<xla::PyArray> > > > >, bool, pybind11::name, pybind11::is_method, pybind11::sibling, pybind11::arg, pybind11::arg_v>(xla::ValueOrThrowWrapper<absl::lts_20230125::StatusOr<xla::PyExecuteResults> (std::vector<std::variant<xla::PyArray, std::vector<xla::PyArray, std::allocator<xla::PyArray> > >, std::allocator<std::variant<xla::PyArray, std::vector<xla::PyArray, std::allocator<xla::PyArray> > > > >, bool), xla::PyLoadedExecutable>&&, xla::PyExecuteResults (*)(xla::PyLoadedExecutable&, std::vector<std::variant<xla::PyArray, std::vector<xla::PyArray, std::allocator<xla::PyArray> > >, std::allocator<std::variant<xla::PyArray, std::vector<xla::PyArray, std::allocator<xla::PyArray> > > > >, bool), pybind11::name const&, pybind11::is_method const&, pybind11::sibling const&, pybind11::arg const&, pybind11::arg_v const&)::'lambda1'(pybind11::detail::function_call&)::operator()(pybind11::detail::function_call&) const (/usr/local/lib/python3.10/dist-packages/jaxlib/xla_extension.so+0x9301d3) (BuildId: 8d2c019e2cf0e2ad84df858ea45f0237)
#29 0x7f1b8dd6fed0 in pybind11::cpp_function::dispatcher(_object*, _object*, _object*) (/usr/local/lib/python3.10/dist-packages/jaxlib/xla_extension.so+0x904ed0) (BuildId: 8d2c019e2cf0e2ad84df858ea45f0237)
SUMMARY: AddressSanitizer: use-after-poison (/usr/lib/llvm-14/lib/clang/14.0.0/lib/linux/libclang_rt.asan-x86_64.so+0xccb99) (BuildId: 0fc20d2022c0d572c45850b6d559d9ccfabd5443) in __asan_memcpy
Shadow bytes around the buggy address:
0x0c5a80087c30: fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa
0x0c5a80087c40: fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa
0x0c5a80087c50: fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa
0x0c5a80087c60: fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa
0x0c5a80087c70: fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa
=>0x0c5a80087c80:[f7]f7 f7 f7 f7 f7 f7 f7 f7 f7 f7 f7 f7 f7 f7 f7
0x0c5a80087c90: f7 f7 f7 f7 f7 f7 f7 f7 f7 f7 f7 f7 f7 f7 f7 f7
0x0c5a80087ca0: f7 f7 f7 f7 f7 f7 f7 f7 f7 f7 f7 f7 f7 f7 f7 f7
0x0c5a80087cb0: f7 f7 f7 f7 f7 f7 f7 f7 f7 f7 f7 f7 f7 f7 f7 f7
0x0c5a80087cc0: f7 f7 f7 f7 f7 f7 f7 f7 f7 f7 f7 f7 f7 f7 f7 f7
0x0c5a80087cd0: f7 f7 f7 f7 f7 f7 f7 f7 f7 f7 f7 f7 f7 f7 f7 f7
Shadow byte legend (one shadow byte represents 8 application bytes):
Addressable: 00
Partially addressable: 01 02 03 04 05 06 07
Heap left redzone: fa
Freed heap region: fd
Stack left redzone: f1
Stack mid redzone: f2
Stack right redzone: f3
Stack after return: f5
Stack use after scope: f8
Global redzone: f9
Global init order: f6
Poisoned by user: f7
Container overflow: fc
Array cookie: ac
Intra object redzone: bb
ASan internal: fe
Left alloca redzone: ca
Right alloca redzone: cb
==12037==ABORTING
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment