[2025-01-15 22:08:14,475 I 2377 2377] (raylet) main.cc:180: Setting cluster ID to: d0e6ae35423199889a5d80b472942a3bcc97627da03747d855262da7 [2025-01-15 22:08:14,484 I 2377 2377] (raylet) main.cc:289: Raylet is not set to kill unknown children. [2025-01-15 22:08:14,485 I 2377 2377] (raylet) io_service_pool.cc:35: IOServicePool is running with 1 io_service. [2025-01-15 22:08:14,486 I 2377 2377] (raylet) main.cc:419: Setting node ID node_id=db1eca1a9346159f9fc7887dfa455f72c2c1f83b3d76b96b961421d8 [2025-01-15 22:08:14,486 I 2377 2377] (raylet) store_runner.cc:32: Allowing the Plasma store to use up to 2.14748GB of memory. [2025-01-15 22:08:14,486 I 2377 2377] (raylet) store_runner.cc:48: Starting object store with directory /dev/shm, fallback /tmp/ray, and huge page support disabled [2025-01-15 22:08:14,487 I 2377 2406] (raylet) dlmalloc.cc:154: create_and_mmap_buffer(2147483656, /dev/shm/plasmaXXXXXX) [2025-01-15 22:08:14,488 I 2377 2406] (raylet) store.cc:564: Plasma store debug dump: Current usage: 0 / 2.14748 GB - num bytes created total: 0 0 pending objects of total size 0MB - objects spillable: 0 - bytes spillable: 0 - objects unsealed: 0 - bytes unsealed: 0 - objects in use: 0 - bytes in use: 0 - objects evictable: 0 - bytes evictable: 0 - objects created by worker: 0 - bytes created by worker: 0 - objects restored: 0 - bytes restored: 0 - objects received: 0 - bytes received: 0 - objects errored: 0 - bytes errored: 0 [2025-01-15 22:08:14,492 I 2377 2377] (raylet) grpc_server.cc:134: ObjectManager server started, listening on port 35937. [2025-01-15 22:08:14,495 I 2377 2377] (raylet) worker_killing_policy.cc:101: Running GroupByOwner policy. [2025-01-15 22:08:14,496 I 2377 2377] (raylet) memory_monitor.cc:47: MemoryMonitor initialized with usage threshold at 94999994368 bytes (0.95 system memory), total system memory bytes: 99999997952 [2025-01-15 22:08:14,496 I 2377 2377] (raylet) node_manager.cc:287: Initializing NodeManager node_id=db1eca1a9346159f9fc7887dfa455f72c2c1f83b3d76b96b961421d8 [2025-01-15 22:08:14,497 I 2377 2377] (raylet) grpc_server.cc:134: NodeManager server started, listening on port 42107. [2025-01-15 22:08:14,502 I 2377 2445] (raylet) agent_manager.cc:77: Monitor agent process with name dashboard_agent/424238335 [2025-01-15 22:08:14,504 I 2377 2447] (raylet) agent_manager.cc:77: Monitor agent process with name runtime_env_agent [2025-01-15 22:08:14,505 I 2377 2377] (raylet) event.cc:493: Ray Event initialized for RAYLET [2025-01-15 22:08:14,505 I 2377 2377] (raylet) event.cc:324: Set ray event level to warning [2025-01-15 22:08:14,507 I 2377 2377] (raylet) raylet.cc:134: Raylet of id, db1eca1a9346159f9fc7887dfa455f72c2c1f83b3d76b96b961421d8 started. Raylet consists of node_manager and object_manager. node_manager address: 192.168.0.2:42107 object_manager address: 192.168.0.2:35937 hostname: 0cd925b1f73b [2025-01-15 22:08:14,509 I 2377 2377] (raylet) node_manager.cc:525: [state-dump] NodeManager: [state-dump] Node ID: db1eca1a9346159f9fc7887dfa455f72c2c1f83b3d76b96b961421d8 [state-dump] Node name: 192.168.0.2 [state-dump] InitialConfigResources: {GPU: 20000, accelerator_type:A40: 10000, CPU: 200000, object_store_memory: 21474836480000, node:192.168.0.2: 10000, memory: 854930231300000, node:__internal_head__: 10000} [state-dump] ClusterTaskManager: [state-dump] ========== Node: db1eca1a9346159f9fc7887dfa455f72c2c1f83b3d76b96b961421d8 ================= [state-dump] Infeasible queue length: 0 [state-dump] Schedule queue length: 0 [state-dump] Dispatch queue length: 0 [state-dump] num_waiting_for_resource: 0 [state-dump] num_waiting_for_plasma_memory: 0 [state-dump] num_waiting_for_remote_node_resources: 0 [state-dump] num_worker_not_started_by_job_config_not_exist: 0 [state-dump] num_worker_not_started_by_registration_timeout: 0 [state-dump] num_tasks_waiting_for_workers: 0 [state-dump] num_cancelled_tasks: 0 [state-dump] cluster_resource_scheduler state: [state-dump] Local id: -752876438313338089 Local resources: {"total":{GPU: [10000, 10000], object_store_memory: [21474836480000], node:192.168.0.2: [10000], node:__internal_head__: [10000], accelerator_type:A40: [10000], CPU: [200000], memory: [854930231300000]}}, "available": {GPU: [10000, 10000], object_store_memory: [21474836480000], node:192.168.0.2: [10000], node:__internal_head__: [10000], accelerator_type:A40: [10000], CPU: [200000], memory: [854930231300000]}}, "labels":{"ray.io/node_id":"db1eca1a9346159f9fc7887dfa455f72c2c1f83b3d76b96b961421d8",} is_draining: 0 is_idle: 1 Cluster resources: node id: -752876438313338089{"total":{node:192.168.0.2: 10000, object_store_memory: 21474836480000, CPU: 200000, GPU: 20000, accelerator_type:A40: 10000, memory: 854930231300000, node:__internal_head__: 10000}}, "available": {node:192.168.0.2: 10000, object_store_memory: 21474836480000, CPU: 200000, GPU: 20000, accelerator_type:A40: 10000, memory: 854930231300000, node:__internal_head__: 10000}}, "labels":{"ray.io/node_id":"db1eca1a9346159f9fc7887dfa455f72c2c1f83b3d76b96b961421d8",}, "is_draining": 0, "draining_deadline_timestamp_ms": -1} { "placment group locations": [], "node to bundles": []} [state-dump] Waiting tasks size: 0 [state-dump] Number of executing tasks: 0 [state-dump] Number of pinned task arguments: 0 [state-dump] Number of total spilled tasks: 0 [state-dump] Number of spilled waiting tasks: 0 [state-dump] Number of spilled unschedulable tasks: 0 [state-dump] Resource usage { [state-dump] } [state-dump] Backlog Size per scheduling descriptor :{workerId: num backlogs}: [state-dump] [state-dump] Running tasks by scheduling class: [state-dump] ================================================== [state-dump] [state-dump] ClusterResources: [state-dump] LocalObjectManager: [state-dump] - num pinned objects: 0 [state-dump] - pinned objects size: 0 [state-dump] - num objects pending restore: 0 [state-dump] - num objects pending spill: 0 [state-dump] - num bytes pending spill: 0 [state-dump] - num bytes currently spilled: 0 [state-dump] - cumulative spill requests: 0 [state-dump] - cumulative restore requests: 0 [state-dump] - spilled objects pending delete: 0 [state-dump] [state-dump] ObjectManager: [state-dump] - num local objects: 0 [state-dump] - num unfulfilled push requests: 0 [state-dump] - num object pull requests: 0 [state-dump] - num chunks received total: 0 [state-dump] - num chunks received failed (all): 0 [state-dump] - num chunks received failed / cancelled: 0 [state-dump] - num chunks received failed / plasma error: 0 [state-dump] Event stats: [state-dump] Global stats: 0 total (0 active) [state-dump] Queueing time: mean = -nan s, max = -0.000 s, min = 9223372036.855 s, total = 0.000 s [state-dump] Execution time: mean = -nan s, total = 0.000 s [state-dump] Event stats: [state-dump] PushManager: [state-dump] - num pushes in flight: 0 [state-dump] - num chunks in flight: 0 [state-dump] - num chunks remaining: 0 [state-dump] - max chunks allowed: 409 [state-dump] OwnershipBasedObjectDirectory: [state-dump] - num listeners: 0 [state-dump] - cumulative location updates: 0 [state-dump] - num location updates per second: 70194581348060000.000 [state-dump] - num location lookups per second: 70194581348048000.000 [state-dump] - num locations added per second: 0.000 [state-dump] - num locations removed per second: 0.000 [state-dump] BufferPool: [state-dump] - create buffer state map size: 0 [state-dump] PullManager: [state-dump] - num bytes available for pulled objects: 2147483648 [state-dump] - num bytes being pulled (all): 0 [state-dump] - num bytes being pulled / pinned: 0 [state-dump] - get request bundles: BundlePullRequestQueue{0 total, 0 active, 0 inactive, 0 unpullable} [state-dump] - wait request bundles: BundlePullRequestQueue{0 total, 0 active, 0 inactive, 0 unpullable} [state-dump] - task request bundles: BundlePullRequestQueue{0 total, 0 active, 0 inactive, 0 unpullable} [state-dump] - first get request bundle: N/A [state-dump] - first wait request bundle: N/A [state-dump] - first task request bundle: N/A [state-dump] - num objects queued: 0 [state-dump] - num objects actively pulled (all): 0 [state-dump] - num objects actively pulled / pinned: 0 [state-dump] - num bundles being pulled: 0 [state-dump] - num pull retries: 0 [state-dump] - max timeout seconds: 0 [state-dump] - max timeout request is already processed. No entry. [state-dump] [state-dump] WorkerPool: [state-dump] - registered jobs: 0 [state-dump] - process_failed_job_config_missing: 0 [state-dump] - process_failed_rate_limited: 0 [state-dump] - process_failed_pending_registration: 0 [state-dump] - process_failed_runtime_env_setup_failed: 0 [state-dump] - num PYTHON workers: 0 [state-dump] - num PYTHON drivers: 0 [state-dump] - num PYTHON pending start requests: 0 [state-dump] - num PYTHON pending registration requests: 0 [state-dump] - num object spill callbacks queued: 0 [state-dump] - num object restore queued: 0 [state-dump] - num util functions queued: 0 [state-dump] - num idle workers: 0 [state-dump] TaskDependencyManager: [state-dump] - task deps map size: 0 [state-dump] - get req map size: 0 [state-dump] - wait req map size: 0 [state-dump] - local objects map size: 0 [state-dump] WaitManager: [state-dump] - num active wait requests: 0 [state-dump] Subscriber: [state-dump] Channel WORKER_OBJECT_LOCATIONS_CHANNEL [state-dump] - cumulative subscribe requests: 0 [state-dump] - cumulative unsubscribe requests: 0 [state-dump] - active subscribed publishers: 0 [state-dump] - cumulative published messages: 0 [state-dump] - cumulative processed messages: 0 [state-dump] Channel WORKER_REF_REMOVED_CHANNEL [state-dump] - cumulative subscribe requests: 0 [state-dump] - cumulative unsubscribe requests: 0 [state-dump] - active subscribed publishers: 0 [state-dump] - cumulative published messages: 0 [state-dump] - cumulative processed messages: 0 [state-dump] Channel WORKER_OBJECT_EVICTION [state-dump] - cumulative subscribe requests: 0 [state-dump] - cumulative unsubscribe requests: 0 [state-dump] - active subscribed publishers: 0 [state-dump] - cumulative published messages: 0 [state-dump] - cumulative processed messages: 0 [state-dump] num async plasma notifications: 0 [state-dump] Remote node managers: [state-dump] Event stats: [state-dump] Global stats: 27 total (13 active) [state-dump] Queueing time: mean = 1.327 ms, max = 9.363 ms, min = 34.550 us, total = 35.819 ms [state-dump] Execution time: mean = 1.075 ms, total = 29.035 ms [state-dump] Event stats: [state-dump] PeriodicalRunner.RunFnPeriodically - 11 total (2 active, 1 running), Execution time: mean = 185.388 us, total = 2.039 ms, Queueing time: mean = 3.232 ms, max = 9.363 ms, min = 34.550 us, total = 35.547 ms [state-dump] ray::rpc::InternalPubSubGcsService.grpc_client.GcsSubscriberCommandBatch.OnReplyReceived - 1 total (1 active), Execution time: mean = 0.000 s, total = 0.000 s, Queueing time: mean = 0.000 s, max = -0.000 s, min = 9223372036.855 s, total = 0.000 s [state-dump] NodeManager.deadline_timer.debug_state_dump - 1 total (1 active), Execution time: mean = 0.000 s, total = 0.000 s, Queueing time: mean = 0.000 s, max = -0.000 s, min = 9223372036.855 s, total = 0.000 s [state-dump] RayletWorkerPool.deadline_timer.kill_idle_workers - 1 total (1 active), Execution time: mean = 0.000 s, total = 0.000 s, Queueing time: mean = 0.000 s, max = -0.000 s, min = 9223372036.855 s, total = 0.000 s [state-dump] ray::rpc::NodeInfoGcsService.grpc_client.RegisterNode - 1 total (0 active), Execution time: mean = 1.938 ms, total = 1.938 ms, Queueing time: mean = 0.000 s, max = -0.000 s, min = 9223372036.855 s, total = 0.000 s [state-dump] NodeManager.GCTaskFailureReason - 1 total (1 active), Execution time: mean = 0.000 s, total = 0.000 s, Queueing time: mean = 0.000 s, max = -0.000 s, min = 9223372036.855 s, total = 0.000 s [state-dump] ray::rpc::InternalPubSubGcsService.grpc_client.GcsSubscriberCommandBatch - 1 total (0 active), Execution time: mean = 1.589 ms, total = 1.589 ms, Queueing time: mean = 0.000 s, max = -0.000 s, min = 9223372036.855 s, total = 0.000 s [state-dump] ray::rpc::InternalKVGcsService.grpc_client.GetInternalConfig.OnReplyReceived - 1 total (0 active), Execution time: mean = 21.207 ms, total = 21.207 ms, Queueing time: mean = 103.446 us, max = 103.446 us, min = 103.446 us, total = 103.446 us [state-dump] ray::rpc::NodeInfoGcsService.grpc_client.RegisterNode.OnReplyReceived - 1 total (0 active), Execution time: mean = 419.492 us, total = 419.492 us, Queueing time: mean = 168.673 us, max = 168.673 us, min = 168.673 us, total = 168.673 us [state-dump] ray::rpc::InternalPubSubGcsService.grpc_client.GcsSubscriberPoll - 1 total (1 active), Execution time: mean = 0.000 s, total = 0.000 s, Queueing time: mean = 0.000 s, max = -0.000 s, min = 9223372036.855 s, total = 0.000 s [state-dump] NodeManager.ScheduleAndDispatchTasks - 1 total (1 active), Execution time: mean = 0.000 s, total = 0.000 s, Queueing time: mean = 0.000 s, max = -0.000 s, min = 9223372036.855 s, total = 0.000 s [state-dump] ClusterResourceManager.ResetRemoteNodeView - 1 total (1 active), Execution time: mean = 0.000 s, total = 0.000 s, Queueing time: mean = 0.000 s, max = -0.000 s, min = 9223372036.855 s, total = 0.000 s [state-dump] ray::rpc::InternalKVGcsService.grpc_client.GetInternalConfig - 1 total (0 active), Execution time: mean = 1.843 ms, total = 1.843 ms, Queueing time: mean = 0.000 s, max = -0.000 s, min = 9223372036.855 s, total = 0.000 s [state-dump] NodeManager.deadline_timer.spill_objects_when_over_threshold - 1 total (1 active), Execution time: mean = 0.000 s, total = 0.000 s, Queueing time: mean = 0.000 s, max = -0.000 s, min = 9223372036.855 s, total = 0.000 s [state-dump] MemoryMonitor.CheckIsMemoryUsageAboveThreshold - 1 total (1 active), Execution time: mean = 0.000 s, total = 0.000 s, Queueing time: mean = 0.000 s, max = -0.000 s, min = 9223372036.855 s, total = 0.000 s [state-dump] NodeManager.deadline_timer.flush_free_objects - 1 total (1 active), Execution time: mean = 0.000 s, total = 0.000 s, Queueing time: mean = 0.000 s, max = -0.000 s, min = 9223372036.855 s, total = 0.000 s [state-dump] NodeManager.deadline_timer.record_metrics - 1 total (1 active), Execution time: mean = 0.000 s, total = 0.000 s, Queueing time: mean = 0.000 s, max = -0.000 s, min = 9223372036.855 s, total = 0.000 s [state-dump] DebugString() time ms: 0 [state-dump] [state-dump] [2025-01-15 22:08:14,512 I 2377 2377] (raylet) accessor.cc:762: Received notification for node, IsAlive = 1 node_id=db1eca1a9346159f9fc7887dfa455f72c2c1f83b3d76b96b961421d8 [2025-01-15 22:08:14,658 I 2377 2377] (raylet) worker_pool.cc:501: Started worker process with pid 2484, the token is 0 [2025-01-15 22:08:14,663 I 2377 2377] (raylet) worker_pool.cc:501: Started worker process with pid 2485, the token is 1 [2025-01-15 22:08:14,665 I 2377 2377] (raylet) worker_pool.cc:501: Started worker process with pid 2486, the token is 2 [2025-01-15 22:08:14,668 I 2377 2377] (raylet) worker_pool.cc:501: Started worker process with pid 2487, the token is 3 [2025-01-15 22:08:14,670 I 2377 2377] (raylet) worker_pool.cc:501: Started worker process with pid 2488, the token is 4 [2025-01-15 22:08:14,672 I 2377 2377] (raylet) worker_pool.cc:501: Started worker process with pid 2489, the token is 5 [2025-01-15 22:08:14,675 I 2377 2377] (raylet) worker_pool.cc:501: Started worker process with pid 2490, the token is 6 [2025-01-15 22:08:14,677 I 2377 2377] (raylet) worker_pool.cc:501: Started worker process with pid 2491, the token is 7 [2025-01-15 22:08:14,679 I 2377 2377] (raylet) worker_pool.cc:501: Started worker process with pid 2492, the token is 8 [2025-01-15 22:08:14,681 I 2377 2377] (raylet) worker_pool.cc:501: Started worker process with pid 2493, the token is 9 [2025-01-15 22:08:14,683 I 2377 2377] (raylet) worker_pool.cc:501: Started worker process with pid 2494, the token is 10 [2025-01-15 22:08:14,685 I 2377 2377] (raylet) worker_pool.cc:501: Started worker process with pid 2495, the token is 11 [2025-01-15 22:08:14,688 I 2377 2377] (raylet) worker_pool.cc:501: Started worker process with pid 2496, the token is 12 [2025-01-15 22:08:14,690 I 2377 2377] (raylet) worker_pool.cc:501: Started worker process with pid 2497, the token is 13 [2025-01-15 22:08:14,692 I 2377 2377] (raylet) worker_pool.cc:501: Started worker process with pid 2498, the token is 14 [2025-01-15 22:08:14,694 I 2377 2377] (raylet) worker_pool.cc:501: Started worker process with pid 2499, the token is 15 [2025-01-15 22:08:14,697 I 2377 2377] (raylet) worker_pool.cc:501: Started worker process with pid 2500, the token is 16 [2025-01-15 22:08:14,700 I 2377 2377] (raylet) worker_pool.cc:501: Started worker process with pid 2501, the token is 17 [2025-01-15 22:08:14,703 I 2377 2377] (raylet) worker_pool.cc:501: Started worker process with pid 2502, the token is 18 [2025-01-15 22:08:14,706 I 2377 2377] (raylet) worker_pool.cc:501: Started worker process with pid 2503, the token is 19 [2025-01-15 22:08:15,448 I 2377 2406] (raylet) object_store.cc:35: Object store current usage 8e-09 / 2.14748 GB. [2025-01-15 22:08:15,696 I 2377 2377] (raylet) worker_pool.cc:692: Job 01000000 already started in worker pool. [2025-01-15 22:08:16,563 I 2377 2377] (raylet) node_manager.cc:1481: NodeManager::DisconnectClient, disconnect_type=1, has creation task exception = false [2025-01-15 22:08:16,563 I 2377 2377] (raylet) node_manager.cc:1481: NodeManager::DisconnectClient, disconnect_type=1, has creation task exception = false [2025-01-15 22:08:16,564 I 2377 2377] (raylet) node_manager.cc:1481: NodeManager::DisconnectClient, disconnect_type=1, has creation task exception = false [2025-01-15 22:08:16,564 I 2377 2377] (raylet) node_manager.cc:1481: NodeManager::DisconnectClient, disconnect_type=1, has creation task exception = false [2025-01-15 22:08:16,565 I 2377 2377] (raylet) node_manager.cc:1481: NodeManager::DisconnectClient, disconnect_type=1, has creation task exception = false [2025-01-15 22:08:16,571 I 2377 2377] (raylet) node_manager.cc:1481: NodeManager::DisconnectClient, disconnect_type=1, has creation task exception = false [2025-01-15 22:08:16,573 I 2377 2377] (raylet) node_manager.cc:1481: NodeManager::DisconnectClient, disconnect_type=1, has creation task exception = false [2025-01-15 22:08:16,573 I 2377 2377] (raylet) node_manager.cc:1481: NodeManager::DisconnectClient, disconnect_type=1, has creation task exception = false [2025-01-15 22:08:16,574 I 2377 2377] (raylet) node_manager.cc:1481: NodeManager::DisconnectClient, disconnect_type=1, has creation task exception = false [2025-01-15 22:08:16,574 I 2377 2377] (raylet) node_manager.cc:1481: NodeManager::DisconnectClient, disconnect_type=1, has creation task exception = false [2025-01-15 22:08:16,590 I 2377 2377] (raylet) node_manager.cc:1481: NodeManager::DisconnectClient, disconnect_type=3, has creation task exception = false [2025-01-15 22:08:16,775 I 2377 2377] (raylet) node_manager.cc:1481: NodeManager::DisconnectClient, disconnect_type=1, has creation task exception = false [2025-01-15 22:08:16,776 I 2377 2377] (raylet) node_manager.cc:1481: NodeManager::DisconnectClient, disconnect_type=1, has creation task exception = false [2025-01-15 22:08:16,776 I 2377 2377] (raylet) node_manager.cc:1481: NodeManager::DisconnectClient, disconnect_type=1, has creation task exception = false [2025-01-15 22:08:16,777 I 2377 2377] (raylet) node_manager.cc:1481: NodeManager::DisconnectClient, disconnect_type=1, has creation task exception = false [2025-01-15 22:08:16,777 I 2377 2377] (raylet) node_manager.cc:1481: NodeManager::DisconnectClient, disconnect_type=1, has creation task exception = false [2025-01-15 22:08:16,777 I 2377 2377] (raylet) node_manager.cc:1481: NodeManager::DisconnectClient, disconnect_type=1, has creation task exception = false [2025-01-15 22:08:16,778 I 2377 2377] (raylet) node_manager.cc:1481: NodeManager::DisconnectClient, disconnect_type=1, has creation task exception = false [2025-01-15 22:08:16,778 I 2377 2377] (raylet) node_manager.cc:1481: NodeManager::DisconnectClient, disconnect_type=1, has creation task exception = false [2025-01-15 22:08:17,453 I 2377 2377] (raylet) node_manager.cc:1481: NodeManager::DisconnectClient, disconnect_type=3, has creation task exception = false [2025-01-15 22:08:17,462 I 2377 2377] (raylet) worker_pool.cc:501: Started worker process with pid 4206, the token is 20 [2025-01-15 22:08:18,656 I 2377 2377] (raylet) node_manager.cc:1481: NodeManager::DisconnectClient, disconnect_type=3, has creation task exception = false [2025-01-15 22:08:18,666 I 2377 2377] (raylet) worker_pool.cc:501: Started worker process with pid 4307, the token is 21 [2025-01-15 22:08:19,907 I 2377 2377] (raylet) node_manager.cc:1481: NodeManager::DisconnectClient, disconnect_type=3, has creation task exception = false [2025-01-15 22:08:19,916 I 2377 2377] (raylet) worker_pool.cc:501: Started worker process with pid 4409, the token is 22 [2025-01-15 22:08:21,173 I 2377 2377] (raylet) node_manager.cc:1481: NodeManager::DisconnectClient, disconnect_type=3, has creation task exception = false [2025-01-15 22:08:21,182 I 2377 2377] (raylet) worker_pool.cc:501: Started worker process with pid 4511, the token is 23 [2025-01-15 22:08:22,442 I 2377 2377] (raylet) node_manager.cc:1481: NodeManager::DisconnectClient, disconnect_type=3, has creation task exception = false [2025-01-15 22:08:22,469 I 2377 2377] (raylet) node_manager.cc:1481: NodeManager::DisconnectClient, disconnect_type=3, has creation task exception = false [2025-01-15 22:08:22,469 I 2377 2377] (raylet) node_manager.cc:1586: Driver (pid=2112) is disconnected. worker_id=01000000ffffffffffffffffffffffffffffffffffffffffffffffff job_id=01000000 [2025-01-15 22:08:22,473 I 2377 2377] (raylet) worker_pool.cc:692: Job 01000000 already started in worker pool. [2025-01-15 22:08:22,664 I 2377 2377] (raylet) main.cc:454: received SIGTERM. Existing local drain request = None [2025-01-15 22:08:22,664 I 2377 2377] (raylet) main.cc:255: Raylet graceful shutdown triggered, reason = EXPECTED_TERMINATION, reason message = received SIGTERM [2025-01-15 22:08:22,664 I 2377 2377] (raylet) main.cc:258: Shutting down... [2025-01-15 22:08:22,664 I 2377 2377] (raylet) accessor.cc:510: Unregistering node node_id=db1eca1a9346159f9fc7887dfa455f72c2c1f83b3d76b96b961421d8 [2025-01-15 22:08:22,667 I 2377 2377] (raylet) accessor.cc:523: Finished unregistering node info, status = OK node_id=db1eca1a9346159f9fc7887dfa455f72c2c1f83b3d76b96b961421d8 [2025-01-15 22:08:22,672 I 2377 2377] (raylet) agent_manager.cc:112: Killing agent dashboard_agent/424238335, pid 2444. [2025-01-15 22:08:22,685 I 2377 2445] (raylet) agent_manager.cc:79: Agent process with name dashboard_agent/424238335 exited, exit code 0. [2025-01-15 22:08:22,685 I 2377 2377] (raylet) agent_manager.cc:112: Killing agent runtime_env_agent, pid 2446. [2025-01-15 22:08:22,695 I 2377 2447] (raylet) agent_manager.cc:79: Agent process with name runtime_env_agent exited, exit code 0. [2025-01-15 22:08:22,696 I 2377 2377] (raylet) io_service_pool.cc:47: IOServicePool is stopped. [2025-01-15 22:08:22,895 I 2377 2377] (raylet) stats.h:120: Stats module has shutdown.