File size: 9,076 Bytes
c011401 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 |
2025-01-17 17:09:47,937 INFO monitor.py:688 -- Starting monitor using ray installation: /usr/local/lib/python3.10/dist-packages/ray/__init__.py
2025-01-17 17:09:47,937 INFO monitor.py:689 -- Ray version: 2.40.0
2025-01-17 17:09:47,937 INFO monitor.py:690 -- Ray commit: 22541c38dbef25286cd6d19f1c151bf4fd62f2ed
2025-01-17 17:09:47,937 INFO monitor.py:691 -- Monitor started with command: ['/usr/local/lib/python3.10/dist-packages/ray/autoscaler/_private/monitor.py', '--logs-dir=/tmp/ray/session_2025-01-17_17-09-46_991438_28003/logs', '--logging-rotate-bytes=536870912', '--logging-rotate-backup-count=5', '--gcs-address=192.168.0.2:52229', '--monitor-ip=192.168.0.2']
2025-01-17 17:09:47,956 INFO monitor.py:159 -- session_name: session_2025-01-17_17-09-46_991438_28003
2025-01-17 17:09:47,958 INFO monitor.py:191 -- Starting autoscaler metrics server on port 44217
2025-01-17 17:09:47,960 INFO monitor.py:216 -- Monitor: Started
2025-01-17 17:09:47,977 INFO autoscaler.py:280 -- disable_node_updaters:False
2025-01-17 17:09:47,977 INFO autoscaler.py:288 -- disable_launch_config_check:True
2025-01-17 17:09:47,977 INFO autoscaler.py:300 -- foreground_node_launch:False
2025-01-17 17:09:47,977 INFO autoscaler.py:310 -- worker_liveness_check:True
2025-01-17 17:09:47,977 INFO autoscaler.py:318 -- worker_rpc_drain:True
2025-01-17 17:09:47,979 INFO autoscaler.py:368 -- StandardAutoscaler: {'cluster_name': 'default', 'max_workers': 0, 'upscaling_speed': 1.0, 'docker': {}, 'idle_timeout_minutes': 0, 'provider': {'type': 'readonly', 'use_node_id_as_ip': True, 'disable_launch_config_check': True}, 'auth': {}, 'available_node_types': {'ray.head.default': {'resources': {}, 'node_config': {}, 'max_workers': 0}}, 'head_node_type': 'ray.head.default', 'file_mounts': {}, 'cluster_synced_files': [], 'file_mounts_sync_continuously': False, 'rsync_exclude': [], 'rsync_filter': [], 'initialization_commands': [], 'setup_commands': [], 'head_setup_commands': [], 'worker_setup_commands': [], 'head_start_ray_commands': [], 'worker_start_ray_commands': []}
2025-01-17 17:09:47,982 INFO monitor.py:383 -- Autoscaler has not yet received load metrics. Waiting.
2025-01-17 17:09:53,062 INFO autoscaler.py:147 -- The autoscaler took 0.0 seconds to fetch the list of non-terminated nodes.
2025-01-17 17:09:53,062 INFO autoscaler.py:427 --
======== Autoscaler status: 2025-01-17 17:09:53.062707 ========
Node status
---------------------------------------------------------------
Active:
1 node_0df5098b63b61b261d6b38f00795725b5909c1b1e4c347b784459207
Pending:
(no pending nodes)
Recent failures:
(no failures)
Resources
---------------------------------------------------------------
Usage:
0.0/96.0 CPU
0.0/2.0 GPU
0B/2.00GiB memory
0B/4.00GiB object_store_memory
Demands:
(no resource demands)
2025-01-17 17:09:53,064 INFO autoscaler.py:470 -- The autoscaler took 0.003 seconds to complete the update iteration.
2025-01-17 17:09:58,076 INFO autoscaler.py:147 -- The autoscaler took 0.0 seconds to fetch the list of non-terminated nodes.
2025-01-17 17:09:58,076 INFO autoscaler.py:427 --
======== Autoscaler status: 2025-01-17 17:09:58.076616 ========
Node status
---------------------------------------------------------------
Active:
1 node_0df5098b63b61b261d6b38f00795725b5909c1b1e4c347b784459207
Pending:
(no pending nodes)
Recent failures:
(no failures)
Resources
---------------------------------------------------------------
Usage:
10.0/96.0 CPU
0.0/2.0 GPU
0B/2.00GiB memory
0B/4.00GiB object_store_memory
Demands:
(no resource demands)
2025-01-17 17:09:58,078 INFO autoscaler.py:470 -- The autoscaler took 0.003 seconds to complete the update iteration.
2025-01-17 17:10:03,152 INFO autoscaler.py:147 -- The autoscaler took 0.0 seconds to fetch the list of non-terminated nodes.
2025-01-17 17:10:03,153 INFO autoscaler.py:427 --
======== Autoscaler status: 2025-01-17 17:10:03.152899 ========
Node status
---------------------------------------------------------------
Active:
1 node_0df5098b63b61b261d6b38f00795725b5909c1b1e4c347b784459207
Pending:
(no pending nodes)
Recent failures:
(no failures)
Resources
---------------------------------------------------------------
Usage:
56.0/96.0 CPU
0.0/2.0 GPU
0B/2.00GiB memory
210.40MiB/4.00GiB object_store_memory
Demands:
(no resource demands)
2025-01-17 17:10:03,154 INFO autoscaler.py:470 -- The autoscaler took 0.002 seconds to complete the update iteration.
2025-01-17 17:10:08,168 INFO autoscaler.py:147 -- The autoscaler took 0.0 seconds to fetch the list of non-terminated nodes.
2025-01-17 17:10:08,169 INFO autoscaler.py:427 --
======== Autoscaler status: 2025-01-17 17:10:08.168996 ========
Node status
---------------------------------------------------------------
Active:
1 node_0df5098b63b61b261d6b38f00795725b5909c1b1e4c347b784459207
Pending:
(no pending nodes)
Recent failures:
(no failures)
Resources
---------------------------------------------------------------
Usage:
27.0/96.0 CPU
0.0/2.0 GPU
0B/2.00GiB memory
185.98MiB/4.00GiB object_store_memory
Demands:
{'CPU': 1}: 116+ from request_resources()
2025-01-17 17:10:08,171 INFO autoscaler.py:470 -- The autoscaler took 0.003 seconds to complete the update iteration.
2025-01-17 17:10:13,184 INFO autoscaler.py:147 -- The autoscaler took 0.0 seconds to fetch the list of non-terminated nodes.
2025-01-17 17:10:13,184 INFO autoscaler.py:427 --
======== Autoscaler status: 2025-01-17 17:10:13.184589 ========
Node status
---------------------------------------------------------------
Active:
1 node_0df5098b63b61b261d6b38f00795725b5909c1b1e4c347b784459207
Pending:
(no pending nodes)
Recent failures:
(no failures)
Resources
---------------------------------------------------------------
Usage:
6.0/96.0 CPU
0.0/2.0 GPU
0B/2.00GiB memory
147.84MiB/4.00GiB object_store_memory
Demands:
(no resource demands)
2025-01-17 17:10:13,186 INFO autoscaler.py:470 -- The autoscaler took 0.003 seconds to complete the update iteration.
2025-01-17 17:10:18,196 INFO autoscaler.py:147 -- The autoscaler took 0.0 seconds to fetch the list of non-terminated nodes.
2025-01-17 17:10:18,197 INFO autoscaler.py:427 --
======== Autoscaler status: 2025-01-17 17:10:18.196921 ========
Node status
---------------------------------------------------------------
Active:
1 node_0df5098b63b61b261d6b38f00795725b5909c1b1e4c347b784459207
Pending:
(no pending nodes)
Recent failures:
(no failures)
Resources
---------------------------------------------------------------
Usage:
4.0/96.0 CPU
0.0/2.0 GPU
0B/2.00GiB memory
147.84MiB/4.00GiB object_store_memory
Demands:
(no resource demands)
2025-01-17 17:10:18,200 INFO autoscaler.py:470 -- The autoscaler took 0.004 seconds to complete the update iteration.
2025-01-17 17:10:23,212 INFO autoscaler.py:147 -- The autoscaler took 0.0 seconds to fetch the list of non-terminated nodes.
2025-01-17 17:10:23,213 INFO autoscaler.py:427 --
======== Autoscaler status: 2025-01-17 17:10:23.213524 ========
Node status
---------------------------------------------------------------
Active:
1 node_0df5098b63b61b261d6b38f00795725b5909c1b1e4c347b784459207
Pending:
(no pending nodes)
Recent failures:
(no failures)
Resources
---------------------------------------------------------------
Usage:
4.0/96.0 CPU
0.0/2.0 GPU
0B/2.00GiB memory
216.42MiB/4.00GiB object_store_memory
Demands:
(no resource demands)
2025-01-17 17:10:23,216 INFO autoscaler.py:470 -- The autoscaler took 0.004 seconds to complete the update iteration.
2025-01-17 17:10:28,228 INFO autoscaler.py:147 -- The autoscaler took 0.0 seconds to fetch the list of non-terminated nodes.
2025-01-17 17:10:28,228 INFO autoscaler.py:427 --
======== Autoscaler status: 2025-01-17 17:10:28.228654 ========
Node status
---------------------------------------------------------------
Active:
1 node_0df5098b63b61b261d6b38f00795725b5909c1b1e4c347b784459207
Pending:
(no pending nodes)
Recent failures:
(no failures)
Resources
---------------------------------------------------------------
Usage:
0.0/96.0 CPU
0.0/2.0 GPU
0B/2.00GiB memory
147.91MiB/4.00GiB object_store_memory
Demands:
(no resource demands)
2025-01-17 17:10:28,231 INFO autoscaler.py:470 -- The autoscaler took 0.003 seconds to complete the update iteration.
2025-01-17 17:10:33,243 INFO autoscaler.py:147 -- The autoscaler took 0.0 seconds to fetch the list of non-terminated nodes.
2025-01-17 17:10:33,244 INFO autoscaler.py:427 --
======== Autoscaler status: 2025-01-17 17:10:33.244288 ========
Node status
---------------------------------------------------------------
Active:
1 node_0df5098b63b61b261d6b38f00795725b5909c1b1e4c347b784459207
Pending:
(no pending nodes)
Recent failures:
(no failures)
Resources
---------------------------------------------------------------
Usage:
0.0/96.0 CPU
0.0/2.0 GPU
0B/2.00GiB memory
147.93MiB/4.00GiB object_store_memory
Demands:
(no resource demands)
2025-01-17 17:10:33,246 INFO autoscaler.py:470 -- The autoscaler took 0.003 seconds to complete the update iteration.
|