
Torchao SD3 int8wo #13266

@WasamiKirua

Describe the bug

Clicking Run raises an exception when quantizing the SD3 text encoders with TorchAO.

Reproduction

In the SD3 Text Encoder Loader node, set Dtype: bfloat16, Quantization: TorchAO, Quant Type: int8wo, then click Run.

Logs

20260314 21.08.15 [ERR] An unexpected error occurred during exec: cannot import name 'float8_dynamic_activation_float8_weight' from 'torchao.quantization' (/media/wasami/comfyui/Mellon/.venv/lib/python3.12/site-packages/torchao/quantization/__init__.py)
20260314 21.08.15 [ERR] Error executing node 50pz3Zy-IBsosDgI_t6gC (modules.Experiments.SD3TextEncodersLoader)
20260314 21.08.15 [ERR] Traceback (most recent call last):
  File "/media/wasami/comfyui/Mellon/mellon/NodeBase.py", line 216, in __call__
    output = getattr(self, self.CALLBACK)(**self.params)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/media/wasami/comfyui/Mellon/modules/Experiments/StableDiffusion3.py", line 190, in execute
    t5_encoder = self.mm_exec(lambda: quantize(t5_encoder, quantization, **kwargs), quant_device, exclude=[t5_encoder])
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/media/wasami/comfyui/Mellon/mellon/NodeBase.py", line 465, in mm_exec
    return memory_manager.exec(func, device, models, exclude, args, kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/media/wasami/comfyui/Mellon/utils/memory_menager.py", line 188, in exec
    raise e
  File "/media/wasami/comfyui/Mellon/utils/memory_menager.py", line 172, in exec
    return func(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^
  File "/media/wasami/comfyui/Mellon/modules/Experiments/StableDiffusion3.py", line 190, in <lambda>
    t5_encoder = self.mm_exec(lambda: quantize(t5_encoder, quantization, **kwargs), quant_device, exclude=[t5_encoder])
                                      ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/media/wasami/comfyui/Mellon/utils/quantization.py", line 52, in quantize
    torchao(model, quant_type=torchao_quant_type)
  File "/media/wasami/comfyui/Mellon/utils/quantization.py", line 70, in torchao
    dtype = get_torchao_quant_method(quant_type)
            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/media/wasami/comfyui/Mellon/utils/quantization.py", line 96, in get_torchao_quant_method
    from torchao.quantization import (
ImportError: cannot import name 'float8_dynamic_activation_float8_weight' from 'torchao.quantization' (/media/wasami/comfyui/Mellon/.venv/lib/python3.12/site-packages/torchao/quantization/__init__.py)

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/media/wasami/comfyui/Mellon/mellon/server.py", line 1015, in execute_node
    self.node_cache[id](**args)
  File "/media/wasami/comfyui/Mellon/mellon/NodeBase.py", line 220, in __call__
    raise RuntimeError(f"Error executing {self.module_name}.{self.class_name}: {e}")
RuntimeError: Error executing modules.Experiments.SD3TextEncodersLoader: cannot import name 'float8_dynamic_activation_float8_weight' from 'torchao.quantization' (/media/wasami/comfyui/Mellon/.venv/lib/python3.12/site-packages/torchao/quantization/__init__.py)

20260314 21.08.15 [ERR] Error occurred in Traceback (most recent call last):
  File "/media/wasami/comfyui/Mellon/mellon/NodeBase.py", line 216, in __call__
    output = getattr(self, self.CALLBACK)(**self.params)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/media/wasami/comfyui/Mellon/modules/Experiments/StableDiffusion3.py", line 190, in execute
    t5_encoder = self.mm_exec(lambda: quantize(t5_encoder, quantization, **kwargs), quant_device, exclude=[t5_encoder])
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/media/wasami/comfyui/Mellon/mellon/NodeBase.py", line 465, in mm_exec
    return memory_manager.exec(func, device, models, exclude, args, kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/media/wasami/comfyui/Mellon/utils/memory_menager.py", line 188, in exec
    raise e
  File "/media/wasami/comfyui/Mellon/utils/memory_menager.py", line 172, in exec
    return func(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^
  File "/media/wasami/comfyui/Mellon/modules/Experiments/StableDiffusion3.py", line 190, in <lambda>
    t5_encoder = self.mm_exec(lambda: quantize(t5_encoder, quantization, **kwargs), quant_device, exclude=[t5_encoder])
                                      ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/media/wasami/comfyui/Mellon/utils/quantization.py", line 52, in quantize
    torchao(model, quant_type=torchao_quant_type)
  File "/media/wasami/comfyui/Mellon/utils/quantization.py", line 70, in torchao
    dtype = get_torchao_quant_method(quant_type)
            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/media/wasami/comfyui/Mellon/utils/quantization.py", line 96, in get_torchao_quant_method
    from torchao.quantization import (
ImportError: cannot import name 'float8_dynamic_activation_float8_weight' from 'torchao.quantization' (/media/wasami/comfyui/Mellon/.venv/lib/python3.12/site-packages/torchao/quantization/__init__.py)

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/media/wasami/comfyui/Mellon/mellon/server.py", line 318, in _main_worker
    result = await self.loop.run_in_executor(None, partial(task, *args))
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/wasami/.local/share/uv/python/cpython-3.12.12-linux-x86_64-gnu/lib/python3.12/concurrent/futures/thread.py", line 59, in run
    result = self.fn(*self.args, **self.kwargs)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/media/wasami/comfyui/Mellon/mellon/server.py", line 916, in execute_graph
    self.execute_node(id, nodes[id], sid)
  File "/media/wasami/comfyui/Mellon/mellon/server.py", line 1019, in execute_node
    raise e
  File "/media/wasami/comfyui/Mellon/mellon/server.py", line 1015, in execute_node
    self.node_cache[id](**args)
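
A possible workaround sketch, not the project's actual code: the traceback points at a single `from torchao.quantization import (` statement in `get_torchao_quant_method`, and the name that fails is a float8 one even though int8wo was selected. That suggests the statement imports several names at once, so one missing name breaks every quant type. Looking up only the name that is needed, and trying alternatives in order, would avoid that. The helper names `resolve_first_available` and `load_quant_method` are hypothetical, and the candidate names `int8_weight_only` and `Int8WeightOnlyConfig` are assumptions that should be checked against the installed torchao version. The demo uses a stand-in object, so it runs without torchao.

```python
import importlib
from types import SimpleNamespace


def resolve_first_available(module, candidates):
    """Return (name, attribute) for the first candidate name defined on module."""
    for name in candidates:
        attr = getattr(module, name, None)
        if attr is not None:
            return name, attr
    module_name = getattr(module, "__name__", repr(module))
    raise ImportError(f"none of {list(candidates)} found in {module_name}")


def load_quant_method(candidates, module_name="torchao.quantization"):
    """Import module_name lazily and resolve only the name that is needed."""
    module = importlib.import_module(module_name)
    return resolve_first_available(module, candidates)


# Stand-in for a torchao build that only exposes the newer-style name.
# Both names below are assumptions, not confirmed against any torchao release.
fake_quantization = SimpleNamespace(
    __name__="fake_quantization",
    Int8WeightOnlyConfig=object,
)

name, _ = resolve_first_available(
    fake_quantization,
    ["int8_weight_only", "Int8WeightOnlyConfig"],
)
print(name)  # prints "Int8WeightOnlyConfig"
```

With a lookup like this, selecting int8wo would not touch any float8 name, so a float8 symbol missing from the installed torchao would only affect the float8 quant types.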

System Info

Ubuntu 24.04
RTX 5090

Who can help?

No response
