runtime error
Exit code: 1. Reason: 35/435 [00:00<00:00, 3.73MB/s] config.json: 0%| | 0.00/1.53k [00:00<?, ?B/s][A config.json: 100%|██████████| 1.53k/1.53k [00:00<00:00, 10.1MB/s] WARNING - Exllamav2 kernel is not installed, reset disable_exllamav2 to True. This may because you installed auto_gptq using a pre-build wheel on Windows, in which exllama_kernels are not compiled. To use exllama_kernels to further speedup inference, you can re-install auto_gptq from source. WARNING - CUDA kernels for auto_gptq are not installed, this will result in very slow inference speed. This may because: 1. You disabled CUDA extensions compilation by setting BUILD_CUDA_EXT=0 when install auto_gptq from source. 2. You are using pytorch without CUDA support. 3. CUDA and nvcc are not installed in your device. /usr/local/lib/python3.10/site-packages/huggingface_hub/file_download.py:943: FutureWarning: `resume_download` is deprecated and will be removed in version 1.0.0. Downloads always resume when possible. If you want to force a new download, use `force_download=True`. warnings.warn( Traceback (most recent call last): File "/home/user/app/app.py", line 11, in <module> model = AutoGPTQForCausalLM.from_quantized( File "/usr/local/lib/python3.10/site-packages/auto_gptq/modeling/auto.py", line 135, in from_quantized return quant_func( File "/usr/local/lib/python3.10/site-packages/auto_gptq/modeling/_base.py", line 955, in from_quantized is_sharded, resolved_archive_file, true_model_basename = get_checkpoints(model_name_or_path=model_name_or_path, extensions=extensions, possible_model_basenames=possible_model_basenames, **cached_file_kwargs) File "/usr/local/lib/python3.10/site-packages/auto_gptq/modeling/_utils.py", line 734, in get_checkpoints raise FileNotFoundError( FileNotFoundError: Could not find a model in TheBloke/medicine-chat-GPTQ with a name in medicine-chat.Q4_K_M.gguf.safetensors.index.json, medicine-chat.Q4_K_M.gguf.safetensors. Please specify the argument model_basename to use a custom file name.
Container logs:
Fetching error logs...