ML Package got stuck in "Deploying" status

Parmar_Snehal_Cognizant · March 5, 2024, 4:22pm

Hi All,

In AI center, I have been trying to Train my ML model third time, I have Added new set of documents & done the data labelling and created dataset & exported the same after that I have created the pipeline that also completed successfully & trying to deploy the ML model and update the ML skill, but my ML model package is still in Deploying status only pas few days (since last Friday 1st March), status is not getting changed to Failed to identify the issue, can anyone please help me to figure out what’s wrong with ML package deployment? will it take longer time to deploy the package?

Thanks in Advance!!!

Anil_G · March 5, 2024, 5:14pm

@Parmar_Snehal_Cognizant

Sometimes it might take longer…also check if you have enough licenses

If already a package is deployed this might not be…

Cheers

Parmar_Snehal_Cognizant · March 5, 2024, 5:23pm

yes I have enough license,6k AI units license its showing under license

Anil_G · March 5, 2024, 5:28pm

@Parmar_Snehal_Cognizant

Is there already few packages deployed?

Also you can open ml logs on the last tab and check if it shows retrying or so

Cheers

Parmar_Snehal_Cognizant · March 5, 2024, 5:31pm

yes one deployment was already done Months back, ML logs showing MLPackage validation successful, its not showing Retry option

Anil_G · March 5, 2024, 5:36pm

@Parmar_Snehal_Cognizant

Can you try to deactivate that package…may be you can deploy only one at a time

Cheers

Parmar_Snehal_Cognizant · March 5, 2024, 5:48pm

I checked but there is no option to disable the currently deployed package

Anil_G · March 5, 2024, 5:49pm

@Parmar_Snehal_Cognizant

Can you say how many active and deployed pipelines show on the project?

Cheers

Parmar_Snehal_Cognizant · March 5, 2024, 5:56pm

only 1 package is active currently

Anil_G · March 5, 2024, 6:41pm

@Parmar_Snehal_Cognizant

are you findign a log like this?

go to ml skill try to delete old one and then deploy

check in ml skills tab may be already one is there

cheers

Parmar_Snehal_Cognizant · March 6, 2024, 5:35am

I can see the logs like ML package validation started and validation success, but package is still under deploying status and for pipeline also I can see the logs pipeline run successful and under ML skill I dont see any logs for the latest deployment but in ML skill one tab is there streaming log there I can see one message
“968357b9-fad6-41db-aa8b-a73934368ed2-21-4-6c85b9c5f9-zjzlq:Warning ==> 0/3 nodes are available: 1 Insufficient memory, 1 node(s) had untolerated taint {nvidia.com/gpu: present}, 1 node(s) had untolerated taint {task.mining/cpu: present}. preemption: 0/3 nodes are available: 1 No preemption victims found for incoming pod, 2 Preemption is not helpful for scheduling.”

Parmar_Snehal_Cognizant · March 6, 2024, 5:39am

I can see the log message for pipe line and ML package but for ML skill I can see “MLSkill ICC_Billing_Invoice MLPackage v#23.10.0 Deployment Started” and “MLSkill ICC_Billing_Invoice MLPackage v#23.10.0 Deployment Failed Attempt: 1” with below error message

“968357b9-fad6-41db-aa8b-a73934368ed2-21-4-6c85b9c5f9-zjzlq:Warning ==> 0/3 nodes are available: 1 Insufficient memory, 1 node(s) had untolerated taint {nvidia.com/gpu: present}, 1 node(s) had untolerated taint {task.mining/cpu: present}. preemption: 0/3 nodes are available: 1 No preemption victims found for incoming pod, 2 Preemption is not helpful for scheduling.”

Anil_G · March 6, 2024, 1:02pm

@Parmar_Snehal_Cognizant

delete the already deployed model and next one you cna see

cheers

Parmar_Snehal_Cognizant · March 6, 2024, 2:17pm

ok so under ML package its only allowing me to delete undeployed package & in ML skills if I delete it then is it like will loose all other versions?

Anil_G · March 6, 2024, 2:38pm

@Parmar_Snehal_Cognizant

Please check under ml skill and not ml package

Cheers

system · March 9, 2024, 2:38pm

This topic was automatically closed 3 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Deploying Status Bug AI Center	2	883	March 8, 2022
ML Package in "Deploying" Status AI Center question , ai_center	5	507	January 4, 2024
ML skill Error: The model is getting successfully trained but not deploying AI Center	7	195	April 12, 2024
ML skill status in deployed AI Center question , ai_center	2	605	December 22, 2022
ML skill failed in AI center AI Center question , ai_center	2	985	June 21, 2022

Most Active Users - Yesterday
prashant1603765
ashokkarale
Anil_G
postwick
V_Roboto_V
sharazkm32
yedukondaluaregala
Christian_ISGS
Darshan_Sable
pikorpa
More details...

ML Package got stuck in "Deploying" status

Related topics