Skip to content

Instantly share code, notes, and snippets.

@mwestwood
Last active August 22, 2023 14:07
Show Gist options
  • Save mwestwood/30636e9e07ff70db65fb5c6127de9764 to your computer and use it in GitHub Desktop.
Save mwestwood/30636e9e07ff70db65fb5c6127de9764 to your computer and use it in GitHub Desktop.
"503 Service Unavailable: The server is unable to handle the request due to a temporary condition. Please try again later." -> Load balancer issue
"503 Service Unavailable: The requested service is currently unavailable due to server overload." -> High traffic volume
"503 Service Unavailable: The server is currently undergoing maintenance. Please try again later." -> Scheduled maintenance
"503 Service Unavailable: The server is busy or down for maintenance. Please try again later." -> Server maintenance
"503 Service Unavailable: The service you are trying to access is currently unavailable. We apologize for the inconvenience." -> Service outage
"503 Service Unavailable: The server is under heavy load and unable to respond to your request at the moment." -> High server load
"503 Service Unavailable: The server is currently unable to handle the request due to a high load. Please try again later." -> High server load
"503 Service Unavailable: Our system is undergoing maintenance. Please check back shortly." -> System maintenance
"503 Service Unavailable: The service you are trying to access is temporarily down. We are working to restore it as soon as possible." -> Service outage
"503 Service Unavailable: The server is currently unavailable due to maintenance or other technical issues." -> Maintenance or technical issues
"503 Service Unavailable: The server is temporarily unable to process your request. Please try again later." -> Temporary server issue
"503 Service Unavailable: The server is unable to respond to requests at this time." -> Server responsiveness issue
"503 Service Unavailable: The server is experiencing difficulties and cannot process your request." -> Server issues
"503 Service Unavailable: The server is currently experiencing high traffic and cannot process your request." -> High traffic load
"503 Service Unavailable: The server is overloaded and unable to handle incoming requests." -> Server overload
"503 Service Unavailable: The requested service is currently down due to technical problems." -> Technical problems
"503 Service Unavailable: The server is temporarily unable to handle the request due to a maintenance event." -> Maintenance event
"503 Service Unavailable: The server is currently out of service and cannot respond to your request." -> Server outage
"503 Service Unavailable: The service you are trying to access is temporarily unavailable due to system issues." -> System issues
"503 Service Unavailable: The server is facing issues and cannot process your request. Please try again later." -> Server issues
"Load balancer issue", "High traffic volume", "Scheduled maintenance", "Server maintenance", "Service outage", "High server load", "System maintenance", "Maintenance or technical issues", "Temporary server issue", "Server responsiveness issue", "Server issues", "High traffic load", "Server overload", "Technical problems", "Maintenance event", "Server outage", "System issues"
'Stale database connection detected' -> Stale Database Connection
'Database connection idle for too long. Please retry.' -> Stale Database Connection
'Connection timeout due to stale database link.' -> Stale Database Connection
'Database connection lost due to inactivity.' -> Stale Database Connection
'Failed to execute query due to a stale connection.' -> Stale Database Connection
'Connection closed by server due to being stale.' -> Stale Database Connection
'Stale connection caused query failure.' -> Stale Database Connection
'Query aborted due to a stale database link.' -> Stale Database Connection
'Connection dropped due to being idle for a prolonged period.' -> Stale Database Connection
'Failed to establish new connection due to stale connections.' -> Stale Database Connection
'Database connection expired due to inactivity.' -> Stale Database Connection
'Stale connection detected while trying to fetch data.' -> Stale Database Connection
'Connection pool exhausted due to stale connections.' -> Stale Database Connection
'Failed to maintain active connection, becoming stale.' -> Stale Database Connection
'Stale database connection caused query timeout.' -> Stale Database Connection
'Connection closed unexpectedly due to staleness.' -> Stale Database Connection
'Stale connection resulted in transaction failure.' -> Stale Database Connection
'Query failed due to a connection being stale.' -> Stale Database Connection
'Connection dropped due to being idle beyond limit.' -> Stale Database Connection
'Stale connection impacted data retrieval.' -> Stale Database Connection

When you encounter a "503 Service Unavailable" error for an application running on AWS EKS, it's important to perform health checks to diagnose and address the underlying issues. Here are some health checks you can run to troubleshoot the problem:

1. **Check Application Logs:** Review the application logs for any errors or warnings that could provide insights into the cause of the issue. Look for any patterns or anomalies.

2. **Examine Server Metrics:** Monitor server metrics such as CPU usage, memory usage, and network throughput. Elevated resource utilization could be causing the service to become unavailable.

3. **Database Connectivity:** Ensure that the application can connect to the associated databases or services. Database outages or connection issues can lead to a "503 Service Unavailable" error.

4. **Load Balancer Health:** If you're using a load balancer, check its health status. Ensure that all instances behind the load balancer are healthy and responding properly.

5. **Instance Status:** Verify the status of your EKS instances. Instances that are in an unhealthy state could contribute to the service unavailability.

6. **Network Connectivity:** Check network connectivity between your EKS nodes, load balancers, and databases. Network issues can result in service unavailability.

7. **Service Dependency Health:** Review the health of any external services or dependencies your application relies on. If these services are down or experiencing issues, they can cause your application to become unavailable.

8. **Application Deployment:** Ensure that recent changes or deployments have not introduced errors or issues that could lead to the "503 Service Unavailable" error.

9. **Resource Quotas:** Confirm that you haven't exceeded resource quotas, such as CPU or memory limits, which could impact your application's performance.

10. **Application Scaling:** If you're using auto-scaling, check if the instances are scaling appropriately based on demand. An improperly configured auto-scaling setup can lead to unavailability during high traffic.

11. **Container Health:** Monitor the health of your application's containers within the pods. Check if any containers are failing or restarting frequently.

12. **Pod Status:** Check the status of your pods. If pods are in a crash loop or pending state, it could contribute to service unavailability.

13. **Kubernetes Events:** Review Kubernetes events for any warnings or errors related to your pods, nodes, or services. These events can provide insights into the issue.

14. **Service Configuration:** Verify the configuration of your Kubernetes services and ingress controllers. Misconfigured networking can lead to service unavailability.

15. **Pod Resource Requests and Limits:** Ensure that your pods have appropriate resource requests and limits set. Incorrect resource allocation can impact pod stability.

16. **Security Groups and Firewall Rules:** Check your security group and firewall rules to ensure that there are no restrictions preventing traffic flow to your application.

17. **Availability Zones:** Verify the availability zones your EKS nodes are deployed in. Issues with a specific availability zone can impact application availability.

18. **Horizontal Pod Autoscaling:** If you're using Horizontal Pod Autoscaling, ensure that it's working as expected and adjusting the number of replicas based on demand.

19. **EKS Cluster Autoscaler:** If enabled, check the EKS cluster autoscaler to ensure it's scaling the nodes appropriately based on resource demands.

20. **Third-Party Integrations:** Review any third-party integrations your application uses. Issues with these integrations can affect your application's availability.

Performing these health checks systematically can help you identify and resolve the root cause of the "503 Service Unavailable" error and restore the availability of your application on AWS EKS.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment