AIOps Engineer Job at Sumeru Solutions, Frisco, TX

a0krdDBzcmt6eHNuQmdCWU0xSnVCOUpJ
  • Sumeru Solutions
  • Frisco, TX

Job Description

AIOps Engineer

Frisco, TX

Job Overview: The AIOps Engineer is responsible for integrating machine learning and advanced analytics into our existing monitoring and logging systems. This role will leverage artificial intelligence to automate routine operational tasks, detect anomalies proactively, and implement self-healing frameworks to enhance the stability and performance of our infrastructure. The ideal candidate will be proactive in identifying gaps, creating strategic roadmaps, and implementing phased improvements to achieve operational excellence.

Key Responsibilities:

Apply machine learning algorithms to existing operational data (logs, metrics, events) to predict system failures and proactively address potential incidents.

Implement automation for routine DevOps practices including automated scaling, resource optimization, and controlled restarts.

Develop and maintain self-healing systems to reduce manual intervention and enhance system reliability.

Build anomaly detection models to quickly identify and address unusual operational patterns.

Collaborate closely with SREs, developers, and infrastructure teams to continuously enhance the operational stability and performance of the system.

Provide insights and improvements through visualizations and reports leveraging AI-driven analytics.

Create a phased roadmap to incrementally enhance operational capabilities and align with strategic business goals.

Required Skills and Qualifications:

Strong experience with AI/ML frameworks and tools (e.g., TensorFlow, PyTorch, scikit-learn).

Proficiency in data processing and analytics tools (e.g., Splunk, Prometheus, Grafana, ELK stack).

Solid background in scripting and automation (Python, Bash, Ansible, etc.).

Experience with cloud environments and infrastructure automation.

Proven track record in implementing proactive monitoring, anomaly detection, and self-healing techniques.

Excellent analytical, problem-solving, and strategic planning skills.

Strong communication skills and the ability to effectively collaborate across teams.

Preferred Experience:

Background in DevOps/Site Reliability Engineering.

Familiarity with containerization and orchestration platforms (Kubernetes, Docker).

Experience in building scalable, distributed systems.

This role is pivotal in enabling our organization to achieve and sustain Operational Excellence through intelligent automation and proactive monitoring practices.

Job Tags

Similar Jobs

Safran

Security Solutions Administrator M-F Job at Safran

As a Security Solutions Administrator, you will be responsible for securing, managing, and optimizing our Microsoft cloud environment while ensuring compliance with security frameworks and protecting against cyber threats. As a member of the Cybersecurity team, you will...

1871 Member Company

TexChange Unbrokered - Chief Financial Officer (CFO) Job at 1871 Member Company

 ...fabrication & manufacturing SMEs. About the Role: We seek a highly motivated and experienced fractional Chief Financial Officer (CFO) to lead our financial strategy and operations. This is a ground-floor opportunity with significant equity-based compensation and... 

Caterpillar - Energy & Transportation

Quality Specialist Job at Caterpillar - Energy & Transportation

 ...Your Work Shapes the World at Caterpillar Inc. When you join Caterpillar, you'rejoining a global team who cares not just about the work we do but also about each other. We are the makers, problem solvers, and future world builders who are creating stronger, more sustainable... 

Brooklyn Kura

Sales Representative Job at Brooklyn Kura

 ...Sales Representative & Brand Ambassador At Brooklyn Kura, we are committed to a new tradition of American Craft Sake. Building on more...  ...sales performance data Partner with our distributor (Skurnik Wines) on account priorities, work-withs, and outreach Organize and... 

Kittitas Interactive Management

Complex Needs Life Enrichment Coach - Float Job at Kittitas Interactive Management

 ...support as well as providing daily living support to KIM clients in their home and in the community. Complex LECs will strive to motivate and encourage clients toward their goals, reflecting the Residential Service Guidelines. KIM is looking for flexible candidates to...