Senior Cloud Engineer with Prometheus and Grafana Job at Compunnel Inc., Dallas, TX

ekdOOEhsTHhqWEJJd2swcW9TUzJBOXNFQmc9PQ==
  • Compunnel Inc.
  • Dallas, TX

Job Description

Job Title: Senior Cloud Engineer - W2 only - Can provide sponsorship

Duration: Long Term

Location: Westlake, TX or Merrimack, NH - Hybrid - Local

This one we're going to be looking for someone who is really strong Prometheus and Grafana experience to help out as they migrate to OTEL Observability platform . Ideally, they'd like someone to have come from a Software Engineering background earlier in their career and they got into the Cloud Space, They will not be doing much programming and development like the other roles but will do some scripting working in automation with Python.

They'd be expected to provide L3 support as needed.

Top Skills :

  • Prometheus
  • Grafana
  • AWS, EC2, S3, Lambda

Nice to Have: Datadog, Kubernetes

Key responsibilities:

1. Monitoring and Alerting:

o Design and manage alerting rules for proactive issue identification and resolution.

o Continuously improve and expand monitoring coverage to meet evolving needs.

o Collaborate with teams to define alert thresholds and escalation procedures.

2 . Data Analysis and Visualization:

o Analyze metrics data to identify performance bottlenecks and areas for improvement.

o Create meaningful visualizations and reports to provide insights for stakeholders.

o Contribute to the enhancement of data retention and archiving strategies.

3. Scaling and Optimization:

o Collaborate with the infrastructure team to ensure seamless integration and scalability of Grafana and Prometheus.

o Fine-tune configurations to achieve optimal resource utilization and performance.

o Proven experience as an L3 Engineer specializing in Grafana and Prometheus administration.

o Proficiency in creating custom Grafana dashboards and queries.

o Strong understanding of monitoring best practices, alerting, and data analysis.

o Knowledge of time-series databases and storage strategies.

4. Automation and Development

o Scripting and automation skills for efficient system management.

o Building OTEL based component for Observability Stack

o Automation building Observability query language conversions

Job Tags

Local area,

Similar Jobs

Kinema Fitness

Part-Time General Manager Corporate Fitness & Wellness (Washington) Job at Kinema Fitness

 ...leadership and communication skills, along with a passion for wellness. Responsibilities include managing member engagement, developing...  ...in Exercise Science, CPR/AED certification, and experience in corporate fitness management. Competitive pay at $35/hr for 20 hours per... 

PRIME360

Material Handler/Pallet Repair Job at PRIME360

 ...Prime360 is one of the largest and fastest growing pallet management services companies offering the entire nation, including Canada and...  ...alongside their Manager, our team hand sort, inspect and repair wooden pallets to our Customers needs. Join us at our Customer... 

LanceSoft

Production Associate / Operator Job at LanceSoft

 ...Production Associate / Operator Stanley, Virginia 22851 Temp to Hire Pay Rate: $17.00/hr. - $19.00/hr. 2nd Shift: Monday to Thursday 3:00 pm to 1:30 am Job Description: Stacks of doors enter work area on conveyor, performs visual quality check of entire stack, repairs... 

Bain Capital

Hybrid Cloud Network Architect | Aruba/Cisco & IaC Expert (Boston) Job at Bain Capital

 ...designing, building, and operating network infrastructure across on-premises and public clouds, requiring deep expertise in Aruba and Cisco networking. Candidates should have at least 8 years of experience with cloud networking and automation tools such as Terraform. The... 

Crime Scene Resources, Inc

Digital Forensics Specialist Job at Crime Scene Resources, Inc

 ...Duties and Requirements Click to read more Duties Duties and Essential Functions of a Digital Forensics Specialist (AGO Senior Investigator/Analyst): This is an expert level position that is responsible for the secure collection, preservation, analysis...