Site Reliability-Engineering Jobs

40 jobs found

full time
onsite/hybrid in hong-kong hong-kong

# Senior Backend Engineer - Instant Messaging Chat **Locations:** Asia / Hong Kong / Taiwan, Taipei / New Zealand, Auckland / New Zealand, Wellington / Australia, Melbourne / Australia, Sydney / Australia, Brisbane / UAE, Dubai **Department:** Engineering – Backend **Commitment:** Full-time: Remote **Workplace Type:** Remote **Binance** is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countries for our industry-leading security, user fund transparency, trading engine speed, deep liquidity, and an unmatched portfolio of digital-asset products. Binance offerings range from trading and finance to education, research, payments, institutional services, Web3 features, and more. We leverage the power of digital assets and blockchain to build an inclusive financial ecosystem to advance the freedom of money and improve financial access for people around the world. ## Responsibilities - Lead the design and development of new instant messaging features, ensuring the system can handle high concurrency with strong performance, scalability, and reliability. - Build and maintain microservices based on Spring Cloud, including service discovery, configuration management, load balancing, and traffic governance. - Work with large-scale data pipelines to analyze and process message data, supporting product decisions and improving system efficiency. - Design and optimize storage and retrieval architectures for massive datasets, ensuring stable and efficient data operations. - Drive performance tuning, handle production incidents, and lead major refactoring efforts to improve overall system stability and throughput. ## Requirements - Hands-on experience building or maintaining instant messaging platforms such as WeChat, QQ, Telegram, WhatsApp, Slack, or similar real-time communication systems. - Strong proficiency in Java and Spring Boot, with familiarity in distributed systems. - Strong knowledge of Linux, microservices, distributed systems, Redis sharding, database sharding, Kafka, and MQ. - Proven ability to independently design and deliver a high-performance, high-throughput, and highly available backend system that has been successfully deployed in production. - Deep understanding of database storage engines, indexing, partitioning/sharding strategies, and real-world performance tuning practices. ## Why Binance - Shape the future with the world’s leading blockchain ecosystem - Collaborate with world-class talent in a user-centric global organization with a flat structure - Tackle unique, fast-paced projects with autonomy in an innovative environment - Thrive in a results-driven workplace with opportunities for career growth and continuous learning - Competitive salary and company benefits - Work-from-home arrangement (the arrangement may vary depending on the work nature of the business team) Binance is committed to being an equal opportunity employer. We believe that having a diverse workforce is fundamental to our success. *By submitting a job application, you confirm that you have read and agree to our **Candidate Privacy Notice**.* *We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.* **Apply for this job** *When applying, mention the word CANDYSHOP to show you read the job post completely.*

javablockchainweb3+5 更多
查看详情
full time
onsite/hybrid in zug switzerland

## Technical Lead - GPU/Compute Infrastructure **Location:** Zug **Department:** ICN **Commitment:** Full-time **Workplace Type:** Remote **Please note: This is an ICN role being handled by the IC GmbH applicant tracking system.** ### About Us At Impossible Cloud Network (ICN) we’re building the world’s largest independent data center network for high-performance, enterprise-grade AI storage and compute. Founded by seasoned entrepreneurs who previously built a billion-euro tech company, we're on a mission to revolutionize the cloud industry. We enable our customers to achieve real data sovereignty as they retain full data control & custody with verifiable enterprise-grade privacy guarantees – all outside hyperscalers. ### The Role Lead the technical vision for GPU/compute infrastructure. By yielding your deep understanding of AI-native workloads, you guide the expansion of our network by helping Hardware Providers select hardware and data centers and help Service Providers navigate which available hardware cluster has the best compute and storage resources for their workload. You will serve as the top technical authority, leading architectural decisions, technology stack choices, and deployment strategy. You will be responsible for translating our market vision into a robust, high-performance, and decentralized system. ### What We Are Looking For We are searching for an exceptional, entrepreneurial technical leader to partner with the executive team and take complete ownership of the project's technical architecture, vision, and execution. This is a dynamic, high-impact role for a self-starter who is ready to move beyond corporate constraints and drive value by building a foundational, scalable product from Day One. If you are driven by deep technical ownership, impact, and the challenge of building a cutting-edge platform this is your opportunity. Some of the key skills that will help you in this role include: - Proven experience with Bare Metal as a Service (BMaaS), including exposure to the GPU as a Service (GPUaaS) concepts market. - Familiarity with infrastructure providers such as Hetzner, OVHcloud, CoreWeave or a NeoCloud is a strong asset. - Competence in designing, building, and scaling complex, distributed backend systems (Cloud, Web3, or HPC environments). - Deep understanding of diverse AI workloads (e.g., LLM training, inference, data analytics) and the corresponding compute and data orchestration strategy required for each. - Extensive background in AI/Machine Learning infrastructure, distributed compute, or high-performance computing (HPC). - Readiness to operate remotely while committing to regular, **in-person collaboration sessions in Zug, Hamburg and other European locations.** - Strong analytical and problem-solving abilities, coupled with the adaptability required to thrive in a fast-paced, zero-to-one startup environment. - Experience with decentralized physical infrastructure networks (DePIN), or managing dynamic compute resource allocation is an advantage but not essential. ### Our Culture We believe the best ideas are forged in person. That's why we value regular, in-person collaboration and open communication at our Zug, Switzerland headquarters. If you're an innovative thinker with a passion for success in decentralized technology and the cloud industry, we want you on our team. At our core, we're a workplace that values your well-being, fosters a vibrant and collaborative atmosphere, and empowers you to shape the future of the cloud. ### Our Hiring Process 1. **Application Submission** We encourage you to kickstart your application by submitting your comprehensive LinkedIn profile or CV along with the designated application form. 2. **Kickoff Call for Selected Candidates** Successful candidates will be invited to participate in a Kickoff call, where we aim to explore your qualifications, experiences, and expectations. 3. **Efficient Interview Process** Our commitment is to complete the hiring process in 2 to 4 additional remote and/or on-site steps, according to the specific role and its seniority level. We believe in moving swiftly to welcome exceptional talent into our dynamic workplace. *We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.* **Apply for this job** When applying, mention the word **CANDYSHOP** to show you read the job post completely.

machine-learningweb3cloud+1 更多
查看详情
full time
remote in london united-kingdom

**Stay in the loop.** Follow @raikucom on Twitter for product updates, engineering deep dives, and a closer look at how we’re building the future of blockspace. ## Senior Validator Engineer ### About Raiku Raiku is reengineering blockchain infrastructure from first principles to make global digital markets as precise and dependable as physical systems. Built on Solana, our platform delivers deterministic execution, guaranteed inclusion, and low-latency performance—solving the foundational failures that cause transactions to miss, trades to revert, and systems to collapse under pressure. By placing high-performance compute close to where transactions happen and coordinating execution through our advanced scheduling engine, Raiku empowers developers to build scalable, high-performance applications—and gives institutions the reliability and control they demand. We believe financial infrastructure should behave like physics: fast, reliable, and predictable—every time, without exception. ### The Role As a **Senior Validator Engineer** at Raiku, you will spearhead our engineering efforts surrounding the Solana Validator (both Agave and Firedancer), driving the innovation and implementation of next-generation high-performance distributed systems. You will work with a highly proficient team of core engineers who have all contributed to foundational and novel network components. We believe in hiring only exceptional individuals who are highly motivated to work on complex core infrastructure challenges and are motivated by a coherent system design philosophy that will drive our industry forward. Expect frequent group discussions on architecture, new design specs and code reviews. We are all deeply committed to frequently shipping quality code. ### About the Engineering Team As a team, we are building an entire platform written in Rust, which connects the L1 and L2s and allows us to create deeply complex scenarios and interactions with the running network. While your primary focus for this role is Rust and complex interactions with different components, you may also be involved with other types of development, including exposure to the C-coded Firedancer validator. Working at every level of the stack is necessary to understand the big picture and how everything is wired together. ### Responsibilities: - Highly motivated to contribute to our mission and be part of something bigger. Excited to work on projects that are ground-breaking and complex - Refactor, improve and lead software design and implementation - Communicate effectively with the team and document your code. It is also expected that senior engineers mentor less experienced engineers. - Read and understand protocol specifications and be able to break them into issues and turn them into code. - Maintaining a large codebase with many components and keeping it well-designed, future-proofed, modular and highly performant - Automate security testing processes and benchmarks, creating innovative tools and frameworks for continuously improving our systems. ### What You'll Bring: - A bachelor's or master's degree in Computer Science, Engineering, or a related technical field, coupled with practical experience in blockchain systems. - At least 3 years of experience in blockchain and distributed systems, with a deep technical understanding of L1 and L2 architectures. Ideally Solana. - Experience in writing efficient low-level code. - Strong proficiency in Rust, with at least 2-3 years of Rust experience. - Demonstrated experience in designing, developing, and deploying scalable distributed systems. - An analytical mindset with the ability to anticipate and mitigate emerging security threats, leveraging a comprehensive understanding of the blockchain landscape. ### Preferred Qualifications: - Proficiency and experience in writing C code. - Direct involvement in the Solana ecosystem, with contributions to Solana's runtime, scheduler, or other core systems. - Active open-source contributions in core protocol engineering (such as Firedancer, Reth, Lighthouse, Geth, …). - Active engagement with the blockchain security research community, evidenced by contributions to open-source projects, publications, or presentations at notable conferences. ### Benefits: - Competitive remuneration packages based on iterative market research, including tokens - Remote-first and self-initiating with flexible hours - Work with team players who are genuinely excited about their impact and projects. - A dynamic and collaborative work environment that fosters innovation and recognises individual contributions to collective success - Opportunities for professional growth and advancement within a rapidly growing technological frontier *When applying, mention the word CANDYSHOP to show you read the job post completely.*

rustsolanablockchain+5 更多
查看详情
full time
remote in France

🇫🇷 This job ad is written in French. 🇫🇷 🌍 À propos de Scalingo Scalingo est une startup technologique en forte croissance. Notre plateforme cloud européenne, robuste et souveraine, libère les équipes techniques des contraintes d’infrastructure, pour leur permettre de se concentrer sur ce qui compte vraiment : créer, innover et délivrer. Notre PaaS permet de déployer et d’héberger facilement des applications web et des bases de données, sans avoir à gérer l’administration système ou l’infrastructure sous-jacente. Nous accompagnons une grande diversité de clients — startups, scale-ups, grands groupes et institutions publiques — parmi lesquels le Ministère de l’Intérieur ou ENGIE, avec une exigence élevée en matière de fiabilité, de sécurité et de qualité de service. 🎯 Ton rôle chez Scalingo En tant que Senior Site Reliability Engineer, tu occupes une position clé à l’interface des équipes développement, infrastructure, sécurité et support.A terme, nous ennvisageons une évolution vers un rôle managérial. Ton rôle est à la fois : technique, avec un fort impact sur la fiabilité et la performance de la plateforme, structurant, en faisant évoluer les pratiques et les outils SRE et audelà., fédérateur, en accompagnant et faisant monter en compétence une équipe SRE de 2 personnes. Tu interviens aussi bien sur le fonctionnement quotidien de l’activité SRE que sur les projets stratégiques liés à la croissance de la plateforme. Référent ou référente technique, tu incarnes les bonnes pratiques SRE et contribues à diffuser une culture de la fiabilité, de l’automatisation et de l’excellence opérationnelle au sein de Scalingo. 🧩 Pourquoi ce rôle est essentiel Garantir la stabilité, la disponibilité et la résilience des systèmes en production. Anticiper les défaillances et structurer des réponses efficaces aux incidents. Industrialiser et automatiser l’exploitation de la plateforme. Maintenir un haut niveau de qualité de service vis-à-vis de nos clients et de nos engagements contractuels (SLA). Chaque amélioration que tu apportes contribue directement à la robustesse de la plateforme, à la réduction des incidents, à la maîtrise des coûts opérationnels et à l’accompagnement de la croissance de Scalingo. 🤝 Organisation & évolution Rattaché directement à un Engineering Manager, tu exerces un leadership technique et opérationnel fort, sans responsabilité hiérarchique directe dans un premier temps. À moyen terme, nous souhaitons que ce rôle évolue vers le management hierarchique de l’équipe SRE. Si cette perspective t’intéresse, nous t’accompagnerons activement dans ta montée en compétences managériale. Vos missions Leadership technique et animation de l’équipe SRE Encadrer techniquement l’équipe SRE au quotidien : accompagnement, priorisation, revue des choix techniques et des implémentations. Guider, former et faire monter en compétence les membres de l’équipe, en favorisant l’autonomie et la prise d’initiative. Transmettre les bonnes pratiques SRE (fiabilité, observabilité, gestion d’incidents, automatisation). Être moteur dans l’organisation du travail de l’équipe (processus, rituels, documentation). Porter la vision technique SRE et la décliner dans les projets structurants. Fiabilisation et amélioration continue des services Analyser les performances, identifier les points de contention et proposer des améliorations pour optimiser l’utilisation des ressources et la montée en charge. Définir, mettre en place et améliorer les outils d’observabilité (monitoring, métriques, logs, alerting), avec une approche proactive de la détection d’incidents. Rédiger des processus d’exploitation, les maintenir et les faire évoluer. Assurer une veille technologique continue afin de proposer des évolutions pertinentes de l’infrastructure. Gestion des incidents et support Assurer en partie le support client de niveau 3, en lien avec les équipes support et selon les SLA. Participer activement à la gestion des incidents, ainsi qu'aux cycles d'astreintes (environ une demi-semaine toutes les trois semaines). Intervenir rapidement lors des incidents critiques afin d’en limiter l’impact et d’assurer la continuité des services. Piloter et animer les rétrospectives d’incidents (post-mortems), en identifiant les causes racines et en définissant des actions correctives durables. Rédiger et publier les rapports post-mortem à la suite des incidents majeurs. Assurer la coordination et la communication de crise, en interne comme auprès des clients. Sécurité, conformité et continuité d’activité Veiller au respect des engagements de service (SLA, RPO, RTO) sur le périmètre SRE. Mettre en place des indicateurs de mesure de la qualité des services (SLO). Contribuer activement à la conformité ISO 27001 et HDS : respect des processus, participation aux audits internes et externes. Planifier, exécuter et analyser les tests réguliers des dispositifs de continuité et de reprise d’activité (PCA/PRA). Collaboration interne et contribution transverse Collaborer étroitement avec les équipes de développement afin d’intégrer les exigences d’exploitabilité (fiabilité, performance, sécurité opérationnelle) dès la conception. Être force de proposition auprès des équipes produit et techniques sur les sujets de fiabilité, d’expérience client et des outils d'administration. Contribuer à la rédaction, à la structuration et au maintien d’une documentation opérationnelle claire et à jour. Vos compétences 🔎 Ce que tu sais faire en arrivant : Une solide expertise des environnements cloud et infrastructures distribuées, avec une culture forte de la haute disponibilité et de la fiabilité en production. Une maîtrise des pratiques d’observabilité (logs, métriques, alerting) et une capacité de diagnostic structurée sur des incidents complexes. Une bonne compréhension des environnements conteneurisés et de leurs enjeux opérationnels. Des compétences confirmées en bases de données en production : fiabilité, sauvegardes, restauration, réplication et montée en charge. Une pratique de l’Infrastructure as Code et de l’automatisation des environnements. Une sensibilité aux enjeux de sécurité opérationnelle. Une aisance dans l’utilisation des outils d’Intelligence Artificielle pour gagner en efficacité au quotidien. Une capacité à évoluer dans des contextes complexes, changeants ou incertains, avec rigueur et fiabilité. Une aisance dans la priorisation, y compris en situation d’incident. Une communication claire et structurée, un goût pour la collaboration transverse et le partage des connaissances. Une posture blameless, de la curiosité technique, du sang-froid et une attention portée à l’impact utilisateur. Une capacité à exercer un leadership technique, à transmettre et à faire progresser les pratiques collectives. Avantages Full remote avec 1 déplacement par trimestre (Strasbourg ou autre ville) Evenéments d'entreprise : 1 Offsite annuel et des afterworks réguliers Prime de télétravail (57,60€) Ticket Restaurant (11,52 € par unité) et carte Swile avec ses avantages Mutuelle prise en charge à 100% par Scalingo (BENEFIZ) Horaires flexibles en convention de forfait horaires (RTT) Ordinateur portable sous Linux Budget d'équipements complémentaires (participation) 🧭 Processus de recrutement Call de pré-qualification (30 min) : nous t’appelons pour te présenter l’offre et la clarifier si besoin. C’est toi qui décides si tu souhaites poursuivre l’étape suivante. Test de pré-screening (30 min) : un test standardisé de type QCM, à passer en ligne. Il nous permet d’évaluer les candidatures de manière objective, en limitant les biais de recrutement. Une note minimale est requise pour passer cette étape. Test hard-skill (quelques heures sur 7 jours) : un test technique à réaliser et à nous restituer à la date de ton choix, après avoir pris connaissance des consignes. L’objectif est d’évaluer tes compétences, tes habitudes et tes bonnes pratiques en lien avec le poste. Nous t'encouragerons à démontrer que tu sais utiliser le meilleur de l'I.A. Premier entretien structuré – skill & aptitude fit (1h30) : un échange avec les membres de l’équipe impliqués dans le recrutement, pour discuter de tes compétences et de ton expérience, et évaluer leur adéquation avec le poste. Second entretien structuré – culture fit & confirmation mutuelle (1h30) : un entretien avec un co-fondateur ou un autre membre de l’équipe, afin de vérifier des deux côtés que nous avons envie de travailler ensemble. 🌱 La vie chez Scalingo Chez Scalingo, nous sommes un acteur technologique exigeant, au service aussi bien de startups que de grandes entreprises et d’institutions publiques, sans être une méga-corporation. Cette position nous permet de conjuguer haut niveau d’exigence technique, impact concret et environnement de travail à taille humaine. Nous cultivons une culture du no bullshit : nous faisons ce que nous disons, nous prenons la responsabilité de nos succès comme de nos échecs, et nous privilégions des échanges honnêtes et directs. L’amélioration continue fait partie de notre ADN : nous questionnons régulièrement nos produits, nos pratiques et notre organisation pour progresser durablement. Chez Scalingo, nous avançons ensemble. La collaboration, la confiance et le soutien mutuel sont au cœur de notre manière de travailler. Nous évitons les silos et favorisons la transparence par défaut, afin que chacun puisse comprendre les enjeux, les décisions et le travail des autres. Nous accordons une grande importance à l’autonomie et à la responsabilité. Chacun est encouragé à prendre des initiatives, à faire des choix éclairés et à contribuer activement à l’évolution de l’entreprise, avec un cadre managérial présent et un suivi régulier. Enfin, nous croyons fermement à l’égalité des opportunités. Nous recrutons des personnes avant des CV, valorisons la diversité des parcours et veillons à créer un environnement respectueux, inclusif et équitable pour toutes et tous.

site-reliability-engineeringcloudcicd+3 更多
查看详情
full time
onsite/hybrid in united-states

## Company Description We provide Recruitment and Staffing services to many industries and domains through our innovative and customized solutions and passionate commitment to research. Our ability to understand hiring strategies, talent availability, and compensation benchmarking makes us a proud hiring partner for various industries. We work as trusted business partners and always strive to deliver the most value and highest return on investment for our clients. We are highly trained business professionals with a strong understanding of clients' needs. We work closely with leading staffing trade associations, training, and research organizations to ensure we are knowledgeable of the latest industry trends and technologies. ## Job Description **Must be Ex-Verizon** Able to work any shift in a 24x7 rotation. **Must haves:** - 8+ years of experience in telecom network provisioning and grooming. - Strong background with network turn-downs, traffic migration, activations/implementations, and maintenance. - Well-versed in related network technologies (Layer 1): - SONET (OC3, OC12, OC48, OC192) - TDM-Copper (DS0, DS1, DS3, SW voice trunks, PRI) - Experience with Ciena (4200, 6500, etc.), Fujitsu/Flashwave SONET, Nortel MUX, Optera, and DXCs (Telabs, Alcatel, Titans). - Prior Verizon system experience (BGW, TCOMS, Maintenance Tracker, etc.). - Well-versed in central office environments. - Able to work morning, mid, or night shifts (24x7 schedule). **Pluses:** - GIG-E, 10G, 100G, 400K, PIP. **Day to day:** Looking for a Network Grooming/Provisioning Engineer to work remotely. You will primarily be prepping/testing, grooming, and provisioning the customer's network in an effort to migrate/cut over traffic off of network elements that are being decommissioned for a central office. You will focus on layer 1 technologies (TDM/SONET) but support other network technologies as needed based on what else the environment encompasses. Tasks will include: 1. Collaborating with onsite techs at the central office/client site. 2. Verifying pathways to migrate traffic. 3. Taking action for maintenance, activation, and implementation activities. 4. Partnering with internal teams to assist with additional guidance. 5. Providing updates and escalations to the assigned program leader. 6. Scheduling tasks. **Regards,** **Mohammed Ilyas,** **PH - 229-264-4024 or Text - 229-469-1455 or you can share the updated resume at Mohammed@vtekis.com** ## Additional Information All your information will be kept confidential according to EEO guidelines. *When applying, mention the word CANDYSHOP to show you read the job post completely.*

site-reliability-engineering
查看详情
full time
remote

## About Consensys Consensys is the leading blockchain and web3 software company founded by Joe Lubin, CEO of Consensys and Co-Founder of Ethereum. Since 2014, Consensys has been at the forefront of innovation, pioneering technological developments within the web3 ecosystem. Through our product suite, including the MetaMask platform, Infura, Linea, Diligence, and our NFT toolkit Phosphor, we have become the trusted collaborator for users, creators, and developers on their path to build and belong in the world they want to see. Whether building a dapp, an NFT collection, a portfolio, or a better future, the instinct to build is universal. Consensys inspires and champions the builder instinct in everyone by making web3 universally easy to use and develop on. Our mission is to unlock the collaborative power of communities by making the decentralized web universally easy to access, use, and build on. You’ll get to work on the tools, infrastructure, and apps that scale these platforms to onboard one billion participants and 5 million developers. You’ll be constantly exposed to new concepts, ideas, and frameworks from your peers, and as you work on different projects — challenging you to stay at the top of your game. You’ll join a network of builders that reaches the edge of our ecosystem. Consensys alumni have moved on to become tech entrepreneurs, CEOs, and team leads at tech companies. ## About MetaMask We’re building for a future where the internet and world economy empowers people through interactions based on consent, privacy, and free association. Where both communities and individuals flourish. To accomplish that, we’re working hard to make web3 accessible for everyone around the world. MetaMask is both a crypto wallet and a gateway to the decentralized web. Our tools help people create communities, play video games, access financial services, make payments, invest in assets, protect against economic turmoil, and more. Our browser extension and mobile platforms meet the needs of millions of users and developers across the world. Originally a humble key manager, today MetaMask serves over 30 million monthly active users as a decentralized application development platform, an aggregator of decentralized cryptocurrency exchanges, and a decentralized identity manager. ## What you’ll do As a **DevSecOps Engineer** within MetaMask & Infura you will be joining a world reference technical team with State of the Art expertise in developing & delivering advanced decentralized applications (Ðapps) and supporting development and production infrastructure. Your job will be to deliver, upgrade and maintain infrastructure with high cybersecurity standards (ISO/SOC2). You will be involved and drive in our code deployment (CI / CD) and setting-up, configuring and running development/test and staging/production infrastructure across multiple products and critical applications and multiple cloud providers (AWS, Azure). With the objective to drive towards improving DORA metrics across our group. This position requires a combination of technological skills, organizational skills, thinking strategically, security first mindset, pro-activity, problem solving & creativity, pedagogical methodology, team spirit and strong documentation practices. It involves collaborating with developers, SREs, Product Managers and other roles within the business group and also our Customer Success SRE and cross cutting infrastructure teams working on multiple types of problems & challenges, empowering development teams on a day to day while thinking strategically and planning for platform growth. ## Would be great if you brought this to the role - A DevSecOps engineering mindset - 4+ years experience as a Software Engineer - 5+ years of experience as an SRE, Dev(Sec)Ops Engineer, Security Engineer or similar - Familiarity in one or more of the following languages: Javascript/NodeJS/Typescript, GoLang, Python - Experience with cloud environments such as AWS and/or Azure - Experience with Kubernetes, Helm, and containerization - Experience configuring CI/CD pipelines (e.g., GitHub, or CircleCI) - Experience with monitoring systems (LGTM, Prometheus, adot) - Experience with Infrastructure as Code (Terraform) - Good understanding of networking concepts (load balancers, routers, web application firewalls, ingress controllers) and cybersecurity practices - Experience with setting-up highly secure infrastructures with no Single Point of Failure, High Availability and Disaster Recovery - Ability to operate in a decentralized team and remote working environment - Strong documentation practices - Optional: experience in CI/CD, testing for iOS/Android apps at scale are a plus Don't meet all the requirements? Don't sweat it. We’re passionate about building a diverse team of humans and as such, if you think you've got what it takes for our chaotic-but-fun, remote-friendly, start-up environment—apply anyway, detailing your relevant transferable skills in your cover letter. While we have a pretty good idea of what we need, we're ready for you to challenge our thinking on who needs to be in this role. ## Additional Information *It is a requirement of employment in this position that applicants will be required to submit to background checks including but not limited to employment, education and criminal record checks. Further details will be provided to applicants that successfully meet the criteria for the position as determined by the company in its sole discretion. By submitting an application for employment, you are acknowledging and consenting to this requirement.* *The salary range for US-based candidates only will be determined throughout the interview process depending on experience and skills. Candidates should anticipate a base salary (not including bonus, equity or other benefits) of $160,000 - $218,000* **US pay range (not including bonus, equity or other benefits)** $164,000 — $218,000 USD *Consensys is an equal opportunity employer. We encourage people from all backgrounds to apply. We are committed to ensuring that our technology is made available and accessible to everyone. All employment decisions are made without regard to race, color, national origin, ancestry, sex, gender, gender identity or expression, sexual orientation, age, genetic information, religion, disability, medical condition, pregnancy, marital status, family status, veteran status, or any other characteristic protected by law. Consensys is aware of fraudulent recruitment practices and we encourage all applicants to review our best practices to protect yourself which can be found [here](https://Consensys.net/careers/best-practices-to-avoid-recruitment-fraud/).* In the rapidly evolving Web3 space, we believe that everyone is a builder. This expansive paradigm requires a range of backgrounds, talents, skills, and experiences to influence and shape the future. At Consensys, this diversity fuels our ability to shift control and redefine the realm of possibility. We are committed to ensuring that our technology empowers people and communities with economic and political agency through decentralized technologies. We welcome the range of perspectives and differences and celebrate them. We're excited to see how you

web3blockchainethereum+5 更多
查看详情
full time
onsite/hybrid in singapore singapore

## **What You’ll Do** * Write and own substantial amounts of production code in the most complex, high-risk, and business-critical areas of the platform. * Take end-to-end ownership of system behavior in production, especially under failure, stress, or adversarial conditions. * Operate confidently in ambiguous problem spaces, defining constraints, risks, and execution paths when clarity does not yet exist. * Anticipate and neutralize systemic technical, operational, and product risks through careful design, invariants, and operational discipline. * Simplify complex systems by eliminating accidental complexity, unsafe patterns, and brittle abstractions. * Set technical direction primarily through reference implementations, durable abstractions, and sustained ownership of core systems. * Act as one of the strongest technical coaches on the team through high-signal code reviews, design discussions, and example. * Influence product and technical decisions by deeply understanding system behavior, user incentives, and long-term trade-offs. * Participate actively in incident response, and ensure durable improvements through root-cause analysis and follow-through. * Raise the bar for correctness, reliability, and operability by making the right designs and behaviors the default for others. ## **What We’re Looking For** We’re looking for exceptional individual contributors who consistently demonstrate strong performance across our core competencies, and who provide outsized leverage through a small number of exceptional strengths. In particular, you should have: * A track record of owning and evolving complex, production-critical systems. * Exceptional engineering judgment in ambiguous, high-risk, or adversarial environments. * Deep comfort reasoning about failure modes, edge cases, and second- or third-order effects. * Strong operational ownership, including designing for observability, safe failure, and recoverability. * The ability to simplify complex systems without losing essential correctness or safety. * A demonstrated ability to influence others through code quality, design clarity, and technical credibility. * Clear, precise communication, especially when articulating risks, trade-offs, and system behavior. **Nice to Have** * Deep domain expertise in trading systems, exchange mechanics, or financial infrastructure. * Expertise in one or more high-leverage areas such as: * Reliability and operational excellence * Adversarial or incentive-aware system design * Architectural simplification and technical leverage * Deterministic, performance-sensitive, or event-driven systems * Experience operating systems under significant load, stress, or real-world failure scenarios. ## **This Role Is Not a Fit If** * You prefer narrowly scoped work with clearly defined requirements. * You are uncomfortable owning ambiguous, high-risk problems end-to-end. * You avoid deep production ownership or incident responsibility. * You prefer influencing through authority or process rather than through technical leadership. * You are not interested in remaining deeply hands-on as an individual contributor. ## **How Success Is Measured** Success in this role is measured by: * The long-term correctness, stability, and clarity of the systems you anchor. * Reduction of systemic risk and classes of failure over time. * How much easier it becomes for others to do the right thing because of your work. * The degree to which your technical judgment is trusted on the hardest problems. ## Final Note Principal Engineers here are **not managers and not detached architects**. They are exceptional ICs who take responsibility for the hardest problems and make the organization meaningfully stronger by doing the work that only they can do. When applying, mention the word **CANDYSHOP** to show you read the job post completely.

blockchainweb3crypto+3 更多
查看详情
full time
onsite/hybrid in singapore singapore

## **What You’ll Do** * Lead the design and delivery of complex systems and multi-service projects in our trading platform. * Take system-level ownership of **correctness, reliability, and operability** in production. * Anticipate and mitigate architectural, operational, and product risks before they result in incidents or user impact. * Act as a technical co-lead with Product, Risk, Operations, and other stakeholders to drive clarity, alignment, and execution. * Provide high-quality code reviews that surface risks, clarify assumptions, and raise engineering standards. * Coach and mentor engineers through design discussions, feedback, and day-to-day collaboration. * Drive technical leverage by introducing abstractions, patterns, or platforms that structurally improve correctness, reliability, and long-term velocity. * Participate actively in incident response and ensure durable improvements through root-cause analysis and follow-up work. * Communicate complex technical decisions clearly and effectively to both technical and non-technical audiences. ## **What We’re Looking For** We’re looking for engineers who consistently demonstrate strong performance across our core competencies, and who are ready to operate with **broader scope and higher leverage**. In particular, you should have: * Proven ability to reason about and evolve complex, stateful systems safely. * Strong engineering judgment and a track record of making thoughtful trade-offs in ambiguous situations. * Deep ownership of production systems, including incident response and post-incident improvement. * A clear bias toward correctness, reliability, and defensive system design. * The ability to raise the quality and effectiveness of others through coaching, reviews, and example. * Strong cross-functional collaboration skills, including the ability to push back thoughtfully and align stakeholders around shared outcomes. * Clear, precise communication, especially when discussing risks, trade-offs, and complex system behavior. **Nice to Have** * Deep experience with trading systems, exchanges, or other financial infrastructure. * Experience designing systems with adversarial users or strong economic incentives. * Expertise in Go (Golang), distributed systems, performance engineering, or event-driven architectures. * Prior experience driving architectural or platform-level improvements in a production environment. ## **This Role Is Not a Fit If** * You prefer focusing primarily on individually scoped tasks or features. * You are uncomfortable owning system-level outcomes or production behavior. * You avoid cross-functional responsibility or difficult alignment conversations. * You optimize for short-term delivery over long-term correctness, reliability, and maintainability. * You are not interested in mentoring, coaching, or raising the technical bar for others. We care deeply about judgment, ownership, and impact, especially in complex and high-stakes systems. Strong candidates for this role demonstrate not only technical depth, but the ability to make the team and the system meaningfully better over time. When applying, mention the word **CANDYSHOP** to show you read the job post completely.

blockchaingoweb3+2 更多
查看详情
full time
remote worldwide

## **Join Tether and Shape the Future of Digital Finance** At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exchanges and wallets to payment processors and ATMs—to seamlessly integrate reserve-backed tokens across blockchains. By harnessing the power of blockchain technology, Tether enables you to store, send, and receive digital tokens instantly, securely, and globally, all at a fraction of the cost. Transparency is the bedrock of everything we do, ensuring trust in every transaction. ### **Innovate with Tether** * **Tether Finance:** Our innovative product suite features the world’s most trusted stablecoin, **USDT**, relied upon by hundreds of millions worldwide, alongside pioneering digital asset tokenization services. But that’s just the beginning: * **Tether Power:** Driving sustainable growth, our energy solutions optimize excess power for Bitcoin mining using eco-friendly practices in state-of-the-art, geo-diverse facilities. * **Tether Data:** Fueling breakthroughs in AI and peer-to-peer technology, we reduce infrastructure costs and enhance global communications with cutting-edge solutions like **KEET**, our flagship app that redefines secure and private data sharing. * **Tether Education:** Democratizing access to top-tier digital learning, we empower individuals to thrive in the digital and gig economies, driving global growth and opportunity. * **Tether Evolution:** At the intersection of technology and human potential, we are pushing the boundaries of what is possible, crafting a future where innovation and human capabilities merge in powerful, unprecedented ways. ### **Why Join Us?** Our team is a global talent powerhouse, working remotely from every corner of the world. If you’re passionate about making a mark in the fintech space, this is your opportunity to collaborate with some of the brightest minds, pushing boundaries and setting new standards. We’ve grown fast, stayed lean, and secured our place as a leader in the industry. If you have excellent English communication skills and are ready to contribute to the most innovative platform on the planet, Tether is the place for you. **Are you ready to be part of the future?** ## **About the job** We are seeking a highly skilled **DevOps Specialist to architect, implement, and maintain end-to-end CI/CD pipelines using GitHub**, with a strong focus on test-driven deployment, automated release management, and Infrastructure as Code (IaC) practices. The ideal candidate will containerize services using Docker, manage build, compilation, and publishing workflows for JavaScript, TypeScript, and C++ packages, and design robust multi-language build pipelines leveraging CMake or similar build tools. The role also includes automating build, tagging, and publication processes across web, desktop, and mobile platforms to ensure consistent and traceable releases. Deep expertise in Linux system administration, networking, and IaC tools is essential to deliver scalable, secure, and highly available deployments. ## **Responsibilities** * **Lead the design, architecture, and management of CI/CD pipelines** using GitHub Actions (and similar tools), ensuring fast, reliable, and reproducible software delivery. * **Implement and enforce test-driven deployment systems**, integrating automated testing, validation, and monitoring to maintain code quality and accelerate feedback cycles. * **Containerize applications and microservices** with Docker, optimize image builds, and manage deployment pipelines for distributed environments. * **Oversee the build, packaging, and publishing lifecycle** for JavaScript, TypeScript, and C++ packages, including versioning, semantic tagging, and NPM or internal registry publication. * **Develop and maintain cross-platform build pipelines** using CMake or equivalent tools, ensuring consistent compilation and release workflows across web, desktop, and mobile. * **Automate end-to-end release processes**, including tagging, building, signing, and distributing mobile, web, and desktop applications. * **Define and manage Infrastructure as Code (IaC)** to provision and maintain reliable, scalable, and secure infrastructure environments. * **Collaborate closely with development, QA, and operations teams** to troubleshoot deployment issues, optimize performance, and improve release reliability. * **Continuously improve observability and feedback loops**, leveraging monitoring and alerting systems to maintain operational excellence. ## **Mandatory** * **Bachelor’s or Master’s degree** in Computer Science, Engineering, or a related discipline. * **3+ years of hands-on experience** architecting and maintaining CI/CD pipelines using GitHub Actions or equivalent tools at scale in a production environment. * **Strong proficiency in test-driven deployment methodologies**, including writing and maintaining automated test suites for integration and end-to-end validation. * **Expertise in containerization technologies** such as Docker, including image creation, registry management, and basic orchestration patterns. * **Experience managing package lifecycles** for JavaScript and TypeScript, including versioning, compilation, semantic tagging, and publishing workflows to NPM. * **In-depth knowledge of C++ build systems**, specifically CMake, with proven experience optimizing native build and deployment pipelines. * **Advanced Linux system administration and networking skills**, including shell scripting, package management, performance troubleshooting, firewalls, and VPN configuration. * **Excellent communication, problem-solving, and collaboration skills**, with the ability to work effectively in globally distributed teams. * **Experience with Infrastructure as Code (IaC)** tools such as Terraform, Ansible, AWS CDK or AWS CloudFormation. * **Experience with mobile CI/CD automation**, including build, tagging, and publication for iOS and Android applications. * **Advanced knowledge of release management practices**, including automated versioning, signing, and artifact distribution. ## **Preferred** * **Experience with cloud platforms** (AWS, GCP, Azure) and their managed services. * **Knowledge of security best practices** for CI/CD pipelines and infrastructure (Secrets Management, SAST/DAST). * **Familiarity with monitoring and observability stacks** (Prometheus, Grafana, ELK, Datadog). * **Contributions to open-source projects** or a public portfolio of relevant work.

dockercicdjavascript+6 更多
查看详情
full time
remote worldwide

## **Join Tether and Shape the Future of Digital Finance** At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exchanges and wallets to payment processors and ATMs—to seamlessly integrate reserve-backed tokens across blockchains. By harnessing the power of blockchain technology, Tether enables you to store, send, and receive digital tokens instantly, securely, and globally, all at a fraction of the cost. Transparency is the bedrock of everything we do, ensuring trust in every transaction. ### **Innovate with Tether** * **Tether Finance:** Our innovative product suite features the world’s most trusted stablecoin, **USDT**, relied upon by hundreds of millions worldwide, alongside pioneering digital asset tokenization services. But that’s just the beginning: * **Tether Power:** Driving sustainable growth, our energy solutions optimize excess power for Bitcoin mining using eco-friendly practices in state-of-the-art, geo-diverse facilities. * **Tether Data:** Fueling breakthroughs in AI and peer-to-peer technology, we reduce infrastructure costs and enhance global communications with cutting-edge solutions like **KEET**, our flagship app that redefines secure and private data sharing. * **Tether Education:** Democratizing access to top-tier digital learning, we empower individuals to thrive in the digital and gig economies, driving global growth and opportunity. * **Tether Evolution:** At the intersection of technology and human potential, we are pushing the boundaries of what is possible, crafting a future where innovation and human capabilities merge in powerful, unprecedented ways. ### **Why Join Us?** Our team is a global talent powerhouse, working remotely from every corner of the world. If you’re passionate about making a mark in the fintech space, this is your opportunity to collaborate with some of the brightest minds, pushing boundaries and setting new standards. We’ve grown fast, stayed lean, and secured our place as a leader in the industry. If you have excellent English communication skills and are ready to contribute to the most innovative platform on the planet, Tether is the place for you. **Are you ready to be part of the future?** ## **About the job** We are seeking a highly skilled **DevOps Specialist to architect, implement, and maintain end-to-end CI/CD pipelines using GitHub**, with a strong focus on test-driven deployment, automated release management, and Infrastructure as Code (IaC) practices. The ideal candidate will containerize services using Docker, manage build, compilation, and publishing workflows for JavaScript, TypeScript, and C++ packages, and design robust multi-language build pipelines leveraging CMake or similar build tools. The role also includes automating build, tagging, and publication processes across web, desktop, and mobile platforms to ensure consistent and traceable releases. Deep expertise in Linux system administration, networking, and IaC tools is essential to deliver scalable, secure, and highly available deployments. ## **Responsibilities** * **Lead the design, architecture, and management of CI/CD pipelines** using GitHub Actions (and similar tools), ensuring fast, reliable, and reproducible software delivery. * **Implement and enforce test-driven deployment systems**, integrating automated testing, validation, and monitoring to maintain code quality and accelerate feedback cycles. * **Containerize applications and microservices** with Docker, optimize image builds, and manage deployment pipelines for distributed environments. * **Oversee the build, packaging, and publishing lifecycle** for JavaScript, TypeScript, and C++ packages, including versioning, semantic tagging, and NPM or internal registry publication. * **Develop and maintain cross-platform build pipelines** using CMake or equivalent tools, ensuring consistent compilation and release workflows across web, desktop, and mobile. * **Automate end-to-end release processes**, including tagging, building, signing, and distributing mobile, web, and desktop applications. * **Define and manage Infrastructure as Code (IaC)** to provision and maintain reliable, scalable, and secure infrastructure environments. * **Collaborate closely with development, QA, and operations teams** to troubleshoot deployment issues, optimize performance, and improve release reliability. * **Continuously improve observability and feedback loops**, leveraging monitoring and alerting systems to maintain operational excellence. ## **Mandatory Requirements** * **Bachelor’s or Master’s degree** in Computer Science, Engineering, or a related discipline. * **3+ years of hands-on experience** architecting and maintaining CI/CD pipelines using GitHub Actions or equivalent tools at scale in a production environment. * **Strong proficiency in test-driven deployment methodologies**, including writing and maintaining automated test suites for integration and end-to-end validation. * **Expertise in containerization technologies** such as Docker, including image creation, registry management, and basic orchestration patterns. * **Experience managing package lifecycles** for JavaScript and TypeScript, including versioning, compilation, semantic tagging, and publishing workflows to NPM. * **In-depth knowledge of C++ build systems**, specifically CMake, with proven experience optimizing native build and deployment pipelines. * **Advanced Linux system administration and networking skills**, including shell scripting, package management, performance troubleshooting, firewalls, and VPN configuration. * **Excellent communication, problem-solving, and collaboration skills**, with the ability to work effectively in globally distributed teams. * **Experience with Infrastructure as Code (IaC)** tools such as Terraform, Ansible, AWS CDK or AWS CloudFormation. * **Experience with mobile CI/CD automation**, including build, tagging, and publication for iOS and Android applications. * **Advanced knowledge of release management practices**, including automated versioning, signing, and artifact distribution. ## **Preferred Qualifications** * Experience with other CI/CD platforms (e.g., Jenkins, GitLab CI, CircleCI). * Knowledge of Kubernetes or other container orchestration platforms. * Familiarity with cloud platforms (AWS, GCP, Azure) and their DevOps services. * Experience with monitoring and observability tools (e.g., Prometheus, Grafana, ELK stack). * Understanding of security best practices in CI/CD and infrastructure (DevSecOps).

dockercicdblockchain+5 更多
查看详情