What Jobs are available for Software Infrastructure in Hong Kong?
Showing 51 Software Infrastructure jobs in Hong Kong
Infrastructure Engineer
Posted today
Job Description
Job Responsibilities
- Contribute to the design and setup of IT infrastructure, including hardware installation and system configuration
- Monitor server performance and resolve hardware, software, and network issues
- Conduct regular maintenance and perform system updates and upgrades
- Assist in implementing and maintaining IT security protocols and infrastructure protection
- Ensure high availability of IT services by managing server and system operations
- Safeguard business data through effective backup strategies and recovery planning
Job Requirements
- Minimum 3 years of experience in IT infrastructure projects
- Strong knowledge of networking protocols and technologies: TCP/IP, Layer 2/3 switching and routing, policy-based routing, OSPF, load balancing, network segmentation, resilience, and performance diagnostics
- Familiarity with LAN/WAN, DHCP, DNS, routing, switching, and firewall concepts
- Experience in cybersecurity, Active Directory, networking, Microsoft admin tasks (e.g., account/license management), VMware, and Linux is a plus
- HCIA and HCIP (Storage) certifications preferred
- Experience with Fortinet and Huawei firewalls/switches is a plus
- Exposure to virtualization platforms and Kubernetes administration is advantageous
Infrastructure Engineer
Posted today
Job Description
Infrastructure Engineer - Video Generation
About Us
We are a cutting-edge AI startup based in Hong Kong, specializing in next-generation video generation technology. Our mission is to push the boundaries of what's possible in AI-driven video generation through innovation in foundation models. As a growing startup, we offer a dynamic environment where your research can have immediate impact on technology development.
Position Overview
We are seeking an experienced Infrastructure Engineer to architect and manage our AI computing infrastructure. The ideal candidate will have extensive experience in building and scaling ML infrastructure, with particular emphasis on distributed training systems and GPU cluster management.
Key Responsibilities
- Design and implement high-performance computing infrastructure for large-scale AI model training
- Manage and optimize GPU clusters for distributed training workloads
- Build and maintain container orchestration systems for ML workflows
- Implement efficient resource allocation and scheduling systems
- Design and maintain monitoring and alerting systems for compute infrastructure
- Optimize infrastructure costs while maintaining performance
- Collaborate with ML teams to support their computing needs
- Ensure system reliability, security, and scalability
Required Qualifications
- Master's degree in Computer Science, Systems Engineering, or related field
- 8+ years of experience in infrastructure engineering, with focus on ML/AI infrastructure
- Strong experience with:
  - GPU cluster management and optimization
  - Kubernetes and container orchestration
  - Linux system administration
  - Infrastructure as Code (IaC)
- Proven track record in building large-scale computing systems
- Experience with major cloud providers (AWS, GCP, Azure, Alibaba Cloud, Tencent Cloud, etc.)
Preferred Qualifications
- Experience with ML infrastructure at major tech companies
- Knowledge of distributed training systems (PyTorch DDP, Horovod)
- Familiarity with ML frameworks and their infrastructure requirements
- Experience with high-performance networking (InfiniBand, RDMA)
- Background in performance optimization and troubleshooting
- Understanding of ML workload characteristics
- Bilingual proficiency (English/Chinese)
Technical Skills
Computing Infrastructure
- GPU Clusters: NVIDIA DGX, GPU management tools
- Distributed Systems: Slurm, Kubernetes
- ML Platforms: Kubeflow, Ray
- Job Scheduling: YARN, Slurm
Cloud & Networking
- Cloud Platforms:
  - International: AWS, GCP, Azure
  - China: Alibaba Cloud, Tencent Cloud
- Networking: InfiniBand, RDMA, TCP/IP optimization
- Load Balancing: HAProxy, NGINX
Infrastructure Management
- Container Technologies: Docker, Kubernetes, Singularity
- IaC: Terraform, Ansible, CloudFormation
- CI/CD: Jenkins, GitLab CI
- Monitoring: Prometheus, Grafana, ELK Stack
Development
- Languages: Python, Go, Shell scripting
- Version Control: Git
- Documentation: Markdown, Confluence
What We Offer
- Opportunity to build cutting-edge AI infrastructure
- Competitive salary and equity package
- Access to latest hardware and technologies
- Professional development opportunities
- Comprehensive health benefits
- Learning and conference budget
Location
- Hong Kong (on-site, Hong Kong Science and Technology Park)
Expected Impact
- Design and implement next-generation AI computing infrastructure
- Optimize resource utilization and cost efficiency
- Improve training speed and efficiency for AI models
- Build scalable and reliable systems
Projects You'll Work On
- Building automated GPU cluster management systems
- Implementing efficient resource scheduling for ML workloads
- Optimizing distributed training infrastructure
- Setting up monitoring and observability systems
- Designing disaster recovery and backup solutions
To Apply:
Please submit:
- Detailed CV highlighting relevant infrastructure projects
- Description of the largest scale system you've built/managed
- Examples of infrastructure optimization achievements
- Professional references
To apply or learn more about this position, please contact
Infrastructure Engineer
Posted today
Job Description
Infrastructure Engineer | Location: Tsuen Wan, Hong Kong | Department: Tech / Infrastructure | Full-time
Join a results-driven, agile team that powers next-gen trading and fintech platforms. Whether you're coding, defending, scaling, or supporting—your work matters.
Infrastructure Engineer: Maintain and upgrade IT infrastructure that powers mission-critical operations.
Our comprehensive packages for you:
We offer an attractive remuneration package, including Annual Leave, Double Pay, Year-end Bonus, Medical Benefits, Holiday and Birthday Benefits, Marriage Leave, Paternity Leave, Maternity Leave, Promotion Opportunities, and Travel Rewards.
Job Highlights
- Manage IT infrastructure across server, storage, and virtualization platforms
- Contribute to high-availability architecture for business continuity
- Opportunity to lead infrastructure projects in a fast-paced fintech environment
Job Responsibilities
- Design, implement, and maintain physical and virtual IT infrastructure
- Ensure uptime and performance of systems through proactive monitoring
- Maintain and upgrade hardware and software systems as needed
- Handle backup, recovery, and disaster recovery planning
- Work closely with network and security teams on system integration
- Provide infrastructure documentation and technical support
Job Requirements
- Bachelor's degree in IT, Computer Engineering, or a related discipline
- Minimum 3 years' experience in system administration or infrastructure support
- Proficiency in VMware, Windows/Linux servers, storage systems, and backup tools
- Knowledge of cloud services (e.g., AWS, Azure) is an advantage
- Familiarity with automation tools and scripting (e.g., PowerShell, Python)
- Strong analytical and troubleshooting skills
Apply Now
Send your full resume with current & expected salary via "Quick Apply".
Join us and turn every challenge into a stepping stone of your career.
Infrastructure Engineer
Posted today
Job Description
AT Global Services Limited is seeking talent who is detail-oriented, accurate, responsible, independent, proactive, motivated, and interested in joining our IT Team.
The Operations Engineer position will be responsible for providing on-going support, including server, database and cloud platforms, as well as implementation support for enhancing the IT infrastructure.
Responsibilities
• Ensure the IT infrastructure supports business requirements, such as Windows servers, virtualization, disaster recovery, networking, backup and email systems
• Monitor system performance, troubleshoot issues, and ensure system availability and reliability
• Review policies against security and business needs in order to support development, implementation, and ongoing support
• Provide technical support and guidance on operations validation, development and implementation
• Develop operational solutions and assist in the development and maintenance of operational standards and procedures including ITIL processes
• Manage production changes including application deployment
• Adopt cloud infrastructure solutions to support IT projects and company needs
• Ad hoc duties as assigned
Job Requirements
• Degree or above in Computer Engineering/Computer Science or related discipline
• 3+ years' experience in IT Operations
• Hands-on experience in at least two of the following:
  - Windows/VMware/UNIX/Linux operating systems (installation, scripting, configuration and troubleshooting)
  - MySQL/MongoDB/SQL databases (creating and maintaining user roles, assigning privileges, and backup)
  - Cloud computing (Aliyun, Azure, AWS) (configuration, security, storage, backup and disaster recovery)
• Solid experience in supporting financial trading systems is a plus
• Excellent problem-solving skills with the ability to troubleshoot and diagnose complex technical issues
• Self-motivated with a strong sense of responsibility
• Good command of written and spoken English and Chinese
• Good command of spoken Mandarin is preferable
• Willing to work a flexible schedule
We offer an exciting career opportunity and competitive remuneration to the successful candidate. Please send us your detailed resume with current and expected salary by clicking "Apply Now".
All information provided will be treated in strict confidence and used solely for recruitment purposes.
Infrastructure Engineer
Posted today
Job Description
Key Responsibilities
- Collaborate with cross-functional teams and vendors on infrastructure design.
- Site preparation, system design, installation, acceptance, and documentation for customer projects.
- Apply fixes, patches, and system updates as necessary.
- Troubleshoot issues and provide technical support to ensure project success.
- Communicate effectively with customers to ensure customer satisfaction.
Requirements
- Bachelor's degree in Information Technology, Computer Science, or a related field.
- At least 5 years of hands-on experience in server virtualization, storage, and backup.
- Expertise with VMware vSphere, vSAN, or Microsoft Hyper-V. Experience with Chinese-based hypervisors will be an advantage.
- Expertise with NetApp, Dell, or HPE storage.
- Expertise with Veeam or Veritas NetBackup backup software. Experience with Chinese-based backup software will be an advantage.
- Problem-solving skills with analytical thinking.
- Enthusiastic about China technologies.
- Proficient in Cantonese, English and Mandarin.
- Excellent written English and Chinese.
Infrastructure Engineer
Posted today
Job Description
About Moonvalley
Moonvalley's mission is to solve Visual Intelligence in the age of generative AI. We are building technology that can tell stories, scale creativity, and understand both the physics and semantics of the world. With Marey, our first high-definition foundation model trained exclusively on licensed data, we are powering the next era of cinematic, commercial, and enterprise-grade creation.
Our team is an unprecedented convergence of talent across industries. Our elite AI scientists from DeepMind, Google, Microsoft, Meta & Snap have decades of collective experience in machine learning and computational creativity. We have also established the first AI-enabled movie studio in Hollywood, filled with accomplished filmmakers and visionary creative talent. We work with the top producers, actors, and filmmakers in Hollywood as well as creative-driven global brands. So far we've raised over $100M from world-class investors including General Catalyst, Bessemer, Khosla Ventures & Y Combinator, and we're just getting started.
Job Summary
We're hiring an Infrastructure Engineer to design and maintain the systems that power Moonvalley's generative AI research and product development. You'll be joining at a pivotal moment, helping to define the foundations of our infrastructure as we train and deploy cutting-edge video foundation models.
In this role, you'll work closely with researchers, engineers, and cross-functional partners to ensure our infrastructure is scalable, reliable, and efficient. From managing GPU clusters to optimizing ETL pipelines, you'll be instrumental in ensuring the technical performance and productivity of our entire AI platform.
What you'll do
- Build, manage, and scale GPU infrastructure using tools like Kubernetes, Terraform, or Pulumi
- Maintain and optimize ETL pipelines using Spark, Ray, or Airflow
- Operate and improve our telemetry and monitoring stack (Datadog, Grafana, Weights & Biases)
- Manage CI/CD pipelines and development tooling (GitHub, PyTorch, Python)
- Track and optimize datasets, checkpoints, compute utilization, and related assets
- Automate repetitive tasks to improve efficiency and reduce friction across engineering workflows
- Participate in an on-call rotation to resolve infrastructure issues and ensure uptime
- Provide tooling, documentation, and support to accelerate internal engineering productivity
What we're looking for
- Strong generalist with experience managing large-scale, high-performance infrastructure
- Skilled in designing scalable systems for compute, data, and developer tooling
- Comfortable in high-urgency environments with the ability to prioritize for impact
- Familiar with infrastructure stacks for AI model training and experimentation
- Experienced with Kubernetes, Terraform/Pulumi, Spark/Ray, and observability tools
- Pragmatic problem-solver who favors automation and simplicity over complexity
- Open to using and contributing to open-source tooling when appropriate
- Bonus: experience as a Cluster Engineer, Data Engineer, or Developer Advocate in AI/ML environments
What we offer (compensation & benefits)
- Competitive salary and equity
- Private health coverage
- Pension contribution
- Unlimited paid vacation
- Fully-distributed, async-first culture
- Hardware setup of your choice
- Stipends for phone, internet, and meals
In our team, we approach our work with a dedication similar to that of Olympic athletes. Anticipate occasional late nights and weekends dedicated to our mission. We understand this level of commitment may not suit everyone, and we openly communicate this expectation.
If you're motivated by deeply technical problems, a seemingly never-ending uphill battle and the opportunity to build (and own) a generational technology company, we can give you what you're looking for.
All business roles at Moonvalley are hybrid positions by default, with some fully remote depending on the job scope. We meet a few times every year, usually in London, UK or North America (LA, Toronto) as a company.
If you're excited about the opportunity to work on cutting-edge AI technology and help shape the future of media and entertainment, we encourage you to apply. We look forward to hearing from you.
The statements contained in this job description reflect general details as necessary to describe the principal functions of this job, the level of knowledge and skill typically required and the scope of responsibility. It should not be considered an all-inclusive listing of work requirements. Individuals may perform other duties as assigned, including work in other functional areas to cover absences, to equalize peak work periods, or to otherwise balance organizational work.
Moonvalley AI is proud to be an equal opportunity employer. We are committed to providing accommodations. If you require accommodation, we will work with you to meet your needs.
Please be assured we'll treat any information you share with us with the utmost care, only use your information for recruitment purposes and will never sell it to other companies for marketing purposes. Please review our privacy policy and job applicant privacy policy located here for further information.
Infrastructure Engineer
Posted today
Job Description
- Degree holder in Computer Science, Information Technology or relevant discipline;
- Minimum of 3 years of IT-related work experience (more than 5 years is preferable);
- Familiar with network, TCP/IP and Oracle database;
- Familiar with Huawei cloud computing, with experience in installing and troubleshooting OpenStack, ManageOne, FusionCompute, FusionAccess, FusionSphere, etc.;
- Familiar with Huawei V3 storage, OceanStor T V2 series storage, and SNS series storage devices; familiar with installing and troubleshooting RH series and E9000 series servers; experience delivering storage active-active and disaster recovery solutions is preferred;
- Familiar with installing and troubleshooting servers, switches, firewalls and other equipment;
- Applicants with HCIE-Cloud and HCIE-Storage certifications are preferred;
- Excellent problem-solving and strong analytical skills;
- Good command of written and spoken English/Cantonese/Mandarin.
Senior Infrastructure Engineer
Posted today
Job Description
Our client, a Global MNC, is looking for a senior Infrastructure Engineer. You will play a pivotal role in redesigning, maintaining, and evolving the infrastructure landscape to support the brand's digital transformation and operational excellence across datacenter and corporate environments.
Requirements:
- Strong technical expertise in virtualization, server hardware, datacenter management, and OS administration.
- Experienced in monitoring tools such as Zabbix/Grafana.
- Solid understanding of networking fundamentals.
- Excellent communication skills in English (written and spoken); Cantonese or Mandarin will be a plus.
- Familiarity with ITIL frameworks and service management principles.
Cloud Infrastructure Engineer
Posted today
Job Description
Responsibilities:
- Test patches in lower environments (e.g., Dev and Sandbox) before deployment.
- Create and share production-ready commands/scripts for validation in the production environment.
- Provide expert support in AWS cloud security best practices.
- Support HK's application security program through secure design reviews, threat modeling, and code-level security guidance.
- Perform penetration testing of HK applications as requested by the HK security team.
- Validate security fixes and provide re-test reports to confirm closure of identified issues.
- Provide timely mitigation guidance, including recommended patches, configuration changes, or compensating controls.
Senior Infrastructure Engineer
Posted today
Job Description
You will be a senior member of the Victory Fintech infrastructure technology team, reporting to the infrastructure team lead and working closely with the rest of the infrastructure team, the wider technology team, and other company teams.
As part of the company's infrastructure team, joint responsibilities include:
- All aspects of technology infrastructure for the company, including but not limited to on-premise physical and cloud-based servers, networks, networking equipment, office technology, security and AV equipment, and end-user devices such as desktops, laptops, and mobile devices.
- Working closely with the other technology teams in the implementation and operations of developer tooling such as continuous delivery pipelines, source code management, monitoring, alerting, artefact management, key management and other software development infrastructure.
- Following and complying with all company policies and procedures, particularly physical and cyber security policies and procedures.
- Participating in technology team meetings, training and career development activities.
- Working closely with other technology team members, business stakeholders, and product owners to understand and capture future infrastructure requirements.
- Ensuring infrastructure production incidents are managed through their full lifecycle according to company policies.
- Keeping up to date with the latest developments in infrastructure technology and sharing them with the wider team.
- Positively contributing to the culture of the company, helping uphold and further the company and technology Mission, Vision, Values and Principles.
Job Requirements:
- Degree in Computer Science, Information Technology or a related field or equivalent work experience.
- Minimum of 5 years of relevant experience in IT infrastructure in the financial industry.
- Strong experience of working with modern cloud-based infrastructure and working practices (particularly experience with AWS and infrastructure as code, ideally Terraform).
- Strong knowledge and ideally experience of DevOps, DevSecOps and/or SRE working practices.
- Experience working with and managing the full life cycle of container-based infrastructure, including Kubernetes.
- Strong experience of ensuring infrastructure has good observability, including monitoring and alerting.
- Experience of managing CI/CD tooling and infrastructure.
- Strong security mindset, with knowledge of modern security principles, including zero trust.
- Strong experience of timely incident management in production environments
- Knowledge and solid experience in Linux and general networking.
- Excellent communication skills, capable of translating technical information into business language.
- Good command of both Chinese (including Mandarin) and English preferred.