65 Software Infrastructure jobs in Hong Kong
Infrastructure Engineer
Posted 24 days ago
Job Description
About Us
We are a cutting-edge, Hong Kong-based AI startup specializing in next-generation video generation technology. Our mission is to push the boundaries of what's possible in AI-driven video generation through innovation in foundation models. As a growing startup, we offer a dynamic environment where your research can have an immediate impact on technology development.
Position Overview
We are seeking an experienced Infrastructure Engineer to architect and manage our AI computing infrastructure. The ideal candidate will have extensive experience in building and scaling ML infrastructure, with particular emphasis on distributed training systems and GPU cluster management.
Key Responsibilities
- Design and implement high-performance computing infrastructure for large-scale AI model training
- Manage and optimize GPU clusters for distributed training workloads
- Build and maintain container orchestration systems for ML workflows
- Implement efficient resource allocation and scheduling systems
- Design and maintain monitoring and alerting systems for compute infrastructure
- Optimize infrastructure costs while maintaining performance
- Collaborate with ML teams to support their computing needs
- Ensure system reliability, security, and scalability
Requirements
- Master's degree in Computer Science, Systems Engineering, or related field
- 8+ years of experience in infrastructure engineering, with focus on ML/AI infrastructure
- Strong experience with:
  - GPU cluster management and optimization
  - Kubernetes and container orchestration
  - Infrastructure as Code (IaC)
- Proven track record in building large-scale computing systems
- Experience with major cloud providers (AWS, GCP, Azure, Alibaba Cloud, Tencent Cloud, etc.)
- Experience with ML infrastructure at major tech companies
- Knowledge of distributed training systems (PyTorch DDP, Horovod)
- Familiarity with ML frameworks and their infrastructure requirements
- Experience with high-performance networking (InfiniBand, RDMA)
- Background in performance optimization and troubleshooting
- Understanding of ML workload characteristics
Computing Infrastructure
- Job Scheduling: YARN, Slurm
- Networking: InfiniBand, RDMA, TCP/IP optimization
Infrastructure Management
- IaC: Terraform, Ansible, CloudFormation
Development
- Languages: Python, Go, Shell scripting
- Version Control: Git
- Documentation: Markdown, Confluence
- Opportunity to build cutting-edge AI infrastructure
- Competitive salary and equity package
- Access to latest hardware and technologies
- Learning and conference budget
- Hong Kong (on-site, Hong Kong Science and Technology Park)
- Design and implement next-generation AI computing infrastructure
- Optimize resource utilization and cost efficiency
- Improve training speed and efficiency for AI models
- Build scalable and reliable systems
- Building automated GPU cluster management systems
- Setting up monitoring and observability systems
- Designing disaster recovery and backup solutions
Please submit the following:
- Detailed CV highlighting relevant infrastructure projects
- Description of the largest scale system you've built/managed
- Examples of infrastructure optimization achievements
- Professional references
To apply or learn more about this position, please contact
Note: This posting reflects a current opening. Only shortlisted candidates will be contacted.
Infrastructure Engineer
Posted today
Job Description
The Role: We're looking for a hands-on, technically strong candidate to join our team. This Infrastructure Engineer role is ideal for someone with solid experience in both networking and systems who enjoys working on internal projects and operations.
What You'll Do:
- Configure, implement, and support network infrastructure including switches, routers (Cisco & Huawei), wireless Access Points (APs), VLAN setup, VPN configuration, and network security components such as firewalls and access controls.
- Support and maintain server infrastructure including Windows Server and virtualization platforms (VMware/Hyper-V).
- Participate in internal infrastructure projects from planning to deployment, ensuring timely and quality delivery.
- Collaborate with vendors and internal teams for project execution and issue resolution.
- Perform system upgrades, patching, and routine maintenance across network and server environments.
- Provide Level 2 support across infrastructure components, ensuring stability and performance.
What We're Looking For:
- 5+ years of experience in IT infrastructure, with strong hands-on skills in networking and systems.
- Experience with Cisco and/or Huawei network devices.
- Solid understanding of Windows Server, virtualization (VMware/Hyper-V), and basic Microsoft 365 administration.
- Strong troubleshooting skills and ability to work independently.
- Good communication and documentation skills.
- Relevant certifications (e.g., CCNA, Microsoft, ITIL) are a plus.
Why Join Us?
- Be part of a stable and reputable company that offers a long-term career.
- Grow your career with opportunities for advancement and skill development.
- Attractive remuneration package: guaranteed bonus, staff meals, allowances, etc.
We are hiring within the next few weeks. If you think this role is suitable for you, please send your CV to Carsten, or call me at to discuss further.
Infrastructure Engineer
Posted today
Job Description
About Moonvalley
Moonvalley's mission is to solve Visual Intelligence in the age of generative AI. We are building technology that can tell stories, scale creativity, and understand both the physics and semantics of the world. With Marey, our first high-definition foundation model trained exclusively on licensed data, we are powering the next era of cinematic, commercial, and enterprise-grade creation.
Our team is an unprecedented convergence of talent across industries. Our elite AI scientists from DeepMind, Google, Microsoft, Meta & Snap have decades of collective experience in machine learning and computational creativity. We have also established the first AI-enabled movie studio in Hollywood, filled with accomplished filmmakers and visionary creative talent. We work with the top producers, actors, and filmmakers in Hollywood as well as creative-driven global brands. So far we've raised over $100M from world-class investors including General Catalyst, Bessemer, Khosla Ventures & Y Combinator, and we're just getting started.
Job Summary
We're hiring an Infrastructure Engineer to design and maintain the systems that power Moonvalley's generative AI research and product development. You'll be joining at a pivotal moment, helping to define the foundations of our infrastructure as we train and deploy cutting-edge video foundation models.
In this role, you'll work closely with researchers, engineers, and cross-functional partners to ensure our infrastructure is scalable, reliable, and efficient. From managing GPU clusters to optimizing ETL pipelines, you'll be instrumental in ensuring the technical performance and productivity of our entire AI platform.
What you'll do
- Build, manage, and scale GPU infrastructure using tools like Kubernetes, Terraform, or Pulumi
- Maintain and optimize ETL pipelines using Spark, Ray, or Airflow
- Operate and improve our telemetry and monitoring stack (Datadog, Grafana, Weights & Biases)
- Manage CI/CD pipelines and development tooling (GitHub, PyTorch, Python)
- Track and optimize datasets, checkpoints, compute utilization, and related assets
- Automate repetitive tasks to improve efficiency and reduce friction across engineering workflows
- Participate in an on-call rotation to resolve infrastructure issues and ensure uptime
- Provide tooling, documentation, and support to accelerate internal engineering productivity
What we're looking for
- Strong generalist with experience managing large-scale, high-performance infrastructure
- Skilled in designing scalable systems for compute, data, and developer tooling
- Comfortable in high-urgency environments with the ability to prioritize for impact
- Familiar with infrastructure stacks for AI model training and experimentation
- Experienced with Kubernetes, Terraform/Pulumi, Spark/Ray, and observability tools
- Pragmatic problem-solver who favors automation and simplicity over complexity
- Open to using and contributing to open-source tooling when appropriate
- Bonus: experience as a Cluster Engineer, Data Engineer, or Developer Advocate in AI/ML environments
What we offer (compensation & benefits)
- Competitive salary and equity
- Private health coverage
- Pension contribution
- Unlimited paid vacation
- Fully-distributed, async-first culture
- Hardware setup of your choice
- Stipends for phone, internet, and meals
On our team, we approach our work with dedication similar to that of Olympic athletes. Anticipate occasional late nights and weekends dedicated to our mission. We understand this level of commitment may not suit everyone, and we openly communicate this expectation.
If you're motivated by deeply technical problems, a seemingly never-ending uphill battle and the opportunity to build (and own) a generational technology company, we can give you what you're looking for.
All business roles at Moonvalley are hybrid positions by default, with some fully remote depending on the job scope. We meet a few times every year, usually in London, UK or North America (LA, Toronto) as a company.
If you're excited about the opportunity to work on cutting-edge AI technology and help shape the future of media and entertainment, we encourage you to apply. We look forward to hearing from you.
The statements contained in this job description reflect general details as necessary to describe the principal functions of this job, the level of knowledge and skill typically required, and the scope of responsibility. It should not be considered an all-inclusive listing of work requirements. Individuals may perform other duties as assigned, including work in other functional areas to cover absences, to equalize peak work periods, or to otherwise balance organizational work.
Moonvalley AI is proud to be an equal opportunity employer. We are committed to providing accommodations. If you require accommodation, we will work with you to meet your needs.
Please be assured we'll treat any information you share with us with the utmost care, only use your information for recruitment purposes and will never sell it to other companies for marketing purposes. Please review our privacy policy and job applicant privacy policy located here for further information.
Infrastructure Engineer
Posted today
Job Description
Key Responsibilities
- Collaborate with cross-functional teams and vendors on infrastructure design.
- Site preparation, system design, installation, acceptance, and documentation for customer projects.
- Apply fixes, patches, and system updates as necessary.
- Troubleshoot issues and provide technical support to ensure project success.
- Communicate effectively with customers to ensure customer satisfaction.
Requirements
- Bachelor's degree in Information Technology, Computer Science, or a related field.
- At least 5 years of hands-on experience in server virtualization, storage, and backup.
- Expertise with VMware vSphere, vSAN, or Microsoft Hyper-V. Experience with Chinese-based hypervisors will be an advantage.
- Expertise with NetApp, Dell, or HPE storage.
- Expertise with Veeam or Veritas NetBackup backup software. Experience with Chinese-based backup software will be an advantage.
- Problem-solving skills with analytical thinking.
- Enthusiastic about Chinese technologies.
- Proficient in Cantonese, English and Mandarin.
- Excellent written English and Chinese.
Infrastructure Engineer
Posted today
Job Description
- Degree holder in Computer Science, Information Technology or relevant discipline;
- Minimum of 3 years of IT-related work experience (more than 5 years is preferable);
- Familiar with network, TCP/IP and Oracle database;
- Familiar with Huawei cloud computing, with experience in the installation and troubleshooting of OpenStack, ManageOne, FusionCompute, FusionAccess, FusionSphere, etc.;
- Familiar with Huawei V3 storage, OceanStor T V2 series storage, and SNS series storage devices; familiar with the installation and troubleshooting of RH series and E9000 series servers; candidates who have delivered active-active storage and disaster recovery solutions are preferred;
- Familiar with the installation and troubleshooting of servers, switches, firewalls, and other equipment;
- Applicants with HCIE-Cloud and HCIE-Storage certifications are preferred;
- Excellent problem-solving and strong analytical skills;
- Good command of written and spoken English/Cantonese/Mandarin.
Infrastructure Engineer
Posted today
Job Description
BancLogix System Co., Limited is an associate company of KVB Group, an international financial services corporation with operations or offices in Auckland, Sydney, Melbourne, Hong Kong, Taiwan, Toronto and China.
In light of our rapid business expansion, we would like to invite professional candidates to apply for this position based in our Hong Kong office.
Infrastructure Engineer (Network and System) - IT Operations
Responsibilities
- Responsible for setup, maintenance, support and troubleshooting of IT-related hardware & network.
- Design, set up, maintain, and support WAN-based network infrastructure between all internal & customer sites.
- Provide system support on companywide systems, such as Windows Servers, Database servers, anti-virus systems, email servers, networking, etc.
- Perform daily system administration duties and housekeeping on companywide data & IT resources.
- Assist in providing technical support and solution design for internal & customer projects.
- Assist in writing documentation on networking & system setup.
Requirements
- Bachelor's degree in Computer Science or equivalent
- Solid experience in network & system integration and support.
- Technical knowledge of Linux, Windows Server, Microsoft SQL Server, and VMware/vSphere
- AWS experience will be a plus
- 2+ years of relevant working experience
- Self-motivated and independent, with strong problem-solving & analytical skills
We offer a 5-day workweek and an attractive package for the right candidate. If you would like to take on the challenge, stretch your abilities, and become part of our team, please send your CV with your current and expected salary.
For further information, please visit our website:
Information provided will be treated in the strictest confidence and used only for recruitment-related purposes. All applications will be destroyed after 6 months. Only shortlisted candidates will be contacted.
Cloud Infrastructure Engineer
Posted today
Job Description
Job Description:
- Assist in the implementation of a centralized IT platform that incorporates DevOps tools and supports automation for deployment, operation, and monitoring of IT Systems hosted on Government Cloud Infrastructure Services and on premises
- Configure network and server infrastructure in the Government Cloud Infrastructure Services (GCIS) environment and on premises
- Perform ongoing IT support and maintenance activities, including script development, platform configuration, unit and integration testing, and preparation of test plans, specifications, guidelines, etc.
- Take up other IT system works as assigned by the supervisors
Requirement Details:
- Higher Diploma or above in IT related discipline
- At least 3 years' experience in IT infrastructure
- Possess strong technical knowledge in various DevOps tools, cloud infrastructure and networking
- Shall have solid experience in Unix/Linux Shell Scripting
- Shall be familiar with automation of deployment, operation, monitoring and logging for IT Systems
- Shall have solid hands-on experience in applying DevOps lifecycle to IT Systems using tools and scripts
- Experience in Government projects
- Shall work independently and have good spoken and written communication skills
If interested in the above post, please send your full resume with academic background, work history, and current and expected salary.
For more job opportunities, please visit our website:
The personal information collected is strictly for recruitment purposes only.
Cloud Infrastructure Engineer
Posted today
Job Description
We are a multi-cloud managed service provider supporting our customers in their ongoing digital transformation journey. We help our customers leverage their existing infrastructure more effectively and provide professional technical advice to deploy, maintain, and innovate solutions using the latest cloud technologies.
- Provide technical support to customers as well as enhance, maintain, and troubleshoot any related technical services
- Ensure smooth project delivery that aligns with service level agreements
- Manage customers' cloud infrastructure environment in accordance with the company's cloud strategy, policies, best practices, and capabilities
- Work closely with customers' IT divisions and perform duties including but not limited to monitoring, patching, provisioning, troubleshooting, incident handling, and deployment
- Act as Technical Account representative for dedicated clients
IT Infrastructure Engineer
Posted today
Job Description
Responsibilities
- Design and implement IT infrastructure solutions that meet organizational needs.
- Manage and maintain networks and servers, ensuring optimal performance and security.
- Monitor system performance, conduct OS hardening, and troubleshoot issues as they arise.
- Collaborate with other IT teams to integrate infrastructure with applications and services.
- Implement patch management and ensure compliance with IT security policies.
- Document infrastructure processes, configurations, and changes.
- Utilize Bash and PowerShell scripting for automation and system management.
- Manage virtualization, container software, and storage solutions.
- Oversee system monitoring and log management to ensure system health.
- Assist in project management and incident management processes.
Requirements
- Tertiary education or above in Computing or a related field.
- Minimum of 4 years of experience in IT infrastructure engineering or a similar role.
- Strong knowledge of FortiGate firewalls and Cisco networking technologies.
- Experience with OS hardening, IT security measures, and patch management.
- Proficient in MS SQL and MySQL database management.
- Knowledge of Bash and PowerShell scripting.
- Familiarity with virtualization technologies (e.g. ESXi, Hyper-V), container software, and storage management.
- Strong skills in system monitoring and log management.
- Project management and incident management experience.
- Relevant professional certifications are preferred.
- Strong problem-solving and analytical skills.
- Excellent communication and collaboration abilities.
- Proficient in spoken and written English and Chinese.