Job Summary:
Onehouse is a mission-driven company dedicated to freeing data from data platform lock-in. The Software Engineer, Distributed Data Systems will be responsible for building and optimizing the next generation of the data tech stack, working closely with Apache Hudi's transactional engine and contributing to the development of a scalable data infrastructure.
Responsibilities:
• As a foundational member of the Data Infrastructure team, you will productionize the next generation of our data tech stack by building the software and data features that actually process all of the data we ingest.
• Accelerate our open source enterprise flywheel by working on the guts of Apache Hudi's transactional engine and optimizing it for diverse Onehouse customer workloads.
• Act as a SME to deepen our teams' expertise on database internals, query engines, storage and/or stream processing.
• Design new concurrency control and transactional capabilities that maximize throughput for competing writers.
• Design and implement new indexing schemes, specifically optimized for incremental data processing and analytical query performance.
• Design systems that help scale and streamline metadata and data access from different query/compute engines.
• Solve hard optimization problems to improve the efficiency (increase performance and lower cost) of distributed data processing algorithms over a Kubernetes cluster.
• Leverage data from existing systems to find inefficiencies, and quickly build and validate prototypes.
• Collaborate with other engineers to implement and deploy, safely rollout the optimized solutions in production.
Qualifications:
Required:
• Strong, object-oriented design and coding skills (Java and/or C/C++ preferably on a UNIX or Linux platform).
• Experience with inner workings of distributed (multi-tiered) systems, algorithms, and relational databases.
• You embrace ambiguous/undefined problems with an ability to think abstractly and articulate technical challenges and solutions.
• An ability to prioritize across feature development and tech debt with urgency and speed.
• An ability to solve complex programming/optimization problems.
• An ability to quickly prototype optimization solutions and analyze large/complex data.
• Robust and clear communication skills.
Preferred:
• Experience working with database systems, Query Engines or Spark codebases.
• Experience in optimization mathematics (linear programming, nonlinear optimization).
• Existing publications of optimizing large-scale data systems in top-tier distributed system conferences.
• PhD degree with 2+ years industry experience in solving and delivering high-impact optimization projects.
Company:
Onehouse is a cloud-native managed lakehouse service that aims to improve data lake time-to-value. Founded in 2021, the company is headquartered in Menlo Park, California, USA, with a team of 51-200 employees. The company is currently Growth Stage. Onehouse has a track record of offering H1B sponsorships.
(Physician/MD qualifications required) Hematology and Oncology - Hematologist|Medical Oncologist Opportunity Manhattan: NY Our client is seeking Board Certified|Board Eligible Hematologist|Medical Oncologist to join our large, well-established multi-specialty...
...website at . Job Summary: The Southwest Healthcare Regional office in Temecula, CA is seeking a Full-Time RN Central Utilization Review Nurse who will be responsible for carrying out utilization management functions by planning, coordinating, and managing patient...
Description of the role:The PCA/CNA - Personal Care Assistant at HomeWell of Anchorage provides essential personal care and support to clients in their homes. This role is crucial in ensuring the well-being and comfort of individuals who require assistance with daily...
...Caring Senior Service of El Paso is Hiring Caregivers on the Westside of El Paso!! Are you a compassionate individual who enjoys providing... ...Do you have a heart for helping seniors, including those with dementia, and a desire to make a positive impact in their lives? If so,...
RF Systems EngineerJob DescriptionWe are seeking a highly skilled RF Systems Engineer to join our team. The ideal candidate will work closely with clients to understand their needs, design customized RF measurement systems, and ensure successful installation and operation...