Are you interested in software engineering missions and the fantastic challenges of data engineering at scale? It could be you!
We understand you may not know everything, so we are looking for an ambitious individual willing to learn, and we’ll grow together. From our experience, we can say there is no better spot to learn than an early-stage startup with a great tech foundation.
The job
As a data engineer intern, you will :
Design, implement, and automate deployment of our system for collecting and processing log events from multiple sources (browser extensions, APIs, SSO connections).
Design data schema and operate internal data warehouses and SQL/NoSQL database systems.
Manage, improve and operate internal S3 Datalake as well as its structure.
Use Artificial intelligence and Large Language models to extract patterns and knowledge from vast data sets.
Own the design, development, and maintenance of ongoing metrics, reports, analyses, and dashboards that engineers, analysts, and data scientists use to drive key business decisions.
Monitor and troubleshoot operational or data issues in the data pipelines.
Drive architectural plans and implementation for future data storage, reporting, and analytic solutions.
Develop code based automated data pipelines able to process millions of data points.
Improve datalake and data warehouse performance by tuning inefficient queries.
Anticipate and implement solutions to ensure scalability of our Infrastructure and reasonable costs.
Work collaboratively with Product and Business teams to identify opportunities/problems.
Provide assistance to the team with troubleshooting, researching the root cause, and thoroughly resolving defects in the event of a problem.