Senior Data Engineer (Python)
Datasite
il y a 19 jours
Date de publicationil y a 19 jours
S/O
Niveau d'expérienceS/O
Temps pleinType de contrat
Temps pleinDonnées / Big dataCatégorie d'emploi
Données / Big dataDatasite is where deals are made. We provide the data rooms and SaaS technology used in M&A and other high-value transactions, to deliver projects in more than 170 countries. Carrying that success into the future is all about you. Your useful skills, your unusual experience, your unique ideas. Everyone here brings something unexpected. What's yours? Invest your talents in us, and we'll return the compliment.
Job Description:
The kind of data engineer we are looking for is a software engineer first, with a strong focus (at least initially) on Python programming and scalability.
Primary responsibilities
Write Python pipelines for semantic processing (NLP) and data augmentation in general:
- Vectorisation (embeddings) to/from MongoDB/Pinecone
- Named Entity Recognition (NER) to/from MongoDB
- Elasticsearch indexing
Write Python transformation pipelines for derived data around companies
- Compute insightful data points from raw company data
- Maintain Python framework for data point computation
Level up the technical capabilities of your team, esp. junior teammates
Contribute to task breakdown and phasing with Tech Lead
Opportunities
Implement cloud-based data acquisition/ETL pipelines (esp. with Airflow, DBT, Snowflake)
Expand web scraping capabilities (Pub/Sub, GCS, CloudRun)
Career path
Move to a Tech Lead role as the tech organisation grows.
Skillset
5+ years developing scalable industry-ready software applications in Python
3+ years implementing data processing pipelines/ETLs
2+ years working with advanced MongoDB and SQL + Elasticsearch ideally
Solid understanding of computing scalability (multiprocessing/threading, distributed computed)
Some hands-on experience with common/modern data frameworks (esp. GCP, DBT, Snowflake)
Outstanding problem solving skills for performance optimisation
Who you might be
It is hard to set expectations with respect to soft skills, as everybody may contribute in their own special way.
Product-driven. You might feel your professional interests revolve around building complex systems thanks to advanced design patterns. We rather feel strongly about achieving product goals. This might involve taking pragmatic shortcuts, and even possibly writing some inelegant code along the way. That is just fine sometimes - as long as we factor in debt.
Socially open. We are neither the nerdy types nor some cold-blooded serial coders. We often build software making creative (and possibly weird) analogies with music, philosophy or football, and come up with solutions outside of the strict confines of our desks.
Taking responsibility for making things happen. This is about self-motivation, being forward thinking and embracing accountability for pushing the Product into the right direction.
Humble. Everybody knows very little. However, we can get really good in certain areas: we believe in the so-called growth mindset, meaning everybody can learn pretty much everything. This makes us nonetheless humble in our attitude with others.
Curious. We spend a lot of time at the office around our fellow co-workers. We like to share our passions (travel, food, books, sports, etc. etc.) and interact beyond tech.
As a global organization, Datasite knows that diverse perspectives are essential to our success. We're committed to maintaining a diverse workforce to serve our customers around the world. Datasite is an equal opportunity employer (EEO) and furthers the principles of EEO through Affirmative Action.
Job Description:
The kind of data engineer we are looking for is a software engineer first, with a strong focus (at least initially) on Python programming and scalability.
Primary responsibilities
Write Python pipelines for semantic processing (NLP) and data augmentation in general:
- Vectorisation (embeddings) to/from MongoDB/Pinecone
- Named Entity Recognition (NER) to/from MongoDB
- Elasticsearch indexing
Write Python transformation pipelines for derived data around companies
- Compute insightful data points from raw company data
- Maintain Python framework for data point computation
Level up the technical capabilities of your team, esp. junior teammates
Contribute to task breakdown and phasing with Tech Lead
Opportunities
Implement cloud-based data acquisition/ETL pipelines (esp. with Airflow, DBT, Snowflake)
Expand web scraping capabilities (Pub/Sub, GCS, CloudRun)
Career path
Move to a Tech Lead role as the tech organisation grows.
Skillset
5+ years developing scalable industry-ready software applications in Python
3+ years implementing data processing pipelines/ETLs
2+ years working with advanced MongoDB and SQL + Elasticsearch ideally
Solid understanding of computing scalability (multiprocessing/threading, distributed computed)
Some hands-on experience with common/modern data frameworks (esp. GCP, DBT, Snowflake)
Outstanding problem solving skills for performance optimisation
Who you might be
It is hard to set expectations with respect to soft skills, as everybody may contribute in their own special way.
Product-driven. You might feel your professional interests revolve around building complex systems thanks to advanced design patterns. We rather feel strongly about achieving product goals. This might involve taking pragmatic shortcuts, and even possibly writing some inelegant code along the way. That is just fine sometimes - as long as we factor in debt.
Socially open. We are neither the nerdy types nor some cold-blooded serial coders. We often build software making creative (and possibly weird) analogies with music, philosophy or football, and come up with solutions outside of the strict confines of our desks.
Taking responsibility for making things happen. This is about self-motivation, being forward thinking and embracing accountability for pushing the Product into the right direction.
Humble. Everybody knows very little. However, we can get really good in certain areas: we believe in the so-called growth mindset, meaning everybody can learn pretty much everything. This makes us nonetheless humble in our attitude with others.
Curious. We spend a lot of time at the office around our fellow co-workers. We like to share our passions (travel, food, books, sports, etc. etc.) and interact beyond tech.
As a global organization, Datasite knows that diverse perspectives are essential to our success. We're committed to maintaining a diverse workforce to serve our customers around the world. Datasite is an equal opportunity employer (EEO) and furthers the principles of EEO through Affirmative Action.
RÉSUMÉ DE L' OFFRE
Senior Data Engineer (Python)Datasite
Paris
il y a 19 jours
S/O
Temps plein