Springer Nature opens the doors to discovery for researchers, educators, clinicians and other professionals. Every day, around the globe, our imprints, books, journals, platforms and technology solutions reach millions of people.
For over 175 years our brands and imprints have been a trusted source of knowledge to these communities and today, more than ever, we see it as our responsibility to ensure that fundamental knowledge can be found, verified, understood and used by our communities enabling them to improve outcomes, make progress, and benefit the generations that follow.
Springer Nature is one of the world’s leading global research, educational and professional publishers. It is home to an array of respected and trusted brands and imprints, with more than 170 years of combined history behind them, providing quality content through a range of innovative products and services.
Every day, around the globe, our imprints, books, journals and resources reach millions of people, helping researchers and scientists to discover, students to learn and professionals to achieve their goals and ambitions.
The company has almost 13,000 staff in over 50 countries.
We’re looking for a Data Engineer to join Data.SN within Springer Nature Operations. Springer Nature is a leading publisher of scientific books, journals and magazines with over 3000 journal titles and one of the world’s largest corpora of peer-reviewed scientific text data.
You would be joining a new programme of work to transform how Springer Nature uses its data : building up data capabilities, creating a data platform and engineering capability (technology, people and process) to create a foundation for the future, adding value to cross-organisation Initiatives and kick-starting data-driven Innovation.
The job is based London (UK) or in Lisbon (Portugal) and you will work remotely at times with colleagues in many of our global offices including, Berlin (Germany), or in Heidelberg (Germany).
Some travel will be required. This role is on a small, autonomous team and you will be expected to impact what we do and how we work.
We like to keep our processes light, and bureaucracy slim.
Across the programme, our teams are cross-functional, diverse and made up of different experience levels. All team members collaborate to deliver the best solutions that satisfy our customers’ needs.
as well as regular lunch n’ learn sessions to share knowledge.
You have several years of experience in Data / Software Engineering on a cloud platform.
You like working in a collaborative team where there is collective ownership.
You enjoy getting involved with every stage of the engineering lifecycle.
Have an understanding of data and distributed systems concepts.
You understand the benefits agile software engineering practises can bring to data engineering.
For examples :
You understand the benefits of Test Driven Development and automation.
You are comfortable pair programming and practising continuous integration and continuous delivery.
You see the value in developers owning production software and view failure as a chance to learn.
What you will be doing
Within 3 Months you will :
Get familiar with our emerging technology stack and data landscape.
Align yourself with the work of the data platform team and understand the data requirements and issues facing our users.
Collaborate effectively with each discipline on the team.
Actively participate in technical discussions and share ideas.
Work with architects and other data engineers in the organisation to align the data processing architecture
By 3-6 months you will :
Have an understanding of the team’s context within the wider organisation.
Be a supportive member of the team, developing the platform by using the appropriate technology solutions to solve the problem at hand.
Triage support queries and diagnose issues in our live applications.
Identify new sources of data across the organisation and build relationships with data providers to gain access.
Understand the processes by which data is acquired and any resulting limitations or bias and communicate this to the team.
Develop and maintain data pipelines to load data into systems like BigQuery, to analyse, clean and join datasets, in an automated, repeatable way.
Ensure that data is stored securely and in compliance with GDPR.
Work with data owners to understand how we can allow them to self-serve their data using tools we develop.
By 6-12 months you will :
Develop processes and tools to monitor feeds and test data integrity and completeness and to alert users when a problem occurs.
Understand our customers’ needs, both internal and external, and how your work affects their experience.
Able to gauge the complexity or scope of a piece of work, breaking it into smaller pieces when appropriate.
Give and receive constructive feedback within your team.
Mentor other members of the team in the principles of data engineering and promote best practice.
Promote and advocate the use of data across Springer Nature.
If you have an interest in data science there may be opportunities to apply machine learning techniques to these datasets to assist in the work of domain teams.
Day to day responsibilities
As part of an Agile product team, day-to-day you will :
Take part in our daily stand-ups.
Contribute to ceremonies like steering, story writing, collaborative design and retrospectives.
Develop new features and improve code quality by pair programming with other team members.
Take part in the support and monitoring of our services.