PhD studentship in Declarative Data Pipelines

Deadline for applications: 15th of April (or until the position if filled)

One fully funded PhD position to work with Dr Milos Nikolic in the School of Informatics at the University of Edinburgh, on a project titled “Declarative Data Pipelines”.

The project aims to address the challenges faced by organizations in managing complex data pipelines for their ERP systems. By developing a domain-specific language (DSL) and a compilation framework, the project seeks to simplify pipeline development, improve efficiency, and ensure data correctness and timeliness.

Key Objectives

  • DSL Development: Develop an intuitive and expressive DSL for expressing data processing logic and constraints in a declarative manner.
  • Compilation Framework: Create a compilation framework to translate DSL specifications into executable scripts compatible with different processing engines.
  • Automatic Orchestration: Design a framework for automatic orchestration of processing pipelines, ensuring data correctness and timeliness without user intervention.
  • Incremental Processing: Enhance pipeline efficiency through incremental processing techniques, reducing re-computation on large datasets

Candidate’s profile

  • Strong background in computer science, with a focus on data management, distributed systems, machine learning, or related fields. Previous coursework or experience (e.g., thesis) in one of these areas is desirable.
  • Proficiency in programming languages such as Python, Java, or Scala.
  • A good Bachelor’s Hons degree (2.1 or above or international equivalent) and/or Master’s degree in a relevant subject.
  • Proficiency in English (both oral and written)
  • Prior research experience with databases and data processing frameworks is highly desirable.

Studentship and eligibility

The studentship covers:

  • Full time PhD tuition fees for a student with a Home fee status (£4,712 per annum) or overseas fee status (£31,500 per annum)
  • A tax free stipend of £19,237 per year for 3.5 years.
  • Additional programme costs of £1,000 per year

Application Information

For informal enquiries about the position, please contact Dr Milos Nikolic with a short summary of your background and research interests in the technical themes mentioned above.

Email Milos Nikolic

Applicants should apply via the University’s admissions portal (EUCLID) and apply for the following programme: Informatics: LFCS: Theory and Foundations of Computer Science, Databases, Software and Systems Modelling with a start date of 01 September 2024.

Applicants should state “PhD in Declarative Data Pipelines” and the research supervisor (Dr Milos Nikolic) in their application and Research Proposal document.

Complete applications submitted by 15 April 2024 will receive full consideration; after that date applications will be considered until the position is filled. The anticipated start date is 01 September 2024.

Applicants must submit:

  • All degree transcripts and certificates (and certified translations if applicable).
  • Evidence of English Language capability (where applicable).
  • A short research proposal (max 2 pages).
  • A full CV and cover letter describing your background, suitability for the PhD, and research interests (max 2 pages).
  • Two references (note that it the applicant’s responsibility to ensure reference letters are received before the deadline).

Only complete applications (i.e. those that are not missing the above documentation) will progress forward to Academic Selectors for further consideration.

Environment

The School of Informatics is one of the largest in Europe and currently the top Informatics institute in the UK for research power, with 40% of its research outputs considered world-leading (top grade), and almost 50% considered top grade for societal impact. The University of Edinburgh is constantly ranked among the world’s top universities and is a highly international environment with several centres of excellence.