Computation on the PID graph with graphQL queries (available)

Introduction Persistent Identifiers (PIDs) are a mechanism to provide persistent identification to entities which cannot be guarenteed by other identifiers such as a URL. The most well known of these are DOI’s (https://www.doi.org/) which typically identify published articles, but a wide variety of other identifiers exist, such as ORCiD’s to identify individuals.┬áPIDs are not only … full description “Computation on the PID graph with graphQL queries (available)”

Data Stewardship (completed)

Starting Date: June 2020 Duration: 5 weeks (10 weeks part-time) Time commitment: Full time/Part time Prerequisites: understanding of databases and formats such as JSON; ability to interview and liaise non-experts; ability to write reports. Approximately 80% of the time that a Data Scientist spends on a day to day basis is on finding relevant data … full description “Data Stewardship (completed)”

Interactive Visualisation of Disentangled Representations (available)

This project aims to develop an interactive visualisation toolkit based on existing technologies (IPython & Plotly) that will assist researchers in debugging and understanding complex models in the area of representation learning. Representation learning is a sub-field of machine learning that focuses on developing techniques for representing objects that exist in high-dimensional space (e.g. faces … full description “Interactive Visualisation of Disentangled Representations (available)”

Jupyter notebooks (available)

Starting Date: June 2020 Duration: 10 weeks Time commitment: Full time Prerequisites: experience with Python (useful) and Javascript programming (essential). It may be useful to be willing to learn about functional programming (but this is not essential). Jupyter notebooks [1] are examples of literate programming [2] where code and outputs from the code as well … full description “Jupyter notebooks (available)”

Metagenomics pipeline (available)

Starting Date: June 2020 Duration: 10 weeks Time commitment: Full time Prerequisites: experience with Python programming or workflow software (desirable), experience of querying web databases using RESTful interfaces. Metagenomics is the genomic sampling of environments (e.g. soil, sea water the human microbiome) which are composed of an unknown range of different species (usually bacteria and … full description “Metagenomics pipeline (available)”

Workflow Description Language frontend (completed)

Starting Date: June 2019 Duration: 10 weeks Time commitment: Full time Prerequisites: experience with Python or Java programming (essential), experience of using container software such as Docker and deploying applications on clouds. The Workflow Description Language [1] (WDL – pronounced ‘widdle’) is a scripting language designed to build Scientific workflows (specifically for Bioinformatics applications). WDL … full description “Workflow Description Language frontend (completed)”