Quickstart¶
This quickstart shows how to compute FAIRsoft scores for multiple GitHub repositories using the Software Observatory APIs.
Jupyter notebook
Download the Jupyter notebook of this quickstart here
Requirements¶
Install the required Python packages:
```shell
pip install requests pandas
```
You will also need a GitHub personal access token with read permissions for repositories.
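Rather than pasting the token into the script, you can read it from an environment variable. This is a minimal sketch; `GITHUB_TOKEN` is an assumed variable name, not something required by the APIs:

```python
import os

# Read the GitHub token from the environment instead of hardcoding it.
# GITHUB_TOKEN is an assumed name; export it in your shell first, e.g.:
#   export GITHUB_TOKEN="ghp_..."
token = os.environ.get("GITHUB_TOKEN", "")
```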
Example script¶
```python
import requests
import pandas as pd

token = "YOUR_GITHUB_TOKEN"

repositories = {
    "Substra": "https://github.com/Substra/substrafl",
    "Flower": "https://github.com/adap/flower",
    "Vantage6": "https://github.com/vantage6/vantage6",
}

GITHUB_METADATA_API = "https://observatory.openebench.bsc.es/github-metadata-api/metadata/user"
OBSERVATORY_API = "https://observatory.openebench.bsc.es/api/fair/evaluate"


def get_repository_metadata(owner, repo):
    """Retrieve repository metadata from the GitHub Metadata API."""
    payload = {
        "owner": owner,
        "repo": repo,
        "userToken": token,
        "prepare": False,
    }
    r = requests.post(GITHUB_METADATA_API, json=payload)
    r.raise_for_status()
    return r.json()["data"]


def compute_fairsoft(metadata):
    """Send tool metadata to the FAIRsoft evaluation API and return the scores."""
    r = requests.post(
        OBSERVATORY_API,
        json={"tool_metadata": metadata, "prepare": False},
    )
    r.raise_for_status()
    return r.json()


results = []
for tool, url in repositories.items():
    owner = url.split("/")[-2]
    repo = url.split("/")[-1]

    metadata = get_repository_metadata(owner, repo)
    # Add the minimal metadata fields required for the evaluation
    metadata["type"] = "lib"
    metadata["version_control"] = True

    evaluation = compute_fairsoft(metadata)
    result = evaluation["result"]
    results.append({
        "tool": tool,
        "F": result["F"],
        "A": result["A"],
        "I": result["I"],
        "R": result["R"],
    })

df = pd.DataFrame(results)
print(df)
```
Example output:

```text
       tool     F     A     I     R
0   Substra  0.82  0.75  0.61  0.79
1    Flower  0.91  0.80  0.73  0.84
2  Vantage6  0.77  0.69  0.66  0.81
```
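Once the scores are in a DataFrame, pandas makes it easy to post-process them. The sketch below ranks the tools by an unweighted mean of the four dimensions and saves the table to CSV; note that this simple mean is our own illustration, not an official FAIRsoft aggregate score:

```python
import pandas as pd

# Scores as produced by the script above (hardcoded here for illustration)
results = [
    {"tool": "Substra", "F": 0.82, "A": 0.75, "I": 0.61, "R": 0.79},
    {"tool": "Flower", "F": 0.91, "A": 0.80, "I": 0.73, "R": 0.84},
    {"tool": "Vantage6", "F": 0.77, "A": 0.69, "I": 0.66, "R": 0.81},
]
df = pd.DataFrame(results)

# Rank tools by the unweighted mean of the four FAIR dimensions
# (an illustrative aggregate, not defined by FAIRsoft itself)
df["mean"] = df[["F", "A", "I", "R"]].mean(axis=1)
df = df.sort_values("mean", ascending=False)

df.to_csv("fairsoft_scores.csv", index=False)
```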
What the script does¶
- Retrieves metadata from GitHub using the GitHub Metadata API
- Adds minimal required metadata
- Sends the metadata to the FAIRsoft evaluation API
- Collects the resulting FAIR scores
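The script splits the owner and repository name out of each URL with plain string indexing, which breaks on trailing slashes or `.git` suffixes. A slightly more robust helper, shown here as an illustrative sketch (not part of the Observatory APIs), could look like this:

```python
from urllib.parse import urlparse

def split_github_url(url):
    """Extract (owner, repo) from a GitHub repository URL.

    Handles trailing slashes and an optional ".git" suffix.
    Illustrative helper; not part of the Observatory APIs.
    """
    path = urlparse(url).path.strip("/")
    owner, repo = path.split("/")[:2]
    return owner, repo.removesuffix(".git")

print(split_github_url("https://github.com/adap/flower/"))  # → ('adap', 'flower')
```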
Next step¶
To better understand how the metadata extraction and evaluation work, continue with: