Page MenuHomec4science

10_oacct_terms.md
No OneTemporary

File Metadata

Created
Mon, May 20, 22:00

10_oacct_terms.md

This file is larger than 256 KB, so syntax highlighting was skipped.
# Projet Open Access Compliance Check Tool (OACCT)
Projet P5 de la bibliothèque de l'EPFL en collaboration avec les bibliothèques des Universités de Genève, Lausanne et Berne : https://www.swissuniversities.ch/themen/digitalisierung/p-5-wissenschaftliche-information/projekte/swiss-mooc-service-1-1-1-1
Ce notebook permet de modifier les données extraites des differentes sources et les exporter dans les tables de l'application OACCT.
Auteur : **Pablo Iriarte**, Université de Genève (pablo.iriarte@unige.ch)
Date de dernière mise à jour : 08.09.2021
```python
import pandas as pd
import csv
import json
import numpy as np
import os
# afficher toutes les colonnes
pd.set_option('display.max_columns', None)
# definir le debut des ids
id_start = 1
```
## Import du fichier extrait de Sherpa
```python
sherpa = pd.read_csv('sample/sherpa_policies_brut.tsv', encoding='utf-8', header=0, sep='\t')
sherpa
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>journal</th>
<th>issn</th>
<th>sherpa_id</th>
<th>sherpa_uri</th>
<th>open_access_prohibited</th>
<th>additional_oa_fee</th>
<th>article_version</th>
<th>license</th>
<th>embargo</th>
<th>prerequisites</th>
<th>prerequisite_funders</th>
<th>prerequisite_funders_name</th>
<th>prerequisite_funders_fundref</th>
<th>prerequisite_funders_ror</th>
<th>prerequisite_funders_country</th>
<th>prerequisite_funders_url</th>
<th>prerequisite_funders_sherpa_id</th>
<th>prerequisite_subjects</th>
<th>location</th>
<th>locations_ir</th>
<th>locations_not_ir</th>
<th>named_repository</th>
<th>named_academic_social_network</th>
<th>copyright_owner</th>
<th>publisher_deposit</th>
<th>archiving</th>
<th>conditions</th>
<th>public_notes</th>
<th>id</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/2050</td>
<td>no</td>
<td>no</td>
<td>submitted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; named_repository ; non_comm...</td>
<td>Non-Commercial Institutional Repository</td>
<td>Author's Homepage ; arXiv ; AgEcon ; PhilPaper...</td>
<td>arXiv ; AgEcon ; PhilPapers ; PubMed Central ;...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must acknowledge acceptance for publication ; ...</td>
<td>NaN</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/2050</td>
<td>no</td>
<td>no</td>
<td>accepted</td>
<td>NaN</td>
<td>12</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; named_repository ; non_comm...</td>
<td>Non-Commercial Institutional Repository</td>
<td>Author's Homepage ; arXiv ; AgEcon ; PhilPaper...</td>
<td>arXiv ; AgEcon ; PhilPapers ; PubMed Central ;...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Publisher source must be acknowledged with cit...</td>
<td>NaN</td>
<td>2</td>
</tr>
<tr>
<td>2</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/3315</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_website ; institutional_repository ; named...</td>
<td>Any Website ; Institutional Repository</td>
<td>PubMed Central ; Subject Repository ; Journal ...</td>
<td>PubMed Central</td>
<td>NaN</td>
<td>authors</td>
<td>disciplinary (PubMed Central) ;</td>
<td>True</td>
<td>Published source must be acknowledged</td>
<td>NaN</td>
<td>3</td>
</tr>
<tr>
<td>3</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/3315</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by_nc_nd</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_website ; named_repository ; non_commercia...</td>
<td>Any Website ; Non-Commercial Institutional Rep...</td>
<td>PubMed Central ; Non-Commercial Subject Reposi...</td>
<td>PubMed Central</td>
<td>NaN</td>
<td>authors</td>
<td>disciplinary (PubMed Central) ;</td>
<td>True</td>
<td>Published source must be acknowledged</td>
<td>NaN</td>
<td>4</td>
</tr>
<tr>
<td>4</td>
<td>498</td>
<td>0001-4842</td>
<td>7760</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/4</td>
<td>no</td>
<td>no</td>
<td>submitted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>named_repository ; preprint_repository ; subje...</td>
<td>NaN</td>
<td>ChemRxiv ; bioRxiv ; arXiv ; Preprint Reposito...</td>
<td>ChemRxiv ; bioRxiv ; arXiv</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>False</td>
<td>Must not violate ACS ethical Guidelines ; Must...</td>
<td>NaN</td>
<td>5</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>8590</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>no</td>
<td>submitted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; institutional_repository ; ...</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>8591</td>
</tr>
<tr>
<td>8591</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>no</td>
<td>accepted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; institutional_repository ; ...</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>8592</td>
</tr>
<tr>
<td>8592</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>no</td>
<td>published</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; institutional_repository ; ...</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>8593</td>
</tr>
<tr>
<td>8593</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_repository ; this_journal</td>
<td>Any Repository</td>
<td>Journal Website</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>NaN</td>
<td>NaN</td>
<td>8594</td>
</tr>
<tr>
<td>8594</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_repository ; this_journal</td>
<td>Any Repository</td>
<td>Journal Website</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>NaN</td>
<td>NaN</td>
<td>8595</td>
</tr>
</tbody>
</table>
<p>8595 rows × 29 columns</p>
</div>
```python
# test des valeurs pour les versions
sherpa['article_version'].value_counts()
```
published 4688
accepted 3251
submitted 656
Name: article_version, dtype: int64
```python
# test des valeurs pour les issns
sherpa.loc[sherpa['issn'].isna()]
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>journal</th>
<th>issn</th>
<th>sherpa_id</th>
<th>sherpa_uri</th>
<th>open_access_prohibited</th>
<th>additional_oa_fee</th>
<th>article_version</th>
<th>license</th>
<th>embargo</th>
<th>prerequisites</th>
<th>prerequisite_funders</th>
<th>prerequisite_funders_name</th>
<th>prerequisite_funders_fundref</th>
<th>prerequisite_funders_ror</th>
<th>prerequisite_funders_country</th>
<th>prerequisite_funders_url</th>
<th>prerequisite_funders_sherpa_id</th>
<th>prerequisite_subjects</th>
<th>location</th>
<th>locations_ir</th>
<th>locations_not_ir</th>
<th>named_repository</th>
<th>named_academic_social_network</th>
<th>copyright_owner</th>
<th>publisher_deposit</th>
<th>archiving</th>
<th>conditions</th>
<th>public_notes</th>
<th>id</th>
</tr>
</thead>
<tbody>
</tbody>
</table>
</div>
```python
# ajout des ISSN-L
issns = pd.read_csv('issn/20171102.ISSN-to-ISSN-L.txt', encoding='utf-8', header=0, sep='\t')
issns
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>ISSN</th>
<th>ISSN-L</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0000-0019</td>
<td>0000-0019</td>
</tr>
<tr>
<td>1</td>
<td>0000-0027</td>
<td>0000-0027</td>
</tr>
<tr>
<td>2</td>
<td>0000-0043</td>
<td>0000-0043</td>
</tr>
<tr>
<td>3</td>
<td>0000-0051</td>
<td>0000-0051</td>
</tr>
<tr>
<td>4</td>
<td>0000-006X</td>
<td>0000-006X</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>1995913</td>
<td>8756-9957</td>
<td>8756-9957</td>
</tr>
<tr>
<td>1995914</td>
<td>8756-9965</td>
<td>8756-9965</td>
</tr>
<tr>
<td>1995915</td>
<td>8756-9973</td>
<td>8756-9973</td>
</tr>
<tr>
<td>1995916</td>
<td>8756-9981</td>
<td>8756-9981</td>
</tr>
<tr>
<td>1995917</td>
<td>8756-999X</td>
<td>8756-999X</td>
</tr>
</tbody>
</table>
<p>1995918 rows × 2 columns</p>
</div>
```python
# renommer les colonnes
issns = issns.rename(columns={'ISSN' : 'issn', 'ISSN-L' : 'issnl'})
issns
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>issn</th>
<th>issnl</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0000-0019</td>
<td>0000-0019</td>
</tr>
<tr>
<td>1</td>
<td>0000-0027</td>
<td>0000-0027</td>
</tr>
<tr>
<td>2</td>
<td>0000-0043</td>
<td>0000-0043</td>
</tr>
<tr>
<td>3</td>
<td>0000-0051</td>
<td>0000-0051</td>
</tr>
<tr>
<td>4</td>
<td>0000-006X</td>
<td>0000-006X</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>1995913</td>
<td>8756-9957</td>
<td>8756-9957</td>
</tr>
<tr>
<td>1995914</td>
<td>8756-9965</td>
<td>8756-9965</td>
</tr>
<tr>
<td>1995915</td>
<td>8756-9973</td>
<td>8756-9973</td>
</tr>
<tr>
<td>1995916</td>
<td>8756-9981</td>
<td>8756-9981</td>
</tr>
<tr>
<td>1995917</td>
<td>8756-999X</td>
<td>8756-999X</td>
</tr>
</tbody>
</table>
<p>1995918 rows × 2 columns</p>
</div>
```python
# merge avec la table sherpa
sherpa = pd.merge(sherpa, issns, on='issn', how='left')
sherpa
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>journal</th>
<th>issn</th>
<th>sherpa_id</th>
<th>sherpa_uri</th>
<th>open_access_prohibited</th>
<th>additional_oa_fee</th>
<th>article_version</th>
<th>license</th>
<th>embargo</th>
<th>prerequisites</th>
<th>prerequisite_funders</th>
<th>prerequisite_funders_name</th>
<th>prerequisite_funders_fundref</th>
<th>prerequisite_funders_ror</th>
<th>prerequisite_funders_country</th>
<th>prerequisite_funders_url</th>
<th>prerequisite_funders_sherpa_id</th>
<th>prerequisite_subjects</th>
<th>location</th>
<th>locations_ir</th>
<th>locations_not_ir</th>
<th>named_repository</th>
<th>named_academic_social_network</th>
<th>copyright_owner</th>
<th>publisher_deposit</th>
<th>archiving</th>
<th>conditions</th>
<th>public_notes</th>
<th>id</th>
<th>issnl</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/2050</td>
<td>no</td>
<td>no</td>
<td>submitted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; named_repository ; non_comm...</td>
<td>Non-Commercial Institutional Repository</td>
<td>Author's Homepage ; arXiv ; AgEcon ; PhilPaper...</td>
<td>arXiv ; AgEcon ; PhilPapers ; PubMed Central ;...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must acknowledge acceptance for publication ; ...</td>
<td>NaN</td>
<td>1</td>
<td>0001-2815</td>
</tr>
<tr>
<td>1</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/2050</td>
<td>no</td>
<td>no</td>
<td>accepted</td>
<td>NaN</td>
<td>12</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; named_repository ; non_comm...</td>
<td>Non-Commercial Institutional Repository</td>
<td>Author's Homepage ; arXiv ; AgEcon ; PhilPaper...</td>
<td>arXiv ; AgEcon ; PhilPapers ; PubMed Central ;...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Publisher source must be acknowledged with cit...</td>
<td>NaN</td>
<td>2</td>
<td>0001-2815</td>
</tr>
<tr>
<td>2</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/3315</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_website ; institutional_repository ; named...</td>
<td>Any Website ; Institutional Repository</td>
<td>PubMed Central ; Subject Repository ; Journal ...</td>
<td>PubMed Central</td>
<td>NaN</td>
<td>authors</td>
<td>disciplinary (PubMed Central) ;</td>
<td>True</td>
<td>Published source must be acknowledged</td>
<td>NaN</td>
<td>3</td>
<td>0001-2815</td>
</tr>
<tr>
<td>3</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/3315</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by_nc_nd</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_website ; named_repository ; non_commercia...</td>
<td>Any Website ; Non-Commercial Institutional Rep...</td>
<td>PubMed Central ; Non-Commercial Subject Reposi...</td>
<td>PubMed Central</td>
<td>NaN</td>
<td>authors</td>
<td>disciplinary (PubMed Central) ;</td>
<td>True</td>
<td>Published source must be acknowledged</td>
<td>NaN</td>
<td>4</td>
<td>0001-2815</td>
</tr>
<tr>
<td>4</td>
<td>498</td>
<td>0001-4842</td>
<td>7760</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/4</td>
<td>no</td>
<td>no</td>
<td>submitted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>named_repository ; preprint_repository ; subje...</td>
<td>NaN</td>
<td>ChemRxiv ; bioRxiv ; arXiv ; Preprint Reposito...</td>
<td>ChemRxiv ; bioRxiv ; arXiv</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>False</td>
<td>Must not violate ACS ethical Guidelines ; Must...</td>
<td>NaN</td>
<td>5</td>
<td>0001-4842</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>8590</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>no</td>
<td>submitted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; institutional_repository ; ...</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>8591</td>
<td>2475-9953</td>
</tr>
<tr>
<td>8591</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>no</td>
<td>accepted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; institutional_repository ; ...</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>8592</td>
<td>2475-9953</td>
</tr>
<tr>
<td>8592</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>no</td>
<td>published</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; institutional_repository ; ...</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>8593</td>
<td>2475-9953</td>
</tr>
<tr>
<td>8593</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_repository ; this_journal</td>
<td>Any Repository</td>
<td>Journal Website</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>NaN</td>
<td>NaN</td>
<td>8594</td>
<td>2475-9953</td>
</tr>
<tr>
<td>8594</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_repository ; this_journal</td>
<td>Any Repository</td>
<td>Journal Website</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>NaN</td>
<td>NaN</td>
<td>8595</td>
<td>2475-9953</td>
</tr>
</tbody>
</table>
<p>8595 rows × 30 columns</p>
</div>
```python
# test des valeurs pour les issnl
sherpa.loc[sherpa['issnl'].isna()]
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>journal</th>
<th>issn</th>
<th>sherpa_id</th>
<th>sherpa_uri</th>
<th>open_access_prohibited</th>
<th>additional_oa_fee</th>
<th>article_version</th>
<th>license</th>
<th>embargo</th>
<th>prerequisites</th>
<th>prerequisite_funders</th>
<th>prerequisite_funders_name</th>
<th>prerequisite_funders_fundref</th>
<th>prerequisite_funders_ror</th>
<th>prerequisite_funders_country</th>
<th>prerequisite_funders_url</th>
<th>prerequisite_funders_sherpa_id</th>
<th>prerequisite_subjects</th>
<th>location</th>
<th>locations_ir</th>
<th>locations_not_ir</th>
<th>named_repository</th>
<th>named_academic_social_network</th>
<th>copyright_owner</th>
<th>publisher_deposit</th>
<th>archiving</th>
<th>conditions</th>
<th>public_notes</th>
<th>id</th>
<th>issnl</th>
</tr>
</thead>
<tbody>
</tbody>
</table>
</div>
```python
# extraction des données IR Archiving + Embargo par ISSN
sherpa_ir = sherpa[['issnl', ]]
```
## Import du fichier des licences Read & Publish
```python
rp = pd.read_csv('sample/read_publish_brut_merge.tsv', encoding='utf-8', header=0, sep='\t')
rp
```
C:\Users\iriarte\AppData\Local\Continuum\anaconda3\lib\site-packages\IPython\core\interactiveshell.py:3058: DtypeWarning: Columns (0,1,3,4) have mixed types. Specify dtype option on import or set low_memory=False.
interactivity=interactivity, compiler=compiler, result=result)
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>CUP</th>
<th>Elsevier</th>
<th>issn</th>
<th>Springer Nature</th>
<th>TF</th>
<th>Title</th>
<th>URL</th>
<th>Wiley</th>
<th>archiving</th>
<th>article_version</th>
<th>embargo_months</th>
<th>license</th>
<th>valid_from</th>
<th>valid_until</th>
<th>issnl</th>
<th>ROR</th>
<th>journal</th>
<th>rp_id</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>NaN</td>
<td>x</td>
<td>1742-7061</td>
<td>NaN</td>
<td>NaN</td>
<td>Acta Biomaterialia</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/04d8ztx87</td>
<td>899.0</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>NaN</td>
<td>x</td>
<td>1742-7061</td>
<td>NaN</td>
<td>NaN</td>
<td>Acta Biomaterialia</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/02bnkt322</td>
<td>899.0</td>
<td>2</td>
</tr>
<tr>
<td>2</td>
<td>NaN</td>
<td>x</td>
<td>1742-7061</td>
<td>NaN</td>
<td>NaN</td>
<td>Acta Biomaterialia</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/00zg4za48</td>
<td>899.0</td>
<td>3</td>
</tr>
<tr>
<td>3</td>
<td>NaN</td>
<td>x</td>
<td>1742-7061</td>
<td>NaN</td>
<td>NaN</td>
<td>Acta Biomaterialia</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/02s376052</td>
<td>899.0</td>
<td>4</td>
</tr>
<tr>
<td>4</td>
<td>NaN</td>
<td>x</td>
<td>1742-7061</td>
<td>NaN</td>
<td>NaN</td>
<td>Acta Biomaterialia</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/05a28rw58</td>
<td>899.0</td>
<td>5</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>40078</td>
<td>x</td>
<td>NaN</td>
<td>1435-8115</td>
<td>NaN</td>
<td>NaN</td>
<td>Microscopy and Microanalysis</td>
<td>http://www.cambridge.org/core/product/identifi...</td>
<td>NaN</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/01swzsf04</td>
<td>592.0</td>
<td>40079</td>
</tr>
<tr>
<td>40079</td>
<td>x</td>
<td>NaN</td>
<td>1435-8115</td>
<td>NaN</td>
<td>NaN</td>
<td>Microscopy and Microanalysis</td>
<td>http://www.cambridge.org/core/product/identifi...</td>
<td>NaN</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/019whta54</td>
<td>592.0</td>
<td>40080</td>
</tr>
<tr>
<td>40080</td>
<td>x</td>
<td>NaN</td>
<td>1435-8115</td>
<td>NaN</td>
<td>NaN</td>
<td>Microscopy and Microanalysis</td>
<td>http://www.cambridge.org/core/product/identifi...</td>
<td>NaN</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/00vasag41</td>
<td>592.0</td>
<td>40081</td>
</tr>
<tr>
<td>40081</td>
<td>x</td>
<td>NaN</td>
<td>1435-8115</td>
<td>NaN</td>
<td>NaN</td>
<td>Microscopy and Microanalysis</td>
<td>http://www.cambridge.org/core/product/identifi...</td>
<td>NaN</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/05r0ap620</td>
<td>592.0</td>
<td>40082</td>
</tr>
<tr>
<td>40082</td>
<td>x</td>
<td>NaN</td>
<td>1435-8115</td>
<td>NaN</td>
<td>NaN</td>
<td>Microscopy and Microanalysis</td>
<td>http://www.cambridge.org/core/product/identifi...</td>
<td>NaN</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/05pmsvm27</td>
<td>592.0</td>
<td>40083</td>
</tr>
</tbody>
</table>
<p>40083 rows × 18 columns</p>
</div>
```python
rp['embargo_months'].value_counts()
```
0 39163
60 920
Name: embargo_months, dtype: int64
```python
# ajout de l'éditeur dans un seul champ
# rp.loc[rp['Elsevier'] == 'x', 'public_notes'] = 'Elsevier Read & Publish agreement'
rp.loc[rp['Elsevier'] == 'x', 'rp_publisher'] = 'Elsevier'
rp.loc[rp['Springer Nature'] == 'x', 'rp_publisher'] = 'Springer Nature'
rp.loc[rp['Wiley'] == 'x', 'rp_publisher'] = 'Wiley'
rp.loc[rp['TF'] == 'x', 'rp_publisher'] = 'TF'
rp.loc[rp['CUP'] == 'x', 'rp_publisher'] = 'CUP'
rp
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>CUP</th>
<th>Elsevier</th>
<th>issn</th>
<th>Springer Nature</th>
<th>TF</th>
<th>Title</th>
<th>URL</th>
<th>Wiley</th>
<th>archiving</th>
<th>article_version</th>
<th>embargo_months</th>
<th>license</th>
<th>valid_from</th>
<th>valid_until</th>
<th>issnl</th>
<th>ROR</th>
<th>journal</th>
<th>rp_id</th>
<th>rp_publisher</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>NaN</td>
<td>x</td>
<td>1742-7061</td>
<td>NaN</td>
<td>NaN</td>
<td>Acta Biomaterialia</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/04d8ztx87</td>
<td>899.0</td>
<td>1</td>
<td>Elsevier</td>
</tr>
<tr>
<td>1</td>
<td>NaN</td>
<td>x</td>
<td>1742-7061</td>
<td>NaN</td>
<td>NaN</td>
<td>Acta Biomaterialia</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/02bnkt322</td>
<td>899.0</td>
<td>2</td>
<td>Elsevier</td>
</tr>
<tr>
<td>2</td>
<td>NaN</td>
<td>x</td>
<td>1742-7061</td>
<td>NaN</td>
<td>NaN</td>
<td>Acta Biomaterialia</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/00zg4za48</td>
<td>899.0</td>
<td>3</td>
<td>Elsevier</td>
</tr>
<tr>
<td>3</td>
<td>NaN</td>
<td>x</td>
<td>1742-7061</td>
<td>NaN</td>
<td>NaN</td>
<td>Acta Biomaterialia</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/02s376052</td>
<td>899.0</td>
<td>4</td>
<td>Elsevier</td>
</tr>
<tr>
<td>4</td>
<td>NaN</td>
<td>x</td>
<td>1742-7061</td>
<td>NaN</td>
<td>NaN</td>
<td>Acta Biomaterialia</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/05a28rw58</td>
<td>899.0</td>
<td>5</td>
<td>Elsevier</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>40078</td>
<td>x</td>
<td>NaN</td>
<td>1435-8115</td>
<td>NaN</td>
<td>NaN</td>
<td>Microscopy and Microanalysis</td>
<td>http://www.cambridge.org/core/product/identifi...</td>
<td>NaN</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/01swzsf04</td>
<td>592.0</td>
<td>40079</td>
<td>CUP</td>
</tr>
<tr>
<td>40079</td>
<td>x</td>
<td>NaN</td>
<td>1435-8115</td>
<td>NaN</td>
<td>NaN</td>
<td>Microscopy and Microanalysis</td>
<td>http://www.cambridge.org/core/product/identifi...</td>
<td>NaN</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/019whta54</td>
<td>592.0</td>
<td>40080</td>
<td>CUP</td>
</tr>
<tr>
<td>40080</td>
<td>x</td>
<td>NaN</td>
<td>1435-8115</td>
<td>NaN</td>
<td>NaN</td>
<td>Microscopy and Microanalysis</td>
<td>http://www.cambridge.org/core/product/identifi...</td>
<td>NaN</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/00vasag41</td>
<td>592.0</td>
<td>40081</td>
<td>CUP</td>
</tr>
<tr>
<td>40081</td>
<td>x</td>
<td>NaN</td>
<td>1435-8115</td>
<td>NaN</td>
<td>NaN</td>
<td>Microscopy and Microanalysis</td>
<td>http://www.cambridge.org/core/product/identifi...</td>
<td>NaN</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/05r0ap620</td>
<td>592.0</td>
<td>40082</td>
<td>CUP</td>
</tr>
<tr>
<td>40082</td>
<td>x</td>
<td>NaN</td>
<td>1435-8115</td>
<td>NaN</td>
<td>NaN</td>
<td>Microscopy and Microanalysis</td>
<td>http://www.cambridge.org/core/product/identifi...</td>
<td>NaN</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/05pmsvm27</td>
<td>592.0</td>
<td>40083</td>
<td>CUP</td>
</tr>
</tbody>
</table>
<p>40083 rows × 19 columns</p>
</div>
```python
# test des valeurs pour les versions
rp['rp_publisher'].value_counts()
```
Elsevier 18128
Wiley 13905
Springer Nature 6716
CUP 920
TF 414
Name: rp_publisher, dtype: int64
```python
# test des valeurs pour les versions
rp['license'].value_counts()
```
cc_by 17701
cc_by_nc_nd 13929
cc_by_nc 8223
cc_by_nc_sa 230
Name: license, dtype: int64
```python
# supprimer les champs inutiles et renommer les colonnes
del rp['Elsevier']
del rp['Springer Nature']
del rp['Wiley']
del rp['TF']
del rp['CUP']
del rp['URL']
rp
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>issn</th>
<th>Title</th>
<th>archiving</th>
<th>article_version</th>
<th>embargo_months</th>
<th>license</th>
<th>valid_from</th>
<th>valid_until</th>
<th>issnl</th>
<th>ROR</th>
<th>journal</th>
<th>rp_id</th>
<th>rp_publisher</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/04d8ztx87</td>
<td>899.0</td>
<td>1</td>
<td>Elsevier</td>
</tr>
<tr>
<td>1</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/02bnkt322</td>
<td>899.0</td>
<td>2</td>
<td>Elsevier</td>
</tr>
<tr>
<td>2</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/00zg4za48</td>
<td>899.0</td>
<td>3</td>
<td>Elsevier</td>
</tr>
<tr>
<td>3</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/02s376052</td>
<td>899.0</td>
<td>4</td>
<td>Elsevier</td>
</tr>
<tr>
<td>4</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/05a28rw58</td>
<td>899.0</td>
<td>5</td>
<td>Elsevier</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>40078</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/01swzsf04</td>
<td>592.0</td>
<td>40079</td>
<td>CUP</td>
</tr>
<tr>
<td>40079</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/019whta54</td>
<td>592.0</td>
<td>40080</td>
<td>CUP</td>
</tr>
<tr>
<td>40080</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/00vasag41</td>
<td>592.0</td>
<td>40081</td>
<td>CUP</td>
</tr>
<tr>
<td>40081</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/05r0ap620</td>
<td>592.0</td>
<td>40082</td>
<td>CUP</td>
</tr>
<tr>
<td>40082</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/05pmsvm27</td>
<td>592.0</td>
<td>40083</td>
<td>CUP</td>
</tr>
</tbody>
</table>
<p>40083 rows × 13 columns</p>
</div>
```python
# renommer les colonnes
rp = rp.rename(columns = {'Title' : 'title', 'ROR' : 'ror', 'read_publish_id' : 'rp_id'})
rp
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>issn</th>
<th>title</th>
<th>archiving</th>
<th>article_version</th>
<th>embargo_months</th>
<th>license</th>
<th>valid_from</th>
<th>valid_until</th>
<th>issnl</th>
<th>ror</th>
<th>journal</th>
<th>rp_id</th>
<th>rp_publisher</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/04d8ztx87</td>
<td>899.0</td>
<td>1</td>
<td>Elsevier</td>
</tr>
<tr>
<td>1</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/02bnkt322</td>
<td>899.0</td>
<td>2</td>
<td>Elsevier</td>
</tr>
<tr>
<td>2</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/00zg4za48</td>
<td>899.0</td>
<td>3</td>
<td>Elsevier</td>
</tr>
<tr>
<td>3</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/02s376052</td>
<td>899.0</td>
<td>4</td>
<td>Elsevier</td>
</tr>
<tr>
<td>4</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/05a28rw58</td>
<td>899.0</td>
<td>5</td>
<td>Elsevier</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>40078</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/01swzsf04</td>
<td>592.0</td>
<td>40079</td>
<td>CUP</td>
</tr>
<tr>
<td>40079</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/019whta54</td>
<td>592.0</td>
<td>40080</td>
<td>CUP</td>
</tr>
<tr>
<td>40080</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/00vasag41</td>
<td>592.0</td>
<td>40081</td>
<td>CUP</td>
</tr>
<tr>
<td>40081</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/05r0ap620</td>
<td>592.0</td>
<td>40082</td>
<td>CUP</td>
</tr>
<tr>
<td>40082</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/05pmsvm27</td>
<td>592.0</td>
<td>40083</td>
<td>CUP</td>
</tr>
</tbody>
</table>
<p>40083 rows × 13 columns</p>
</div>
## Table applicable_version
```python
# creation du DF
col_names = ['id',
'type',
'description'
]
applicable_version = pd.DataFrame(columns = col_names)
# 3 values : published, accepted, submitted
new_row1 = {'id':1, 'type':'submitted', 'description' : 'Submitted version'}
new_row2 = {'id':2, 'type':'accepted', 'description' : 'Accepted version'}
new_row3 = {'id':3, 'type':'published', 'description' : 'Published version'}
#append row to the dataframe
applicable_version = applicable_version.append(new_row1, ignore_index=True)
applicable_version = applicable_version.append(new_row2, ignore_index=True)
applicable_version = applicable_version.append(new_row3, ignore_index=True)
applicable_version
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id</th>
<th>type</th>
<th>description</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>submitted</td>
<td>Submitted version</td>
</tr>
<tr>
<td>1</td>
<td>2</td>
<td>accepted</td>
<td>Accepted version</td>
</tr>
<tr>
<td>2</td>
<td>3</td>
<td>published</td>
<td>Published version</td>
</tr>
</tbody>
</table>
</div>
```python
# ajout de la valeur UNKNOWN
applicable_version = applicable_version.append({'id' : 999999, 'type' : 'UNKNOWN', 'description' : 'UNKNOWN'}, ignore_index=True)
applicable_version
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id</th>
<th>type</th>
<th>description</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>submitted</td>
<td>Submitted version</td>
</tr>
<tr>
<td>1</td>
<td>2</td>
<td>accepted</td>
<td>Accepted version</td>
</tr>
<tr>
<td>2</td>
<td>3</td>
<td>published</td>
<td>Published version</td>
</tr>
<tr>
<td>3</td>
<td>999999</td>
<td>UNKNOWN</td>
<td>UNKNOWN</td>
</tr>
</tbody>
</table>
</div>
```python
# renommage des champs finaux
applicable_version_export = applicable_version[['id', 'description']]
```
```python
# export de la table applicable_version
result = applicable_version_export.to_json(orient='records', force_ascii=False)
parsed = json.loads(result)
with open('sample/version.json', 'w', encoding='utf-8') as file:
json.dump(parsed, file, indent=2, ensure_ascii=False)
```
```python
# export csv
applicable_version_export.to_csv('sample/version.tsv', sep='\t', encoding='utf-8', index=False)
```
```python
# export excel
applicable_version_export.to_excel('sample/version.xlsx', index=False)
```
```python
# merge avec la table sherpa
sherpa = pd.merge(sherpa, applicable_version[['id', 'type']], left_on='article_version', right_on='type', how='left')
sherpa
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>journal</th>
<th>issn</th>
<th>sherpa_id</th>
<th>sherpa_uri</th>
<th>open_access_prohibited</th>
<th>additional_oa_fee</th>
<th>article_version</th>
<th>license</th>
<th>embargo</th>
<th>prerequisites</th>
<th>prerequisite_funders</th>
<th>prerequisite_funders_name</th>
<th>prerequisite_funders_fundref</th>
<th>prerequisite_funders_ror</th>
<th>prerequisite_funders_country</th>
<th>prerequisite_funders_url</th>
<th>prerequisite_funders_sherpa_id</th>
<th>prerequisite_subjects</th>
<th>location</th>
<th>locations_ir</th>
<th>locations_not_ir</th>
<th>named_repository</th>
<th>named_academic_social_network</th>
<th>copyright_owner</th>
<th>publisher_deposit</th>
<th>archiving</th>
<th>conditions</th>
<th>public_notes</th>
<th>id_x</th>
<th>issnl</th>
<th>id_y</th>
<th>type</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/2050</td>
<td>no</td>
<td>no</td>
<td>submitted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; named_repository ; non_comm...</td>
<td>Non-Commercial Institutional Repository</td>
<td>Author's Homepage ; arXiv ; AgEcon ; PhilPaper...</td>
<td>arXiv ; AgEcon ; PhilPapers ; PubMed Central ;...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must acknowledge acceptance for publication ; ...</td>
<td>NaN</td>
<td>1</td>
<td>0001-2815</td>
<td>1</td>
<td>submitted</td>
</tr>
<tr>
<td>1</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/2050</td>
<td>no</td>
<td>no</td>
<td>accepted</td>
<td>NaN</td>
<td>12</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; named_repository ; non_comm...</td>
<td>Non-Commercial Institutional Repository</td>
<td>Author's Homepage ; arXiv ; AgEcon ; PhilPaper...</td>
<td>arXiv ; AgEcon ; PhilPapers ; PubMed Central ;...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Publisher source must be acknowledged with cit...</td>
<td>NaN</td>
<td>2</td>
<td>0001-2815</td>
<td>2</td>
<td>accepted</td>
</tr>
<tr>
<td>2</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/3315</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_website ; institutional_repository ; named...</td>
<td>Any Website ; Institutional Repository</td>
<td>PubMed Central ; Subject Repository ; Journal ...</td>
<td>PubMed Central</td>
<td>NaN</td>
<td>authors</td>
<td>disciplinary (PubMed Central) ;</td>
<td>True</td>
<td>Published source must be acknowledged</td>
<td>NaN</td>
<td>3</td>
<td>0001-2815</td>
<td>3</td>
<td>published</td>
</tr>
<tr>
<td>3</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/3315</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by_nc_nd</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_website ; named_repository ; non_commercia...</td>
<td>Any Website ; Non-Commercial Institutional Rep...</td>
<td>PubMed Central ; Non-Commercial Subject Reposi...</td>
<td>PubMed Central</td>
<td>NaN</td>
<td>authors</td>
<td>disciplinary (PubMed Central) ;</td>
<td>True</td>
<td>Published source must be acknowledged</td>
<td>NaN</td>
<td>4</td>
<td>0001-2815</td>
<td>3</td>
<td>published</td>
</tr>
<tr>
<td>4</td>
<td>498</td>
<td>0001-4842</td>
<td>7760</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/4</td>
<td>no</td>
<td>no</td>
<td>submitted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>named_repository ; preprint_repository ; subje...</td>
<td>NaN</td>
<td>ChemRxiv ; bioRxiv ; arXiv ; Preprint Reposito...</td>
<td>ChemRxiv ; bioRxiv ; arXiv</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>False</td>
<td>Must not violate ACS ethical Guidelines ; Must...</td>
<td>NaN</td>
<td>5</td>
<td>0001-4842</td>
<td>1</td>
<td>submitted</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>8590</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>no</td>
<td>submitted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; institutional_repository ; ...</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>8591</td>
<td>2475-9953</td>
<td>1</td>
<td>submitted</td>
</tr>
<tr>
<td>8591</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>no</td>
<td>accepted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; institutional_repository ; ...</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>8592</td>
<td>2475-9953</td>
<td>2</td>
<td>accepted</td>
</tr>
<tr>
<td>8592</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>no</td>
<td>published</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; institutional_repository ; ...</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>8593</td>
<td>2475-9953</td>
<td>3</td>
<td>published</td>
</tr>
<tr>
<td>8593</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_repository ; this_journal</td>
<td>Any Repository</td>
<td>Journal Website</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>NaN</td>
<td>NaN</td>
<td>8594</td>
<td>2475-9953</td>
<td>3</td>
<td>published</td>
</tr>
<tr>
<td>8594</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_repository ; this_journal</td>
<td>Any Repository</td>
<td>Journal Website</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>NaN</td>
<td>NaN</td>
<td>8595</td>
<td>2475-9953</td>
<td>3</td>
<td>published</td>
</tr>
</tbody>
</table>
<p>8595 rows × 32 columns</p>
</div>
```python
sherpa = sherpa.rename(columns = {'id_x' : 'id', 'id_y' : 'version'})
del sherpa['type']
sherpa
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>journal</th>
<th>issn</th>
<th>sherpa_id</th>
<th>sherpa_uri</th>
<th>open_access_prohibited</th>
<th>additional_oa_fee</th>
<th>article_version</th>
<th>license</th>
<th>embargo</th>
<th>prerequisites</th>
<th>prerequisite_funders</th>
<th>prerequisite_funders_name</th>
<th>prerequisite_funders_fundref</th>
<th>prerequisite_funders_ror</th>
<th>prerequisite_funders_country</th>
<th>prerequisite_funders_url</th>
<th>prerequisite_funders_sherpa_id</th>
<th>prerequisite_subjects</th>
<th>location</th>
<th>locations_ir</th>
<th>locations_not_ir</th>
<th>named_repository</th>
<th>named_academic_social_network</th>
<th>copyright_owner</th>
<th>publisher_deposit</th>
<th>archiving</th>
<th>conditions</th>
<th>public_notes</th>
<th>id</th>
<th>issnl</th>
<th>version</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/2050</td>
<td>no</td>
<td>no</td>
<td>submitted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; named_repository ; non_comm...</td>
<td>Non-Commercial Institutional Repository</td>
<td>Author's Homepage ; arXiv ; AgEcon ; PhilPaper...</td>
<td>arXiv ; AgEcon ; PhilPapers ; PubMed Central ;...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must acknowledge acceptance for publication ; ...</td>
<td>NaN</td>
<td>1</td>
<td>0001-2815</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/2050</td>
<td>no</td>
<td>no</td>
<td>accepted</td>
<td>NaN</td>
<td>12</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; named_repository ; non_comm...</td>
<td>Non-Commercial Institutional Repository</td>
<td>Author's Homepage ; arXiv ; AgEcon ; PhilPaper...</td>
<td>arXiv ; AgEcon ; PhilPapers ; PubMed Central ;...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Publisher source must be acknowledged with cit...</td>
<td>NaN</td>
<td>2</td>
<td>0001-2815</td>
<td>2</td>
</tr>
<tr>
<td>2</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/3315</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_website ; institutional_repository ; named...</td>
<td>Any Website ; Institutional Repository</td>
<td>PubMed Central ; Subject Repository ; Journal ...</td>
<td>PubMed Central</td>
<td>NaN</td>
<td>authors</td>
<td>disciplinary (PubMed Central) ;</td>
<td>True</td>
<td>Published source must be acknowledged</td>
<td>NaN</td>
<td>3</td>
<td>0001-2815</td>
<td>3</td>
</tr>
<tr>
<td>3</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/3315</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by_nc_nd</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_website ; named_repository ; non_commercia...</td>
<td>Any Website ; Non-Commercial Institutional Rep...</td>
<td>PubMed Central ; Non-Commercial Subject Reposi...</td>
<td>PubMed Central</td>
<td>NaN</td>
<td>authors</td>
<td>disciplinary (PubMed Central) ;</td>
<td>True</td>
<td>Published source must be acknowledged</td>
<td>NaN</td>
<td>4</td>
<td>0001-2815</td>
<td>3</td>
</tr>
<tr>
<td>4</td>
<td>498</td>
<td>0001-4842</td>
<td>7760</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/4</td>
<td>no</td>
<td>no</td>
<td>submitted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>named_repository ; preprint_repository ; subje...</td>
<td>NaN</td>
<td>ChemRxiv ; bioRxiv ; arXiv ; Preprint Reposito...</td>
<td>ChemRxiv ; bioRxiv ; arXiv</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>False</td>
<td>Must not violate ACS ethical Guidelines ; Must...</td>
<td>NaN</td>
<td>5</td>
<td>0001-4842</td>
<td>1</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>8590</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>no</td>
<td>submitted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; institutional_repository ; ...</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>8591</td>
<td>2475-9953</td>
<td>1</td>
</tr>
<tr>
<td>8591</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>no</td>
<td>accepted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; institutional_repository ; ...</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>8592</td>
<td>2475-9953</td>
<td>2</td>
</tr>
<tr>
<td>8592</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>no</td>
<td>published</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; institutional_repository ; ...</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>8593</td>
<td>2475-9953</td>
<td>3</td>
</tr>
<tr>
<td>8593</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_repository ; this_journal</td>
<td>Any Repository</td>
<td>Journal Website</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>NaN</td>
<td>NaN</td>
<td>8594</td>
<td>2475-9953</td>
<td>3</td>
</tr>
<tr>
<td>8594</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_repository ; this_journal</td>
<td>Any Repository</td>
<td>Journal Website</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>NaN</td>
<td>NaN</td>
<td>8595</td>
<td>2475-9953</td>
<td>3</td>
</tr>
</tbody>
</table>
<p>8595 rows × 31 columns</p>
</div>
```python
# merge avec la table read & publish
rp = pd.merge(rp, applicable_version[['id', 'type']], left_on='article_version', right_on='type', how='left')
rp
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>issn</th>
<th>title</th>
<th>archiving</th>
<th>article_version</th>
<th>embargo_months</th>
<th>license</th>
<th>valid_from</th>
<th>valid_until</th>
<th>issnl</th>
<th>ror</th>
<th>journal</th>
<th>rp_id</th>
<th>rp_publisher</th>
<th>id</th>
<th>type</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/04d8ztx87</td>
<td>899.0</td>
<td>1</td>
<td>Elsevier</td>
<td>3</td>
<td>published</td>
</tr>
<tr>
<td>1</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/02bnkt322</td>
<td>899.0</td>
<td>2</td>
<td>Elsevier</td>
<td>3</td>
<td>published</td>
</tr>
<tr>
<td>2</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/00zg4za48</td>
<td>899.0</td>
<td>3</td>
<td>Elsevier</td>
<td>3</td>
<td>published</td>
</tr>
<tr>
<td>3</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/02s376052</td>
<td>899.0</td>
<td>4</td>
<td>Elsevier</td>
<td>3</td>
<td>published</td>
</tr>
<tr>
<td>4</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/05a28rw58</td>
<td>899.0</td>
<td>5</td>
<td>Elsevier</td>
<td>3</td>
<td>published</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>40078</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/01swzsf04</td>
<td>592.0</td>
<td>40079</td>
<td>CUP</td>
<td>3</td>
<td>published</td>
</tr>
<tr>
<td>40079</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/019whta54</td>
<td>592.0</td>
<td>40080</td>
<td>CUP</td>
<td>3</td>
<td>published</td>
</tr>
<tr>
<td>40080</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/00vasag41</td>
<td>592.0</td>
<td>40081</td>
<td>CUP</td>
<td>3</td>
<td>published</td>
</tr>
<tr>
<td>40081</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/05r0ap620</td>
<td>592.0</td>
<td>40082</td>
<td>CUP</td>
<td>3</td>
<td>published</td>
</tr>
<tr>
<td>40082</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/05pmsvm27</td>
<td>592.0</td>
<td>40083</td>
<td>CUP</td>
<td>3</td>
<td>published</td>
</tr>
</tbody>
</table>
<p>40083 rows × 15 columns</p>
</div>
```python
rp = rp.rename(columns = {'id' : 'version'})
del rp['type']
rp
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>issn</th>
<th>title</th>
<th>archiving</th>
<th>article_version</th>
<th>embargo_months</th>
<th>license</th>
<th>valid_from</th>
<th>valid_until</th>
<th>issnl</th>
<th>ror</th>
<th>journal</th>
<th>rp_id</th>
<th>rp_publisher</th>
<th>version</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/04d8ztx87</td>
<td>899.0</td>
<td>1</td>
<td>Elsevier</td>
<td>3</td>
</tr>
<tr>
<td>1</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/02bnkt322</td>
<td>899.0</td>
<td>2</td>
<td>Elsevier</td>
<td>3</td>
</tr>
<tr>
<td>2</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/00zg4za48</td>
<td>899.0</td>
<td>3</td>
<td>Elsevier</td>
<td>3</td>
</tr>
<tr>
<td>3</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/02s376052</td>
<td>899.0</td>
<td>4</td>
<td>Elsevier</td>
<td>3</td>
</tr>
<tr>
<td>4</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/05a28rw58</td>
<td>899.0</td>
<td>5</td>
<td>Elsevier</td>
<td>3</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>40078</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/01swzsf04</td>
<td>592.0</td>
<td>40079</td>
<td>CUP</td>
<td>3</td>
</tr>
<tr>
<td>40079</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/019whta54</td>
<td>592.0</td>
<td>40080</td>
<td>CUP</td>
<td>3</td>
</tr>
<tr>
<td>40080</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/00vasag41</td>
<td>592.0</td>
<td>40081</td>
<td>CUP</td>
<td>3</td>
</tr>
<tr>
<td>40081</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/05r0ap620</td>
<td>592.0</td>
<td>40082</td>
<td>CUP</td>
<td>3</td>
</tr>
<tr>
<td>40082</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/05pmsvm27</td>
<td>592.0</td>
<td>40083</td>
<td>CUP</td>
<td>3</td>
</tr>
</tbody>
</table>
<p>40083 rows × 14 columns</p>
</div>
## Table oa_licence
```python
# creation du DF
# 'version' n'est pas utilisée, on dédoublonne par nom sans la version
col_names = ['id',
'name',
'url'
]
oa_licence = pd.DataFrame(columns = col_names)
oa_licence
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id</th>
<th>name</th>
<th>url</th>
</tr>
</thead>
<tbody>
</tbody>
</table>
</div>
```python
# export des licences
sherpa['license'].value_counts()
```
cc_by 4151
cc_by_nc_nd 2338
cc_by_nc 559
bespoke_license 47
cc_by_nc_sa 20
cc_by_nd 7
cc_by_sa 4
cc0 3
all_rights_reserved 1
Name: license, dtype: int64
```python
sherpa_licences = sherpa['license'].drop_duplicates()
sherpa_licences = sherpa_licences.dropna()
sherpa_licences
```
2 cc_by
3 cc_by_nc_nd
8 bespoke_license
29 cc_by_nc
425 cc_by_nc_sa
443 all_rights_reserved
2147 cc_by_sa
2148 cc_by_nd
8420 cc0
Name: license, dtype: object
```python
oa_licence['sherpa_code'] = np.nan
oa_licence
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id</th>
<th>name</th>
<th>url</th>
<th>sherpa_code</th>
</tr>
</thead>
<tbody>
</tbody>
</table>
</div>
```python
for code in sherpa_licences:
print (code)
oa_licence = oa_licence.append({'sherpa_code' : code}, ignore_index=True)
```
cc_by
cc_by_nc_nd
bespoke_license
cc_by_nc
cc_by_nc_sa
all_rights_reserved
cc_by_sa
cc_by_nd
cc0
```python
oa_licence
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id</th>
<th>name</th>
<th>url</th>
<th>sherpa_code</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>cc_by</td>
</tr>
<tr>
<td>1</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>cc_by_nc_nd</td>
</tr>
<tr>
<td>2</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>bespoke_license</td>
</tr>
<tr>
<td>3</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>cc_by_nc</td>
</tr>
<tr>
<td>4</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>cc_by_nc_sa</td>
</tr>
<tr>
<td>5</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>all_rights_reserved</td>
</tr>
<tr>
<td>6</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>cc_by_sa</td>
</tr>
<tr>
<td>7</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>cc_by_nd</td>
</tr>
<tr>
<td>8</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>cc0</td>
</tr>
</tbody>
</table>
</div>
```python
# convertir l'index en id
oa_licence = oa_licence.reset_index()
# ajout de l'id avec l'index + 1
oa_licence['id'] = oa_licence['index'] + 1
del oa_licence['index']
oa_licence
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id</th>
<th>name</th>
<th>url</th>
<th>sherpa_code</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>NaN</td>
<td>NaN</td>
<td>cc_by</td>
</tr>
<tr>
<td>1</td>
<td>2</td>
<td>NaN</td>
<td>NaN</td>
<td>cc_by_nc_nd</td>
</tr>
<tr>
<td>2</td>
<td>3</td>
<td>NaN</td>
<td>NaN</td>
<td>bespoke_license</td>
</tr>
<tr>
<td>3</td>
<td>4</td>
<td>NaN</td>
<td>NaN</td>
<td>cc_by_nc</td>
</tr>
<tr>
<td>4</td>
<td>5</td>
<td>NaN</td>
<td>NaN</td>
<td>cc_by_nc_sa</td>
</tr>
<tr>
<td>5</td>
<td>6</td>
<td>NaN</td>
<td>NaN</td>
<td>all_rights_reserved</td>
</tr>
<tr>
<td>6</td>
<td>7</td>
<td>NaN</td>
<td>NaN</td>
<td>cc_by_sa</td>
</tr>
<tr>
<td>7</td>
<td>8</td>
<td>NaN</td>
<td>NaN</td>
<td>cc_by_nd</td>
</tr>
<tr>
<td>8</td>
<td>9</td>
<td>NaN</td>
<td>NaN</td>
<td>cc0</td>
</tr>
</tbody>
</table>
</div>
```python
# ajout du nom et des URLs
oa_licence.loc[oa_licence['sherpa_code'] == 'cc_by', 'name'] = 'CC BY'
oa_licence.loc[oa_licence['sherpa_code'] == 'cc_by', 'url'] = 'https://creativecommons.org/licenses/by/4.0/'
oa_licence.loc[oa_licence['sherpa_code'] == 'cc_by_sa', 'name'] = 'CC BY-SA'
oa_licence.loc[oa_licence['sherpa_code'] == 'cc_by_sa', 'url'] = 'https://creativecommons.org/licenses/by-sa/4.0/'
oa_licence.loc[oa_licence['sherpa_code'] == 'cc_by_nc', 'name'] = 'CC BY-NC'
oa_licence.loc[oa_licence['sherpa_code'] == 'cc_by_nc', 'url'] = 'https://creativecommons.org/licenses/by-nc/4.0/'
oa_licence.loc[oa_licence['sherpa_code'] == 'cc_by_nc_sa', 'name'] = 'CC BY-NC-SA'
oa_licence.loc[oa_licence['sherpa_code'] == 'cc_by_nc_sa', 'url'] = 'https://creativecommons.org/licenses/by-nc-sa/4.0/'
oa_licence.loc[oa_licence['sherpa_code'] == 'cc_by_nd', 'name'] = 'CC BY-ND'
oa_licence.loc[oa_licence['sherpa_code'] == 'cc_by_nd', 'url'] = 'https://creativecommons.org/licenses/by-nd/4.0/'
oa_licence.loc[oa_licence['sherpa_code'] == 'cc_by_nc_nd', 'name'] = 'CC BY-NC-ND'
oa_licence.loc[oa_licence['sherpa_code'] == 'cc_by_nc_nd', 'url'] = 'https://creativecommons.org/licenses/by-nc-nd/4.0/'
oa_licence.loc[oa_licence['sherpa_code'] == 'cc0', 'name'] = 'CC0'
oa_licence.loc[oa_licence['sherpa_code'] == 'cc0', 'url'] = 'https://creativecommons.org/publicdomain/zero/1.0/'
oa_licence.loc[oa_licence['sherpa_code'] == 'bespoke_license', 'name'] = 'Specific license'
oa_licence.loc[oa_licence['sherpa_code'] == 'bespoke_license', 'url'] = ''
oa_licence.loc[oa_licence['sherpa_code'] == 'all_rights_reserved', 'name'] = 'All rights reserved'
oa_licence.loc[oa_licence['sherpa_code'] == 'all_rights_reserved', 'url'] = ''
oa_licence.loc[oa_licence['sherpa_code'] == 'cc_gnu_gpl', 'name'] = 'GNU GPL'
oa_licence.loc[oa_licence['sherpa_code'] == 'cc_gnu_gpl', 'url'] = 'http://gnugpl.org/'
oa_licence.loc[oa_licence['sherpa_code'] == 'cc_public_domain', 'name'] = 'Public domain'
oa_licence.loc[oa_licence['sherpa_code'] == 'cc_public_domain', 'url'] = 'https://creativecommons.org/share-your-work/public-domain/'
# oa_licence.loc[oa_licence['sherpa_code'] == 'bespoke_license', 'url'] = 'https://port.sas.ac.uk/mod/book/view.php?id=1340&chapterid=1003'
oa_licence
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id</th>
<th>name</th>
<th>url</th>
<th>sherpa_code</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>CC BY</td>
<td>https://creativecommons.org/licenses/by/4.0/</td>
<td>cc_by</td>
</tr>
<tr>
<td>1</td>
<td>2</td>
<td>CC BY-NC-ND</td>
<td>https://creativecommons.org/licenses/by-nc-nd/...</td>
<td>cc_by_nc_nd</td>
</tr>
<tr>
<td>2</td>
<td>3</td>
<td>Specific license</td>
<td></td>
<td>bespoke_license</td>
</tr>
<tr>
<td>3</td>
<td>4</td>
<td>CC BY-NC</td>
<td>https://creativecommons.org/licenses/by-nc/4.0/</td>
<td>cc_by_nc</td>
</tr>
<tr>
<td>4</td>
<td>5</td>
<td>CC BY-NC-SA</td>
<td>https://creativecommons.org/licenses/by-nc-sa/...</td>
<td>cc_by_nc_sa</td>
</tr>
<tr>
<td>5</td>
<td>6</td>
<td>All rights reserved</td>
<td></td>
<td>all_rights_reserved</td>
</tr>
<tr>
<td>6</td>
<td>7</td>
<td>CC BY-SA</td>
<td>https://creativecommons.org/licenses/by-sa/4.0/</td>
<td>cc_by_sa</td>
</tr>
<tr>
<td>7</td>
<td>8</td>
<td>CC BY-ND</td>
<td>https://creativecommons.org/licenses/by-nd/4.0/</td>
<td>cc_by_nd</td>
</tr>
<tr>
<td>8</td>
<td>9</td>
<td>CC0</td>
<td>https://creativecommons.org/publicdomain/zero/...</td>
<td>cc0</td>
</tr>
</tbody>
</table>
</div>
```python
# ajout de la valeur UNKNOWN
oa_licence = oa_licence.append({'id' : 999999, 'sherpa_code' : '___', 'name' : 'UNKNOWN', 'url' : ''}, ignore_index=True)
oa_licence
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id</th>
<th>name</th>
<th>url</th>
<th>sherpa_code</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>CC BY</td>
<td>https://creativecommons.org/licenses/by/4.0/</td>
<td>cc_by</td>
</tr>
<tr>
<td>1</td>
<td>2</td>
<td>CC BY-NC-ND</td>
<td>https://creativecommons.org/licenses/by-nc-nd/...</td>
<td>cc_by_nc_nd</td>
</tr>
<tr>
<td>2</td>
<td>3</td>
<td>Specific license</td>
<td></td>
<td>bespoke_license</td>
</tr>
<tr>
<td>3</td>
<td>4</td>
<td>CC BY-NC</td>
<td>https://creativecommons.org/licenses/by-nc/4.0/</td>
<td>cc_by_nc</td>
</tr>
<tr>
<td>4</td>
<td>5</td>
<td>CC BY-NC-SA</td>
<td>https://creativecommons.org/licenses/by-nc-sa/...</td>
<td>cc_by_nc_sa</td>
</tr>
<tr>
<td>5</td>
<td>6</td>
<td>All rights reserved</td>
<td></td>
<td>all_rights_reserved</td>
</tr>
<tr>
<td>6</td>
<td>7</td>
<td>CC BY-SA</td>
<td>https://creativecommons.org/licenses/by-sa/4.0/</td>
<td>cc_by_sa</td>
</tr>
<tr>
<td>7</td>
<td>8</td>
<td>CC BY-ND</td>
<td>https://creativecommons.org/licenses/by-nd/4.0/</td>
<td>cc_by_nd</td>
</tr>
<tr>
<td>8</td>
<td>9</td>
<td>CC0</td>
<td>https://creativecommons.org/publicdomain/zero/...</td>
<td>cc0</td>
</tr>
<tr>
<td>9</td>
<td>999999</td>
<td>UNKNOWN</td>
<td></td>
<td>___</td>
</tr>
</tbody>
</table>
</div>
```python
# ajout aux tables sherpa et rp
sherpa = sherpa.rename(columns = {'license' : 'sherpa_code'})
sherpa
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>journal</th>
<th>issn</th>
<th>sherpa_id</th>
<th>sherpa_uri</th>
<th>open_access_prohibited</th>
<th>additional_oa_fee</th>
<th>article_version</th>
<th>sherpa_code</th>
<th>embargo</th>
<th>prerequisites</th>
<th>prerequisite_funders</th>
<th>prerequisite_funders_name</th>
<th>prerequisite_funders_fundref</th>
<th>prerequisite_funders_ror</th>
<th>prerequisite_funders_country</th>
<th>prerequisite_funders_url</th>
<th>prerequisite_funders_sherpa_id</th>
<th>prerequisite_subjects</th>
<th>location</th>
<th>locations_ir</th>
<th>locations_not_ir</th>
<th>named_repository</th>
<th>named_academic_social_network</th>
<th>copyright_owner</th>
<th>publisher_deposit</th>
<th>archiving</th>
<th>conditions</th>
<th>public_notes</th>
<th>id</th>
<th>issnl</th>
<th>version</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/2050</td>
<td>no</td>
<td>no</td>
<td>submitted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; named_repository ; non_comm...</td>
<td>Non-Commercial Institutional Repository</td>
<td>Author's Homepage ; arXiv ; AgEcon ; PhilPaper...</td>
<td>arXiv ; AgEcon ; PhilPapers ; PubMed Central ;...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must acknowledge acceptance for publication ; ...</td>
<td>NaN</td>
<td>1</td>
<td>0001-2815</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/2050</td>
<td>no</td>
<td>no</td>
<td>accepted</td>
<td>NaN</td>
<td>12</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; named_repository ; non_comm...</td>
<td>Non-Commercial Institutional Repository</td>
<td>Author's Homepage ; arXiv ; AgEcon ; PhilPaper...</td>
<td>arXiv ; AgEcon ; PhilPapers ; PubMed Central ;...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Publisher source must be acknowledged with cit...</td>
<td>NaN</td>
<td>2</td>
<td>0001-2815</td>
<td>2</td>
</tr>
<tr>
<td>2</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/3315</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_website ; institutional_repository ; named...</td>
<td>Any Website ; Institutional Repository</td>
<td>PubMed Central ; Subject Repository ; Journal ...</td>
<td>PubMed Central</td>
<td>NaN</td>
<td>authors</td>
<td>disciplinary (PubMed Central) ;</td>
<td>True</td>
<td>Published source must be acknowledged</td>
<td>NaN</td>
<td>3</td>
<td>0001-2815</td>
<td>3</td>
</tr>
<tr>
<td>3</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/3315</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by_nc_nd</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_website ; named_repository ; non_commercia...</td>
<td>Any Website ; Non-Commercial Institutional Rep...</td>
<td>PubMed Central ; Non-Commercial Subject Reposi...</td>
<td>PubMed Central</td>
<td>NaN</td>
<td>authors</td>
<td>disciplinary (PubMed Central) ;</td>
<td>True</td>
<td>Published source must be acknowledged</td>
<td>NaN</td>
<td>4</td>
<td>0001-2815</td>
<td>3</td>
</tr>
<tr>
<td>4</td>
<td>498</td>
<td>0001-4842</td>
<td>7760</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/4</td>
<td>no</td>
<td>no</td>
<td>submitted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>named_repository ; preprint_repository ; subje...</td>
<td>NaN</td>
<td>ChemRxiv ; bioRxiv ; arXiv ; Preprint Reposito...</td>
<td>ChemRxiv ; bioRxiv ; arXiv</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>False</td>
<td>Must not violate ACS ethical Guidelines ; Must...</td>
<td>NaN</td>
<td>5</td>
<td>0001-4842</td>
<td>1</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>8590</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>no</td>
<td>submitted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; institutional_repository ; ...</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>8591</td>
<td>2475-9953</td>
<td>1</td>
</tr>
<tr>
<td>8591</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>no</td>
<td>accepted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; institutional_repository ; ...</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>8592</td>
<td>2475-9953</td>
<td>2</td>
</tr>
<tr>
<td>8592</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>no</td>
<td>published</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; institutional_repository ; ...</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>8593</td>
<td>2475-9953</td>
<td>3</td>
</tr>
<tr>
<td>8593</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_repository ; this_journal</td>
<td>Any Repository</td>
<td>Journal Website</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>NaN</td>
<td>NaN</td>
<td>8594</td>
<td>2475-9953</td>
<td>3</td>
</tr>
<tr>
<td>8594</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_repository ; this_journal</td>
<td>Any Repository</td>
<td>Journal Website</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>NaN</td>
<td>NaN</td>
<td>8595</td>
<td>2475-9953</td>
<td>3</td>
</tr>
</tbody>
</table>
<p>8595 rows × 31 columns</p>
</div>
```python
# ajout aux tables sherpa et rp
rp = rp.rename(columns = {'license' : 'sherpa_code'})
rp
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>issn</th>
<th>title</th>
<th>archiving</th>
<th>article_version</th>
<th>embargo_months</th>
<th>sherpa_code</th>
<th>valid_from</th>
<th>valid_until</th>
<th>issnl</th>
<th>ror</th>
<th>journal</th>
<th>rp_id</th>
<th>rp_publisher</th>
<th>version</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/04d8ztx87</td>
<td>899.0</td>
<td>1</td>
<td>Elsevier</td>
<td>3</td>
</tr>
<tr>
<td>1</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/02bnkt322</td>
<td>899.0</td>
<td>2</td>
<td>Elsevier</td>
<td>3</td>
</tr>
<tr>
<td>2</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/00zg4za48</td>
<td>899.0</td>
<td>3</td>
<td>Elsevier</td>
<td>3</td>
</tr>
<tr>
<td>3</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/02s376052</td>
<td>899.0</td>
<td>4</td>
<td>Elsevier</td>
<td>3</td>
</tr>
<tr>
<td>4</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/05a28rw58</td>
<td>899.0</td>
<td>5</td>
<td>Elsevier</td>
<td>3</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>40078</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/01swzsf04</td>
<td>592.0</td>
<td>40079</td>
<td>CUP</td>
<td>3</td>
</tr>
<tr>
<td>40079</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/019whta54</td>
<td>592.0</td>
<td>40080</td>
<td>CUP</td>
<td>3</td>
</tr>
<tr>
<td>40080</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/00vasag41</td>
<td>592.0</td>
<td>40081</td>
<td>CUP</td>
<td>3</td>
</tr>
<tr>
<td>40081</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/05r0ap620</td>
<td>592.0</td>
<td>40082</td>
<td>CUP</td>
<td>3</td>
</tr>
<tr>
<td>40082</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/05pmsvm27</td>
<td>592.0</td>
<td>40083</td>
<td>CUP</td>
<td>3</td>
</tr>
</tbody>
</table>
<p>40083 rows × 14 columns</p>
</div>
```python
# merge
sherpa = pd.merge(sherpa, oa_licence[['sherpa_code', 'id']], on='sherpa_code', how='left')
sherpa
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>journal</th>
<th>issn</th>
<th>sherpa_id</th>
<th>sherpa_uri</th>
<th>open_access_prohibited</th>
<th>additional_oa_fee</th>
<th>article_version</th>
<th>sherpa_code</th>
<th>embargo</th>
<th>prerequisites</th>
<th>prerequisite_funders</th>
<th>prerequisite_funders_name</th>
<th>prerequisite_funders_fundref</th>
<th>prerequisite_funders_ror</th>
<th>prerequisite_funders_country</th>
<th>prerequisite_funders_url</th>
<th>prerequisite_funders_sherpa_id</th>
<th>prerequisite_subjects</th>
<th>location</th>
<th>locations_ir</th>
<th>locations_not_ir</th>
<th>named_repository</th>
<th>named_academic_social_network</th>
<th>copyright_owner</th>
<th>publisher_deposit</th>
<th>archiving</th>
<th>conditions</th>
<th>public_notes</th>
<th>id_x</th>
<th>issnl</th>
<th>version</th>
<th>id_y</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/2050</td>
<td>no</td>
<td>no</td>
<td>submitted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; named_repository ; non_comm...</td>
<td>Non-Commercial Institutional Repository</td>
<td>Author's Homepage ; arXiv ; AgEcon ; PhilPaper...</td>
<td>arXiv ; AgEcon ; PhilPapers ; PubMed Central ;...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must acknowledge acceptance for publication ; ...</td>
<td>NaN</td>
<td>1</td>
<td>0001-2815</td>
<td>1</td>
<td>NaN</td>
</tr>
<tr>
<td>1</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/2050</td>
<td>no</td>
<td>no</td>
<td>accepted</td>
<td>NaN</td>
<td>12</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; named_repository ; non_comm...</td>
<td>Non-Commercial Institutional Repository</td>
<td>Author's Homepage ; arXiv ; AgEcon ; PhilPaper...</td>
<td>arXiv ; AgEcon ; PhilPapers ; PubMed Central ;...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Publisher source must be acknowledged with cit...</td>
<td>NaN</td>
<td>2</td>
<td>0001-2815</td>
<td>2</td>
<td>NaN</td>
</tr>
<tr>
<td>2</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/3315</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_website ; institutional_repository ; named...</td>
<td>Any Website ; Institutional Repository</td>
<td>PubMed Central ; Subject Repository ; Journal ...</td>
<td>PubMed Central</td>
<td>NaN</td>
<td>authors</td>
<td>disciplinary (PubMed Central) ;</td>
<td>True</td>
<td>Published source must be acknowledged</td>
<td>NaN</td>
<td>3</td>
<td>0001-2815</td>
<td>3</td>
<td>1.0</td>
</tr>
<tr>
<td>3</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/3315</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by_nc_nd</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_website ; named_repository ; non_commercia...</td>
<td>Any Website ; Non-Commercial Institutional Rep...</td>
<td>PubMed Central ; Non-Commercial Subject Reposi...</td>
<td>PubMed Central</td>
<td>NaN</td>
<td>authors</td>
<td>disciplinary (PubMed Central) ;</td>
<td>True</td>
<td>Published source must be acknowledged</td>
<td>NaN</td>
<td>4</td>
<td>0001-2815</td>
<td>3</td>
<td>2.0</td>
</tr>
<tr>
<td>4</td>
<td>498</td>
<td>0001-4842</td>
<td>7760</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/4</td>
<td>no</td>
<td>no</td>
<td>submitted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>named_repository ; preprint_repository ; subje...</td>
<td>NaN</td>
<td>ChemRxiv ; bioRxiv ; arXiv ; Preprint Reposito...</td>
<td>ChemRxiv ; bioRxiv ; arXiv</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>False</td>
<td>Must not violate ACS ethical Guidelines ; Must...</td>
<td>NaN</td>
<td>5</td>
<td>0001-4842</td>
<td>1</td>
<td>NaN</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>8590</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>no</td>
<td>submitted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; institutional_repository ; ...</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>8591</td>
<td>2475-9953</td>
<td>1</td>
<td>NaN</td>
</tr>
<tr>
<td>8591</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>no</td>
<td>accepted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; institutional_repository ; ...</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>8592</td>
<td>2475-9953</td>
<td>2</td>
<td>NaN</td>
</tr>
<tr>
<td>8592</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>no</td>
<td>published</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; institutional_repository ; ...</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>8593</td>
<td>2475-9953</td>
<td>3</td>
<td>NaN</td>
</tr>
<tr>
<td>8593</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_repository ; this_journal</td>
<td>Any Repository</td>
<td>Journal Website</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>NaN</td>
<td>NaN</td>
<td>8594</td>
<td>2475-9953</td>
<td>3</td>
<td>1.0</td>
</tr>
<tr>
<td>8594</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_repository ; this_journal</td>
<td>Any Repository</td>
<td>Journal Website</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>NaN</td>
<td>NaN</td>
<td>8595</td>
<td>2475-9953</td>
<td>3</td>
<td>1.0</td>
</tr>
</tbody>
</table>
<p>8595 rows × 32 columns</p>
</div>
```python
sherpa = sherpa.rename(columns = {'id_x' : 'id', 'id_y' : 'licence'})
sherpa
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>journal</th>
<th>issn</th>
<th>sherpa_id</th>
<th>sherpa_uri</th>
<th>open_access_prohibited</th>
<th>additional_oa_fee</th>
<th>article_version</th>
<th>sherpa_code</th>
<th>embargo</th>
<th>prerequisites</th>
<th>prerequisite_funders</th>
<th>prerequisite_funders_name</th>
<th>prerequisite_funders_fundref</th>
<th>prerequisite_funders_ror</th>
<th>prerequisite_funders_country</th>
<th>prerequisite_funders_url</th>
<th>prerequisite_funders_sherpa_id</th>
<th>prerequisite_subjects</th>
<th>location</th>
<th>locations_ir</th>
<th>locations_not_ir</th>
<th>named_repository</th>
<th>named_academic_social_network</th>
<th>copyright_owner</th>
<th>publisher_deposit</th>
<th>archiving</th>
<th>conditions</th>
<th>public_notes</th>
<th>id</th>
<th>issnl</th>
<th>version</th>
<th>licence</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/2050</td>
<td>no</td>
<td>no</td>
<td>submitted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; named_repository ; non_comm...</td>
<td>Non-Commercial Institutional Repository</td>
<td>Author's Homepage ; arXiv ; AgEcon ; PhilPaper...</td>
<td>arXiv ; AgEcon ; PhilPapers ; PubMed Central ;...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must acknowledge acceptance for publication ; ...</td>
<td>NaN</td>
<td>1</td>
<td>0001-2815</td>
<td>1</td>
<td>NaN</td>
</tr>
<tr>
<td>1</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/2050</td>
<td>no</td>
<td>no</td>
<td>accepted</td>
<td>NaN</td>
<td>12</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; named_repository ; non_comm...</td>
<td>Non-Commercial Institutional Repository</td>
<td>Author's Homepage ; arXiv ; AgEcon ; PhilPaper...</td>
<td>arXiv ; AgEcon ; PhilPapers ; PubMed Central ;...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Publisher source must be acknowledged with cit...</td>
<td>NaN</td>
<td>2</td>
<td>0001-2815</td>
<td>2</td>
<td>NaN</td>
</tr>
<tr>
<td>2</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/3315</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_website ; institutional_repository ; named...</td>
<td>Any Website ; Institutional Repository</td>
<td>PubMed Central ; Subject Repository ; Journal ...</td>
<td>PubMed Central</td>
<td>NaN</td>
<td>authors</td>
<td>disciplinary (PubMed Central) ;</td>
<td>True</td>
<td>Published source must be acknowledged</td>
<td>NaN</td>
<td>3</td>
<td>0001-2815</td>
<td>3</td>
<td>1.0</td>
</tr>
<tr>
<td>3</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/3315</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by_nc_nd</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_website ; named_repository ; non_commercia...</td>
<td>Any Website ; Non-Commercial Institutional Rep...</td>
<td>PubMed Central ; Non-Commercial Subject Reposi...</td>
<td>PubMed Central</td>
<td>NaN</td>
<td>authors</td>
<td>disciplinary (PubMed Central) ;</td>
<td>True</td>
<td>Published source must be acknowledged</td>
<td>NaN</td>
<td>4</td>
<td>0001-2815</td>
<td>3</td>
<td>2.0</td>
</tr>
<tr>
<td>4</td>
<td>498</td>
<td>0001-4842</td>
<td>7760</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/4</td>
<td>no</td>
<td>no</td>
<td>submitted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>named_repository ; preprint_repository ; subje...</td>
<td>NaN</td>
<td>ChemRxiv ; bioRxiv ; arXiv ; Preprint Reposito...</td>
<td>ChemRxiv ; bioRxiv ; arXiv</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>False</td>
<td>Must not violate ACS ethical Guidelines ; Must...</td>
<td>NaN</td>
<td>5</td>
<td>0001-4842</td>
<td>1</td>
<td>NaN</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>8590</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>no</td>
<td>submitted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; institutional_repository ; ...</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>8591</td>
<td>2475-9953</td>
<td>1</td>
<td>NaN</td>
</tr>
<tr>
<td>8591</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>no</td>
<td>accepted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; institutional_repository ; ...</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>8592</td>
<td>2475-9953</td>
<td>2</td>
<td>NaN</td>
</tr>
<tr>
<td>8592</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>no</td>
<td>published</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; institutional_repository ; ...</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>8593</td>
<td>2475-9953</td>
<td>3</td>
<td>NaN</td>
</tr>
<tr>
<td>8593</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_repository ; this_journal</td>
<td>Any Repository</td>
<td>Journal Website</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>NaN</td>
<td>NaN</td>
<td>8594</td>
<td>2475-9953</td>
<td>3</td>
<td>1.0</td>
</tr>
<tr>
<td>8594</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_repository ; this_journal</td>
<td>Any Repository</td>
<td>Journal Website</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>NaN</td>
<td>NaN</td>
<td>8595</td>
<td>2475-9953</td>
<td>3</td>
<td>1.0</td>
</tr>
</tbody>
</table>
<p>8595 rows × 32 columns</p>
</div>
```python
# merge
rp = pd.merge(rp, oa_licence[['sherpa_code', 'id']], on='sherpa_code', how='left')
rp
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>issn</th>
<th>title</th>
<th>archiving</th>
<th>article_version</th>
<th>embargo_months</th>
<th>sherpa_code</th>
<th>valid_from</th>
<th>valid_until</th>
<th>issnl</th>
<th>ror</th>
<th>journal</th>
<th>rp_id</th>
<th>rp_publisher</th>
<th>version</th>
<th>id</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/04d8ztx87</td>
<td>899.0</td>
<td>1</td>
<td>Elsevier</td>
<td>3</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/02bnkt322</td>
<td>899.0</td>
<td>2</td>
<td>Elsevier</td>
<td>3</td>
<td>1</td>
</tr>
<tr>
<td>2</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/00zg4za48</td>
<td>899.0</td>
<td>3</td>
<td>Elsevier</td>
<td>3</td>
<td>1</td>
</tr>
<tr>
<td>3</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/02s376052</td>
<td>899.0</td>
<td>4</td>
<td>Elsevier</td>
<td>3</td>
<td>1</td>
</tr>
<tr>
<td>4</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/05a28rw58</td>
<td>899.0</td>
<td>5</td>
<td>Elsevier</td>
<td>3</td>
<td>1</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>40078</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/01swzsf04</td>
<td>592.0</td>
<td>40079</td>
<td>CUP</td>
<td>3</td>
<td>5</td>
</tr>
<tr>
<td>40079</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/019whta54</td>
<td>592.0</td>
<td>40080</td>
<td>CUP</td>
<td>3</td>
<td>5</td>
</tr>
<tr>
<td>40080</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/00vasag41</td>
<td>592.0</td>
<td>40081</td>
<td>CUP</td>
<td>3</td>
<td>5</td>
</tr>
<tr>
<td>40081</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/05r0ap620</td>
<td>592.0</td>
<td>40082</td>
<td>CUP</td>
<td>3</td>
<td>5</td>
</tr>
<tr>
<td>40082</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/05pmsvm27</td>
<td>592.0</td>
<td>40083</td>
<td>CUP</td>
<td>3</td>
<td>5</td>
</tr>
</tbody>
</table>
<p>40083 rows × 15 columns</p>
</div>
```python
rp = rp.rename(columns = {'id' : 'licence'})
rp
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>issn</th>
<th>title</th>
<th>archiving</th>
<th>article_version</th>
<th>embargo_months</th>
<th>sherpa_code</th>
<th>valid_from</th>
<th>valid_until</th>
<th>issnl</th>
<th>ror</th>
<th>journal</th>
<th>rp_id</th>
<th>rp_publisher</th>
<th>version</th>
<th>licence</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/04d8ztx87</td>
<td>899.0</td>
<td>1</td>
<td>Elsevier</td>
<td>3</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/02bnkt322</td>
<td>899.0</td>
<td>2</td>
<td>Elsevier</td>
<td>3</td>
<td>1</td>
</tr>
<tr>
<td>2</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/00zg4za48</td>
<td>899.0</td>
<td>3</td>
<td>Elsevier</td>
<td>3</td>
<td>1</td>
</tr>
<tr>
<td>3</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/02s376052</td>
<td>899.0</td>
<td>4</td>
<td>Elsevier</td>
<td>3</td>
<td>1</td>
</tr>
<tr>
<td>4</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/05a28rw58</td>
<td>899.0</td>
<td>5</td>
<td>Elsevier</td>
<td>3</td>
<td>1</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>40078</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/01swzsf04</td>
<td>592.0</td>
<td>40079</td>
<td>CUP</td>
<td>3</td>
<td>5</td>
</tr>
<tr>
<td>40079</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/019whta54</td>
<td>592.0</td>
<td>40080</td>
<td>CUP</td>
<td>3</td>
<td>5</td>
</tr>
<tr>
<td>40080</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/00vasag41</td>
<td>592.0</td>
<td>40081</td>
<td>CUP</td>
<td>3</td>
<td>5</td>
</tr>
<tr>
<td>40081</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/05r0ap620</td>
<td>592.0</td>
<td>40082</td>
<td>CUP</td>
<td>3</td>
<td>5</td>
</tr>
<tr>
<td>40082</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/05pmsvm27</td>
<td>592.0</td>
<td>40083</td>
<td>CUP</td>
<td>3</td>
<td>5</td>
</tr>
</tbody>
</table>
<p>40083 rows × 15 columns</p>
</div>
```python
# renommage des champs finaux
oa_licence_export = oa_licence[['id', 'name', 'url']]
oa_licence_export = oa_licence_export.rename(columns={'name' : 'name_or_abbrev', 'url' : 'website'})
```
```python
# export de la table oa_licence
result = oa_licence_export.to_json(orient='records', force_ascii=False)
parsed = json.loads(result)
with open('sample/licence.json', 'w', encoding='utf-8') as file:
json.dump(parsed, file, indent=2, ensure_ascii=False)
```
```python
# export csv
oa_licence_export.to_csv('sample/licence.tsv', sep='\t', encoding='utf-8', index=False)
```
```python
# export excel
oa_licence_export.to_excel('sample/licence.xlsx', index=False)
```
## Table cost_factor_type
```python
# creation du DF
col_names = ['id',
'name'
]
cost_factor_type = pd.DataFrame(columns = col_names)
cost_factor_type = cost_factor_type.append({'id' : 1, 'name' : 'APC'}, ignore_index=True)
cost_factor_type = cost_factor_type.append({'id' : 2, 'name' : 'Discount'}, ignore_index=True)
cost_factor_type = cost_factor_type.append({'id' : 3, 'name' : 'Refund'}, ignore_index=True)
cost_factor_type
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id</th>
<th>name</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>APC</td>
</tr>
<tr>
<td>1</td>
<td>2</td>
<td>Discount</td>
</tr>
<tr>
<td>2</td>
<td>3</td>
<td>Refund</td>
</tr>
</tbody>
</table>
</div>
```python
# ajout de la valeur UNKNOWN
cost_factor_type = cost_factor_type.append({'id' : 999999, 'name' : 'UNKNOWN'}, ignore_index=True)
cost_factor_type
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id</th>
<th>name</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>APC</td>
</tr>
<tr>
<td>1</td>
<td>2</td>
<td>Discount</td>
</tr>
<tr>
<td>2</td>
<td>3</td>
<td>Refund</td>
</tr>
<tr>
<td>3</td>
<td>999999</td>
<td>UNKNOWN</td>
</tr>
</tbody>
</table>
</div>
```python
# export de la table
result = cost_factor_type.to_json(orient='records', force_ascii=False)
parsed = json.loads(result)
with open('sample/cost_factor_type.json', 'w', encoding='utf-8') as file:
json.dump(parsed, file, indent=2, ensure_ascii=False)
```
```python
# export csv
cost_factor_type.to_csv('sample/cost_factor_type.tsv', sep='\t', encoding='utf-8', index=False)
```
```python
# export excel
cost_factor_type.to_excel('sample/cost_factor_type.xlsx', index=False)
```
## Table cost_factor
### Ajout des données des APCs depuis DOAJ
```python
# ajout de DOAJ info
doaj = pd.read_csv('doaj/journalcsv__doaj_20210312_0636_utf8.csv', encoding='utf-8', header=0)
doaj
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>Journal title</th>
<th>Journal URL</th>
<th>URL in DOAJ</th>
<th>Alternative title</th>
<th>Journal ISSN (print version)</th>
<th>Journal EISSN (online version)</th>
<th>Keywords</th>
<th>Languages in which the journal accepts manuscripts</th>
<th>Publisher</th>
<th>Country of publisher</th>
<th>Society or institution</th>
<th>Country of society or institution</th>
<th>Journal license</th>
<th>License attributes</th>
<th>URL for license terms</th>
<th>Machine-readable CC licensing information embedded or displayed in articles</th>
<th>URL to an example page with embedded licensing information</th>
<th>Author holds copyright without restrictions</th>
<th>Copyright information URL</th>
<th>Review process</th>
<th>Review process information URL</th>
<th>Journal plagiarism screening policy</th>
<th>Plagiarism information URL</th>
<th>URL for journal's aims &amp; scope</th>
<th>URL for the Editorial Board page</th>
<th>URL for journal's instructions for authors</th>
<th>Average number of weeks between article submission and publication</th>
<th>APC</th>
<th>APC information URL</th>
<th>APC amount</th>
<th>Journal waiver policy (for developing country authors etc)</th>
<th>Waiver policy information URL</th>
<th>Has other fees</th>
<th>Other submission fees information URL</th>
<th>Preservation Services</th>
<th>Preservation Service: national library</th>
<th>Preservation information URL</th>
<th>Deposit policy directory</th>
<th>URL for deposit policy</th>
<th>Persistent article identifiers</th>
<th>Article metadata includes ORCIDs</th>
<th>Journal complies with I4OC standards for open citations</th>
<th>Does this journal allow unrestricted reuse in compliance with BOAI?</th>
<th>URL for journal's Open Access statement</th>
<th>Continues</th>
<th>Continued By</th>
<th>LCC Codes</th>
<th>Subjects</th>
<th>DOAJ Seal</th>
<th>Added on Date</th>
<th>Last updated Date</th>
<th>Number of Article Records</th>
<th>Most Recent Article Added</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>Anais da Academia Brasileira de Ciências</td>
<td>http://www.scielo.br/scielo.php?script=sci_ser...</td>
<td>https://doaj.org/toc/ed09859a464f4461b1af34279...</td>
<td>Annals of the Brazilian Academy of Sciences</td>
<td>0001-3765</td>
<td>1678-2690</td>
<td>biological sciences, exact and earth sciences,...</td>
<td>English</td>
<td>Academia Brasileira de Ciências</td>
<td>Brazil</td>
<td>NaN</td>
<td>NaN</td>
<td>CC BY</td>
<td>NaN</td>
<td>http://www.scielo.br/revistas/aabc/iaboutj.htm</td>
<td>Yes</td>
<td>http://www.scielo.br/scielo.php?script=sci_art...</td>
<td>No</td>
<td>NaN</td>
<td>Peer review</td>
<td>http://www.scielo.br/revistas/aabc/iinstruc.htm</td>
<td>Yes</td>
<td>http://www.scielo.br/revistas/aabc/iinstruc.htm</td>
<td>http://www.scielo.br/revistas/aabc/iaboutj.htm</td>
<td>http://www.scielo.br/revistas/aabc/iedboard.htm</td>
<td>http://www.scielo.br/revistas/aabc/iinstruc.htm</td>
<td>18</td>
<td>No</td>
<td>http://www.scielo.br/revistas/aabc/iinstruc.htm</td>
<td>NaN</td>
<td>No</td>
<td>NaN</td>
<td>No</td>
<td>http://www.scielo.br/revistas/aabc/iinstruc.htm</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>DOI</td>
<td>NaN</td>
<td>NaN</td>
<td>Yes</td>
<td>http://www.scielo.br/revistas/aabc/isubscrp.htm</td>
<td>NaN</td>
<td>NaN</td>
<td>Q</td>
<td>Science</td>
<td>No</td>
<td>2004-04-23T21:31:00Z</td>
<td>2017-01-04T14:19:54Z</td>
<td>2649</td>
<td>2020-06-10T21:49:11Z</td>
</tr>
<tr>
<td>1</td>
<td>ACME</td>
<td>http://riviste.unimi.it/index.php/ACME</td>
<td>https://doaj.org/toc/b1ca04ba56194f29a362b3eef...</td>
<td>NaN</td>
<td>0001-494X</td>
<td>2282-0035</td>
<td>italian literature, classic literature, lingui...</td>
<td>Italian</td>
<td>Università degli Studi di Milano</td>
<td>Italy</td>
<td>NaN</td>
<td>NaN</td>
<td>CC BY-NC-ND</td>
<td>NaN</td>
<td>http://riviste.unimi.it/index.php/ACME/index</td>
<td>Yes</td>
<td>http://riviste.unimi.it/index.php/ACME/article...</td>
<td>Yes</td>
<td>http://riviste.unimi.it/index.php/ACME/about/e...</td>
<td>Blind peer review</td>
<td>https://riviste.unimi.it/index.php/ACME/about</td>
<td>No</td>
<td>NaN</td>
<td>https://riviste.unimi.it/index.php/ACME/about</td>
<td>https://riviste.unimi.it/index.php/ACME/about/...</td>
<td>http://riviste.unimi.it/index.php/ACME/about/s...</td>
<td>12</td>
<td>No</td>
<td>https://riviste.unimi.it/index.php/Lebenswelt/...</td>
<td>NaN</td>
<td>No</td>
<td>NaN</td>
<td>No</td>
<td>https://riviste.unimi.it/index.php/Lebenswelt/...</td>
<td>NaN</td>
<td>Italian National Library (BNCF)</td>
<td>http://www.depositolegale.it/</td>
<td>NaN</td>
<td>NaN</td>
<td>DOI, NBN</td>
<td>NaN</td>
<td>NaN</td>
<td>Yes</td>
<td>http://riviste.unimi.it/index.php/ACME/about/e...</td>
<td>NaN</td>
<td>NaN</td>
<td>A</td>
<td>General Works</td>
<td>No</td>
<td>2014-12-22T19:55:58Z</td>
<td>2020-02-24T09:07:42Z</td>
<td>166</td>
<td>2020-06-19T09:42:34Z</td>
</tr>
<tr>
<td>2</td>
<td>Acta Dermato-Venereologica</td>
<td>http://www.medicaljournals.se/acta</td>
<td>https://doaj.org/toc/ffde9666ab1d46f1a8c688ce6...</td>
<td>NaN</td>
<td>0001-5555</td>
<td>1651-2057</td>
<td>sexually transmitted infections, psoriasis, ps...</td>
<td>English</td>
<td>Society for Publication of Acta Dermato-Venere...</td>
<td>Sweden</td>
<td>NaN</td>
<td>NaN</td>
<td>CC BY-NC</td>
<td>NaN</td>
<td>https://www.medicaljournals.se/acta/open-acces...</td>
<td>NaN</td>
<td>NaN</td>
<td>No</td>
<td>NaN</td>
<td>Peer review</td>
<td>https://www.medicaljournals.se/acta/instructio...</td>
<td>No</td>
<td>NaN</td>
<td>http://www.medicaljournals.se/acta</td>
<td>https://www.medicaljournals.se/acta/editors</td>
<td>https://www.medicaljournals.se/acta/instructio...</td>
<td>20</td>
<td>Yes</td>
<td>https://www.medicaljournals.se/acta/instructio...</td>
<td>1600 EUR</td>
<td>No</td>
<td>NaN</td>
<td>Yes</td>
<td>https://www.medicaljournals.se/acta/instructio...</td>
<td>NaN</td>
<td>NaN</td>
<td>http://www.ingentaconnect.com/publisher/claimi...</td>
<td>Sherpa/Romeo</td>
<td>NaN</td>
<td>DOI</td>
<td>NaN</td>
<td>NaN</td>
<td>Yes</td>
<td>https://www.medicaljournals.se/acta/open-acces...</td>
<td>NaN</td>
<td>NaN</td>
<td>RL1-803</td>
<td>Medicine: Dermatology</td>
<td>No</td>
<td>2011-11-10T12:31:05Z</td>
<td>2017-02-22T11:14:48Z</td>
<td>1096</td>
<td>2021-03-11T13:41:33Z</td>
</tr>
<tr>
<td>3</td>
<td>Acta Médica Costarricense</td>
<td>http://actamedica.medicos.cr/index.php/Acta_Me...</td>
<td>https://doaj.org/toc/a5919aee5ad2413a89cf32df0...</td>
<td>NaN</td>
<td>0001-6012</td>
<td>2215-5856</td>
<td>medicine, public health, medical sciences, health</td>
<td>English, Spanish</td>
<td>Colegio de Médicos y Cirujanos de Costa Rica</td>
<td>Costa Rica</td>
<td>NaN</td>
<td>NaN</td>
<td>CC BY-NC-SA</td>
<td>NaN</td>
<td>http://actamedica.medicos.cr/index.php/Acta_Me...</td>
<td>NaN</td>
<td>NaN</td>
<td>No</td>
<td>http://actamedica.medicos.cr/index.php/Acta_Me...</td>
<td>Double blind peer review</td>
<td>http://actamedica.medicos.cr/index.php/Acta_Me...</td>
<td>Yes</td>
<td>http://actamedica.medicos.cr/index.php/Acta_Me...</td>
<td>http://actamedica.medicos.cr/index.php/Acta_Me...</td>
<td>http://actamedica.medicos.cr/index.php/Acta_Me...</td>
<td>http://actamedica.medicos.cr/index.php/Acta_Me...</td>
<td>12</td>
<td>No</td>
<td>http://actamedica.medicos.cr/index.php/Acta_Me...</td>
<td>NaN</td>
<td>No</td>
<td>NaN</td>
<td>No</td>
<td>NaN</td>
<td>PKP PN</td>
<td>NaN</td>
<td>http://actamedica.medicos.cr/index.php/Acta_Me...</td>
<td>Sherpa/Romeo</td>
<td>http://actamedica.medicos.cr/index.php/Acta_Me...</td>
<td>NaN</td>
<td>No</td>
<td>No</td>
<td>Yes</td>
<td>http://actamedica.medicos.cr/index.php/Acta_Me...</td>
<td>NaN</td>
<td>NaN</td>
<td>R</td>
<td>Medicine</td>
<td>No</td>
<td>2020-12-22T11:08:24Z</td>
<td>2020-12-22T11:08:24Z</td>
<td>1207</td>
<td>2015-12-08T15:06:43Z</td>
</tr>
<tr>
<td>4</td>
<td>Acta Mycologica</td>
<td>https://pbsociety.org.pl/journals/index.php/am...</td>
<td>https://doaj.org/toc/0e8e2531ae3f455ebb49acb08...</td>
<td>NaN</td>
<td>0001-625X</td>
<td>2353-074X</td>
<td>mycology, micromycetes, marcomycetes, slime mo...</td>
<td>English</td>
<td>Polish Botanical Society</td>
<td>Poland</td>
<td>NaN</td>
<td>NaN</td>
<td>CC BY</td>
<td>NaN</td>
<td>https://pbsociety.org.pl/journals/index.php/am...</td>
<td>Yes</td>
<td>https://doi.org/10.5586/am.5511</td>
<td>Yes</td>
<td>https://pbsociety.org.pl/journals/index.php/am...</td>
<td>Double blind peer review</td>
<td>https://pbsociety.org.pl/journals/index.php/am...</td>
<td>Yes</td>
<td>https://pbsociety.org.pl/journals/index.php/am...</td>
<td>https://pbsociety.org.pl/journals/index.php/am...</td>
<td>https://pbsociety.org.pl/journals/index.php/am...</td>
<td>https://pbsociety.org.pl/journals/index.php/am...</td>
<td>16</td>
<td>Yes</td>
<td>https://pbsociety.org.pl/journals/index.php/am...</td>
<td>400 EUR</td>
<td>No</td>
<td>NaN</td>
<td>No</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>Sherpa/Romeo</td>
<td>https://v2.sherpa.ac.uk/id/publication/25478</td>
<td>DOI</td>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>https://pbsociety.org.pl/journals/index.php/am...</td>
<td>NaN</td>
<td>NaN</td>
<td>QH301-705.5</td>
<td>Science: Biology (General)</td>
<td>No</td>
<td>2014-05-29T20:02:32Z</td>
<td>2021-01-16T17:41:32Z</td>
<td>1154</td>
<td>2021-03-05T18:55:46Z</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>16024</td>
<td>BME Frontiers</td>
<td>https://spj.sciencemag.org/bmef</td>
<td>https://doaj.org/toc/f9fa881c1be5443a86ed71c2e...</td>
<td>Biomedical Engineering Frontiers</td>
<td>NaN</td>
<td>2765-8031</td>
<td>biomedical imaging, biomedical devices, biomat...</td>
<td>English</td>
<td>American Association for the Advancement of Sc...</td>
<td>United States</td>
<td>Suzhou Institute of Biomedical Engineering and...</td>
<td>China</td>
<td>CC BY</td>
<td>NaN</td>
<td>https://spj.sciencemag.org/bmef/guidelines/#co...</td>
<td>Yes</td>
<td>https://spj.sciencemag.org/journals/bmef/2020/...</td>
<td>No</td>
<td>https://spj.sciencemag.org/bmef/guidelines/#co...</td>
<td>Blind peer review</td>
<td>https://spj.sciencemag.org/bmef/peer-review-pr...</td>
<td>Yes</td>
<td>https://spj.sciencemag.org/bmef/publication-et...</td>
<td>https://spj.sciencemag.org/bmef/about/#mission...</td>
<td>https://spj.sciencemag.org/bmef/editors/</td>
<td>https://spj.sciencemag.org/bmef/guidelines/</td>
<td>16</td>
<td>No</td>
<td>https://spj.sciencemag.org/bmef/apc/</td>
<td>NaN</td>
<td>Yes</td>
<td>https://spj.sciencemag.org/bmef/apc/</td>
<td>No</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>DOI</td>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>https://spj.sciencemag.org/bmef/about/</td>
<td>NaN</td>
<td>NaN</td>
<td>R855-855.5|TP248.13-248.65</td>
<td>Medicine: Medicine (General): Medical technolo...</td>
<td>No</td>
<td>2021-01-22T11:54:20Z</td>
<td>2021-01-22T11:54:20Z</td>
<td>11</td>
<td>2021-03-08T09:06:36Z</td>
</tr>
<tr>
<td>16025</td>
<td>Harvard Kennedy School Misinformation Review</td>
<td>https://misinforeview.hks.harvard.edu</td>
<td>https://doaj.org/toc/d71096ec7090499681cc0ccf8...</td>
<td>HKS Misinformation Review</td>
<td>NaN</td>
<td>2766-1652</td>
<td>misinformation, disinformation, fake news</td>
<td>English</td>
<td>Harvard Kennedy School</td>
<td>United States</td>
<td>NaN</td>
<td>NaN</td>
<td>CC BY</td>
<td>NaN</td>
<td>https://misinforeview.hks.harvard.edu/editoria...</td>
<td>Yes</td>
<td>https://misinforeview.hks.harvard.edu/article/...</td>
<td>Yes</td>
<td>https://misinforeview.hks.harvard.edu/editoria...</td>
<td>Double blind peer review</td>
<td>https://misinforeview.hks.harvard.edu/editoria...</td>
<td>No</td>
<td>NaN</td>
<td>https://misinforeview.hks.harvard.edu/our-miss...</td>
<td>https://misinforeview.hks.harvard.edu/editoria...</td>
<td>https://misinforeview.hks.harvard.edu/submit/</td>
<td>10</td>
<td>No</td>
<td>https://misinforeview.hks.harvard.edu/editoria...</td>
<td>NaN</td>
<td>No</td>
<td>NaN</td>
<td>No</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>DOI</td>
<td>Yes</td>
<td>No</td>
<td>Yes</td>
<td>https://misinforeview.hks.harvard.edu/editoria...</td>
<td>NaN</td>
<td>NaN</td>
<td>T58.5-58.64|P87-96</td>
<td>Technology: Technology (General): Industrial e...</td>
<td>No</td>
<td>2021-02-12T10:29:21Z</td>
<td>2021-02-12T10:29:21Z</td>
<td>0</td>
<td>NaN</td>
</tr>
<tr>
<td>16026</td>
<td>One Health &amp; Risk Management</td>
<td>https://journal.ohrm.bba.md/index.php/journal-...</td>
<td>https://doaj.org/toc/68671b966cd24a0ebaa44d78f...</td>
<td>OH&amp;RM</td>
<td>2887-3458</td>
<td>2587-3466</td>
<td>one health, risc management, public health, hu...</td>
<td>English, Romanian, French, Russian</td>
<td>Asociatia de Biosiguranta si Biosecuritate</td>
<td>Moldova, Republic of</td>
<td>NaN</td>
<td>NaN</td>
<td>CC BY</td>
<td>NaN</td>
<td>https://journal.ohrm.bba.md/index.php/journal-...</td>
<td>Yes</td>
<td>https://journal.ohrm.bba.md/index.php/journal-...</td>
<td>Yes</td>
<td>https://journal.ohrm.bba.md/index.php/journal-...</td>
<td>Double blind peer review</td>
<td>https://journal.ohrm.bba.md/index.php/journal-...</td>
<td>No</td>
<td>NaN</td>
<td>https://journal.ohrm.bba.md/index.php/journal-...</td>
<td>https://journal.ohrm.bba.md/index.php/journal-...</td>
<td>https://journal.ohrm.bba.md/index.php/journal-...</td>
<td>10</td>
<td>No</td>
<td>https://journal.ohrm.bba.md/index.php/journal-...</td>
<td>NaN</td>
<td>No</td>
<td>NaN</td>
<td>No</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>DOI, UDC</td>
<td>Yes</td>
<td>No</td>
<td>Yes</td>
<td>https://journal.ohrm.bba.md/index.php/journal-...</td>
<td>NaN</td>
<td>NaN</td>
<td>R|Q</td>
<td>Medicine | Science</td>
<td>No</td>
<td>2021-03-04T16:06:58Z</td>
<td>2021-03-04T16:06:58Z</td>
<td>4</td>
<td>2021-03-04T20:46:57Z</td>
</tr>
<tr>
<td>16027</td>
<td>فصلنامه پژوهش‌های مدیریت منابع انسانی</td>
<td>https://hrmj.ihu.ac.ir/?lang=en</td>
<td>https://doaj.org/toc/87d44ffb6ff849b18d5ddce9c...</td>
<td>Journal of Research in Human Resources Management</td>
<td>8254-8002</td>
<td>2645-5072</td>
<td>human resources management</td>
<td>Persian</td>
<td>Imam Hussein University</td>
<td>Iran, Islamic Republic of</td>
<td>NaN</td>
<td>NaN</td>
<td>CC BY</td>
<td>NaN</td>
<td>https://hrmj.ihu.ac.ir/journal/about?lang=en</td>
<td>NaN</td>
<td>NaN</td>
<td>Yes</td>
<td>https://hrmj.ihu.ac.ir/journal/about?lang=en</td>
<td>Double blind peer review</td>
<td>https://hrmj.ihu.ac.ir/journal/process?lang=en</td>
<td>No</td>
<td>NaN</td>
<td>https://hrmj.ihu.ac.ir/journal/aim_scope?lang=en</td>
<td>https://hrmj.ihu.ac.ir/journal/editorial.board...</td>
<td>https://hrmj.ihu.ac.ir/journal/authors.note?la...</td>
<td>20</td>
<td>No</td>
<td>https://hrmj.ihu.ac.ir/?lang=en</td>
<td>NaN</td>
<td>No</td>
<td>NaN</td>
<td>No</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>No</td>
<td>No</td>
<td>Yes</td>
<td>https://hrmj.ihu.ac.ir/?lang=en</td>
<td>NaN</td>
<td>NaN</td>
<td>HF5549-5549.5</td>
<td>Social Sciences: Commerce: Business: Personnel...</td>
<td>No</td>
<td>2021-01-20T11:27:05Z</td>
<td>2021-01-20T11:27:05Z</td>
<td>0</td>
<td>NaN</td>
</tr>
<tr>
<td>16028</td>
<td>Science of Tsunami Hazards</td>
<td>http://tsunamisociety.org/</td>
<td>https://doaj.org/toc/a4f06be11f4f4db489dc034c7...</td>
<td>NaN</td>
<td>8755-6839</td>
<td>NaN</td>
<td>tsunamis, tsunami warning systems, earthquakes...</td>
<td>English</td>
<td>Tsunami Society International</td>
<td>United States</td>
<td>Tsunami Society International</td>
<td>NaN</td>
<td>CC BY</td>
<td>NaN</td>
<td>http://tsunamisociety.org/InstructionsAuthors....</td>
<td>NaN</td>
<td>NaN</td>
<td>No</td>
<td>NaN</td>
<td>Peer review</td>
<td>http://tsunamisociety.org/PeerReview.html</td>
<td>No</td>
<td>NaN</td>
<td>http://tsunamisociety.org/AboutUs.html</td>
<td>http://tsunamisociety.org/EditorialBoard.html</td>
<td>http://tsunamisociety.org/InstructionsAuthors....</td>
<td>12</td>
<td>No</td>
<td>http://tsunamisociety.org/InstructionsAuthors....</td>
<td>NaN</td>
<td>No</td>
<td>NaN</td>
<td>Yes</td>
<td>http://tsunamisociety.org/InstructionsAuthors....</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>Yes</td>
<td>http://tsunamisociety.org/AboutUs.html</td>
<td>NaN</td>
<td>NaN</td>
<td>GC1-1581</td>
<td>Geography. Anthropology. Recreation: Oceanography</td>
<td>No</td>
<td>2009-04-16T17:40:30Z</td>
<td>2016-07-21T16:09:38Z</td>
<td>239</td>
<td>2021-02-27T01:00:51Z</td>
</tr>
</tbody>
</table>
<p>16029 rows × 53 columns</p>
</div>
```python
# garder les lignes avec APC
doaj_apc = doaj.loc[doaj['APC'] == 'Yes'][['Journal ISSN (print version)', 'Journal EISSN (online version)', 'APC amount']]
doaj_apc
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>Journal ISSN (print version)</th>
<th>Journal EISSN (online version)</th>
<th>APC amount</th>
</tr>
</thead>
<tbody>
<tr>
<td>2</td>
<td>0001-5555</td>
<td>1651-2057</td>
<td>1600 EUR</td>
</tr>
<tr>
<td>4</td>
<td>0001-625X</td>
<td>2353-074X</td>
<td>400 EUR</td>
</tr>
<tr>
<td>5</td>
<td>0001-6918</td>
<td>1873-6297</td>
<td>1500 USD</td>
</tr>
<tr>
<td>6</td>
<td>0001-6977</td>
<td>2083-9480</td>
<td>520 EUR</td>
</tr>
<tr>
<td>11</td>
<td>0003-1062</td>
<td>2327-9788</td>
<td>3500 USD</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>16002</td>
<td>NaN</td>
<td>2722-1253</td>
<td>200 USD</td>
</tr>
<tr>
<td>16004</td>
<td>NaN</td>
<td>2722-7235</td>
<td>35 USD</td>
</tr>
<tr>
<td>16005</td>
<td>2722-9688</td>
<td>2722-9696</td>
<td>500000 IDR</td>
</tr>
<tr>
<td>16007</td>
<td>NaN</td>
<td>2723-1097</td>
<td>100000 IDR</td>
</tr>
<tr>
<td>16022</td>
<td>2765-0189</td>
<td>2765-0235</td>
<td>700 USD</td>
</tr>
</tbody>
</table>
<p>4462 rows × 3 columns</p>
</div>
```python
# garder les lignes avec APC no
doaj_apc_no = doaj.loc[doaj['APC'] == 'No'][['Journal ISSN (print version)', 'Journal EISSN (online version)']]
doaj_apc_no
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>Journal ISSN (print version)</th>
<th>Journal EISSN (online version)</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0001-3765</td>
<td>1678-2690</td>
</tr>
<tr>
<td>1</td>
<td>0001-494X</td>
<td>2282-0035</td>
</tr>
<tr>
<td>3</td>
<td>0001-6012</td>
<td>2215-5856</td>
</tr>
<tr>
<td>7</td>
<td>0001-7019</td>
<td>1846-0410</td>
</tr>
<tr>
<td>8</td>
<td>0002-0397</td>
<td>1868-6869</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>16024</td>
<td>NaN</td>
<td>2765-8031</td>
</tr>
<tr>
<td>16025</td>
<td>NaN</td>
<td>2766-1652</td>
</tr>
<tr>
<td>16026</td>
<td>2887-3458</td>
<td>2587-3466</td>
</tr>
<tr>
<td>16027</td>
<td>8254-8002</td>
<td>2645-5072</td>
</tr>
<tr>
<td>16028</td>
<td>8755-6839</td>
<td>NaN</td>
</tr>
</tbody>
</table>
<p>11567 rows × 2 columns</p>
</div>
```python
# attribuer la valeur 0
doaj_apc_no['APC amount'] = 0
doaj_apc_no
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>Journal ISSN (print version)</th>
<th>Journal EISSN (online version)</th>
<th>APC amount</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0001-3765</td>
<td>1678-2690</td>
<td>0</td>
</tr>
<tr>
<td>1</td>
<td>0001-494X</td>
<td>2282-0035</td>
<td>0</td>
</tr>
<tr>
<td>3</td>
<td>0001-6012</td>
<td>2215-5856</td>
<td>0</td>
</tr>
<tr>
<td>7</td>
<td>0001-7019</td>
<td>1846-0410</td>
<td>0</td>
</tr>
<tr>
<td>8</td>
<td>0002-0397</td>
<td>1868-6869</td>
<td>0</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>16024</td>
<td>NaN</td>
<td>2765-8031</td>
<td>0</td>
</tr>
<tr>
<td>16025</td>
<td>NaN</td>
<td>2766-1652</td>
<td>0</td>
</tr>
<tr>
<td>16026</td>
<td>2887-3458</td>
<td>2587-3466</td>
<td>0</td>
</tr>
<tr>
<td>16027</td>
<td>8254-8002</td>
<td>2645-5072</td>
<td>0</td>
</tr>
<tr>
<td>16028</td>
<td>8755-6839</td>
<td>NaN</td>
<td>0</td>
</tr>
</tbody>
</table>
<p>11567 rows × 3 columns</p>
</div>
```python
# ajout à la table des APC
doaj_apc = doaj_apc.append(doaj_apc_no, ignore_index=True)
doaj_apc
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>Journal ISSN (print version)</th>
<th>Journal EISSN (online version)</th>
<th>APC amount</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0001-5555</td>
<td>1651-2057</td>
<td>1600 EUR</td>
</tr>
<tr>
<td>1</td>
<td>0001-625X</td>
<td>2353-074X</td>
<td>400 EUR</td>
</tr>
<tr>
<td>2</td>
<td>0001-6918</td>
<td>1873-6297</td>
<td>1500 USD</td>
</tr>
<tr>
<td>3</td>
<td>0001-6977</td>
<td>2083-9480</td>
<td>520 EUR</td>
</tr>
<tr>
<td>4</td>
<td>0003-1062</td>
<td>2327-9788</td>
<td>3500 USD</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>16024</td>
<td>NaN</td>
<td>2765-8031</td>
<td>0</td>
</tr>
<tr>
<td>16025</td>
<td>NaN</td>
<td>2766-1652</td>
<td>0</td>
</tr>
<tr>
<td>16026</td>
<td>2887-3458</td>
<td>2587-3466</td>
<td>0</td>
</tr>
<tr>
<td>16027</td>
<td>8254-8002</td>
<td>2645-5072</td>
<td>0</td>
</tr>
<tr>
<td>16028</td>
<td>8755-6839</td>
<td>NaN</td>
<td>0</td>
</tr>
</tbody>
</table>
<p>16029 rows × 3 columns</p>
</div>
```python
# découpage du prix en 'amount' et 'symbol'
doaj_apc[['amount', 'symbol']] = doaj_apc['APC amount'].str.split(' ', n=1, expand=True)
doaj_apc
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>Journal ISSN (print version)</th>
<th>Journal EISSN (online version)</th>
<th>APC amount</th>
<th>amount</th>
<th>symbol</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0001-5555</td>
<td>1651-2057</td>
<td>1600 EUR</td>
<td>1600</td>
<td>EUR</td>
</tr>
<tr>
<td>1</td>
<td>0001-625X</td>
<td>2353-074X</td>
<td>400 EUR</td>
<td>400</td>
<td>EUR</td>
</tr>
<tr>
<td>2</td>
<td>0001-6918</td>
<td>1873-6297</td>
<td>1500 USD</td>
<td>1500</td>
<td>USD</td>
</tr>
<tr>
<td>3</td>
<td>0001-6977</td>
<td>2083-9480</td>
<td>520 EUR</td>
<td>520</td>
<td>EUR</td>
</tr>
<tr>
<td>4</td>
<td>0003-1062</td>
<td>2327-9788</td>
<td>3500 USD</td>
<td>3500</td>
<td>USD</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>16024</td>
<td>NaN</td>
<td>2765-8031</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>16025</td>
<td>NaN</td>
<td>2766-1652</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>16026</td>
<td>2887-3458</td>
<td>2587-3466</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>16027</td>
<td>8254-8002</td>
<td>2645-5072</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>16028</td>
<td>8755-6839</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
</tr>
</tbody>
</table>
<p>16029 rows × 5 columns</p>
</div>
```python
doaj_apc.loc[doaj_apc['APC amount'] == 0, 'amount'] = 0
doaj_apc.loc[doaj_apc['APC amount'] == 0, 'symbol'] = ''
doaj_apc
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>Journal ISSN (print version)</th>
<th>Journal EISSN (online version)</th>
<th>APC amount</th>
<th>amount</th>
<th>symbol</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0001-5555</td>
<td>1651-2057</td>
<td>1600 EUR</td>
<td>1600</td>
<td>EUR</td>
</tr>
<tr>
<td>1</td>
<td>0001-625X</td>
<td>2353-074X</td>
<td>400 EUR</td>
<td>400</td>
<td>EUR</td>
</tr>
<tr>
<td>2</td>
<td>0001-6918</td>
<td>1873-6297</td>
<td>1500 USD</td>
<td>1500</td>
<td>USD</td>
</tr>
<tr>
<td>3</td>
<td>0001-6977</td>
<td>2083-9480</td>
<td>520 EUR</td>
<td>520</td>
<td>EUR</td>
</tr>
<tr>
<td>4</td>
<td>0003-1062</td>
<td>2327-9788</td>
<td>3500 USD</td>
<td>3500</td>
<td>USD</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>16024</td>
<td>NaN</td>
<td>2765-8031</td>
<td>0</td>
<td>0</td>
<td></td>
</tr>
<tr>
<td>16025</td>
<td>NaN</td>
<td>2766-1652</td>
<td>0</td>
<td>0</td>
<td></td>
</tr>
<tr>
<td>16026</td>
<td>2887-3458</td>
<td>2587-3466</td>
<td>0</td>
<td>0</td>
<td></td>
</tr>
<tr>
<td>16027</td>
<td>8254-8002</td>
<td>2645-5072</td>
<td>0</td>
<td>0</td>
<td></td>
</tr>
<tr>
<td>16028</td>
<td>8755-6839</td>
<td>NaN</td>
<td>0</td>
<td>0</td>
<td></td>
</tr>
</tbody>
</table>
<p>16029 rows × 5 columns</p>
</div>
```python
# ajouter les champs manquants
doaj_apc['cost_factor_type'] = 1
doaj_apc['comment'] = 'Source: DOAJ'
doaj_apc
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>Journal ISSN (print version)</th>
<th>Journal EISSN (online version)</th>
<th>APC amount</th>
<th>amount</th>
<th>symbol</th>
<th>cost_factor_type</th>
<th>comment</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0001-5555</td>
<td>1651-2057</td>
<td>1600 EUR</td>
<td>1600</td>
<td>EUR</td>
<td>1</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>1</td>
<td>0001-625X</td>
<td>2353-074X</td>
<td>400 EUR</td>
<td>400</td>
<td>EUR</td>
<td>1</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>2</td>
<td>0001-6918</td>
<td>1873-6297</td>
<td>1500 USD</td>
<td>1500</td>
<td>USD</td>
<td>1</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>3</td>
<td>0001-6977</td>
<td>2083-9480</td>
<td>520 EUR</td>
<td>520</td>
<td>EUR</td>
<td>1</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>4</td>
<td>0003-1062</td>
<td>2327-9788</td>
<td>3500 USD</td>
<td>3500</td>
<td>USD</td>
<td>1</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>16024</td>
<td>NaN</td>
<td>2765-8031</td>
<td>0</td>
<td>0</td>
<td></td>
<td>1</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>16025</td>
<td>NaN</td>
<td>2766-1652</td>
<td>0</td>
<td>0</td>
<td></td>
<td>1</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>16026</td>
<td>2887-3458</td>
<td>2587-3466</td>
<td>0</td>
<td>0</td>
<td></td>
<td>1</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>16027</td>
<td>8254-8002</td>
<td>2645-5072</td>
<td>0</td>
<td>0</td>
<td></td>
<td>1</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>16028</td>
<td>8755-6839</td>
<td>NaN</td>
<td>0</td>
<td>0</td>
<td></td>
<td>1</td>
<td>Source: DOAJ</td>
</tr>
</tbody>
</table>
<p>16029 rows × 7 columns</p>
</div>
```python
# renommer les champs
doaj_apc = doaj_apc.rename(columns = {'Journal ISSN (print version)' : 'issn_print', 'Journal EISSN (online version)' : 'issn_electronic'})
doaj_apc
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>issn_print</th>
<th>issn_electronic</th>
<th>APC amount</th>
<th>amount</th>
<th>symbol</th>
<th>cost_factor_type</th>
<th>comment</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0001-5555</td>
<td>1651-2057</td>
<td>1600 EUR</td>
<td>1600</td>
<td>EUR</td>
<td>1</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>1</td>
<td>0001-625X</td>
<td>2353-074X</td>
<td>400 EUR</td>
<td>400</td>
<td>EUR</td>
<td>1</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>2</td>
<td>0001-6918</td>
<td>1873-6297</td>
<td>1500 USD</td>
<td>1500</td>
<td>USD</td>
<td>1</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>3</td>
<td>0001-6977</td>
<td>2083-9480</td>
<td>520 EUR</td>
<td>520</td>
<td>EUR</td>
<td>1</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>4</td>
<td>0003-1062</td>
<td>2327-9788</td>
<td>3500 USD</td>
<td>3500</td>
<td>USD</td>
<td>1</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>16024</td>
<td>NaN</td>
<td>2765-8031</td>
<td>0</td>
<td>0</td>
<td></td>
<td>1</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>16025</td>
<td>NaN</td>
<td>2766-1652</td>
<td>0</td>
<td>0</td>
<td></td>
<td>1</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>16026</td>
<td>2887-3458</td>
<td>2587-3466</td>
<td>0</td>
<td>0</td>
<td></td>
<td>1</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>16027</td>
<td>8254-8002</td>
<td>2645-5072</td>
<td>0</td>
<td>0</td>
<td></td>
<td>1</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>16028</td>
<td>8755-6839</td>
<td>NaN</td>
<td>0</td>
<td>0</td>
<td></td>
<td>1</td>
<td>Source: DOAJ</td>
</tr>
</tbody>
</table>
<p>16029 rows × 7 columns</p>
</div>
```python
# ajout du issn
doaj_apc['issn'] = doaj_apc['issn_electronic']
doaj_apc
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>issn_print</th>
<th>issn_electronic</th>
<th>APC amount</th>
<th>amount</th>
<th>symbol</th>
<th>cost_factor_type</th>
<th>comment</th>
<th>issn</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0001-5555</td>
<td>1651-2057</td>
<td>1600 EUR</td>
<td>1600</td>
<td>EUR</td>
<td>1</td>
<td>Source: DOAJ</td>
<td>1651-2057</td>
</tr>
<tr>
<td>1</td>
<td>0001-625X</td>
<td>2353-074X</td>
<td>400 EUR</td>
<td>400</td>
<td>EUR</td>
<td>1</td>
<td>Source: DOAJ</td>
<td>2353-074X</td>
</tr>
<tr>
<td>2</td>
<td>0001-6918</td>
<td>1873-6297</td>
<td>1500 USD</td>
<td>1500</td>
<td>USD</td>
<td>1</td>
<td>Source: DOAJ</td>
<td>1873-6297</td>
</tr>
<tr>
<td>3</td>
<td>0001-6977</td>
<td>2083-9480</td>
<td>520 EUR</td>
<td>520</td>
<td>EUR</td>
<td>1</td>
<td>Source: DOAJ</td>
<td>2083-9480</td>
</tr>
<tr>
<td>4</td>
<td>0003-1062</td>
<td>2327-9788</td>
<td>3500 USD</td>
<td>3500</td>
<td>USD</td>
<td>1</td>
<td>Source: DOAJ</td>
<td>2327-9788</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>16024</td>
<td>NaN</td>
<td>2765-8031</td>
<td>0</td>
<td>0</td>
<td></td>
<td>1</td>
<td>Source: DOAJ</td>
<td>2765-8031</td>
</tr>
<tr>
<td>16025</td>
<td>NaN</td>
<td>2766-1652</td>
<td>0</td>
<td>0</td>
<td></td>
<td>1</td>
<td>Source: DOAJ</td>
<td>2766-1652</td>
</tr>
<tr>
<td>16026</td>
<td>2887-3458</td>
<td>2587-3466</td>
<td>0</td>
<td>0</td>
<td></td>
<td>1</td>
<td>Source: DOAJ</td>
<td>2587-3466</td>
</tr>
<tr>
<td>16027</td>
<td>8254-8002</td>
<td>2645-5072</td>
<td>0</td>
<td>0</td>
<td></td>
<td>1</td>
<td>Source: DOAJ</td>
<td>2645-5072</td>
</tr>
<tr>
<td>16028</td>
<td>8755-6839</td>
<td>NaN</td>
<td>0</td>
<td>0</td>
<td></td>
<td>1</td>
<td>Source: DOAJ</td>
<td>NaN</td>
</tr>
</tbody>
</table>
<p>16029 rows × 8 columns</p>
</div>
```python
doaj_apc.loc[doaj_apc['issn'].isna()]
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>issn_print</th>
<th>issn_electronic</th>
<th>APC amount</th>
<th>amount</th>
<th>symbol</th>
<th>cost_factor_type</th>
<th>comment</th>
<th>issn</th>
</tr>
</thead>
<tbody>
<tr>
<td>12</td>
<td>0013-9998</td>
<td>NaN</td>
<td>350 EUR</td>
<td>350</td>
<td>EUR</td>
<td>1</td>
<td>Source: DOAJ</td>
<td>NaN</td>
</tr>
<tr>
<td>14</td>
<td>0015-4040</td>
<td>NaN</td>
<td>747 USD</td>
<td>747</td>
<td>USD</td>
<td>1</td>
<td>Source: DOAJ</td>
<td>NaN</td>
</tr>
<tr>
<td>17</td>
<td>0017-0011</td>
<td>NaN</td>
<td>400 EUR</td>
<td>400</td>
<td>EUR</td>
<td>1</td>
<td>Source: DOAJ</td>
<td>NaN</td>
</tr>
<tr>
<td>29</td>
<td>0026-1165</td>
<td>NaN</td>
<td>220000 JPY</td>
<td>220000</td>
<td>JPY</td>
<td>1</td>
<td>Source: DOAJ</td>
<td>NaN</td>
</tr>
<tr>
<td>30</td>
<td>0026-279X</td>
<td>NaN</td>
<td>350 USD</td>
<td>350</td>
<td>USD</td>
<td>1</td>
<td>Source: DOAJ</td>
<td>NaN</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>15867</td>
<td>2676-5357</td>
<td>NaN</td>
<td>0</td>
<td>0</td>
<td></td>
<td>1</td>
<td>Source: DOAJ</td>
<td>NaN</td>
</tr>
<tr>
<td>15892</td>
<td>2686-9594</td>
<td>NaN</td>
<td>0</td>
<td>0</td>
<td></td>
<td>1</td>
<td>Source: DOAJ</td>
<td>NaN</td>
</tr>
<tr>
<td>15937</td>
<td>2701-1569</td>
<td>NaN</td>
<td>0</td>
<td>0</td>
<td></td>
<td>1</td>
<td>Source: DOAJ</td>
<td>NaN</td>
</tr>
<tr>
<td>15974</td>
<td>2709-8370</td>
<td>NaN</td>
<td>0</td>
<td>0</td>
<td></td>
<td>1</td>
<td>Source: DOAJ</td>
<td>NaN</td>
</tr>
<tr>
<td>16028</td>
<td>8755-6839</td>
<td>NaN</td>
<td>0</td>
<td>0</td>
<td></td>
<td>1</td>
<td>Source: DOAJ</td>
<td>NaN</td>
</tr>
</tbody>
</table>
<p>1461 rows × 8 columns</p>
</div>
```python
# ajout du issnp quand c'est vide
doaj_apc.loc[doaj_apc['issn'].isna(), 'issn'] = doaj_apc['issn_print']
doaj_apc.loc[doaj_apc['issn'].isna()]
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>issn_print</th>
<th>issn_electronic</th>
<th>APC amount</th>
<th>amount</th>
<th>symbol</th>
<th>cost_factor_type</th>
<th>comment</th>
<th>issn</th>
</tr>
</thead>
<tbody>
</tbody>
</table>
</div>
```python
doaj_apc = pd.merge(doaj_apc, issns, on='issn', how='left')
doaj_apc
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>issn_print</th>
<th>issn_electronic</th>
<th>APC amount</th>
<th>amount</th>
<th>symbol</th>
<th>cost_factor_type</th>
<th>comment</th>
<th>issn</th>
<th>issnl</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0001-5555</td>
<td>1651-2057</td>
<td>1600 EUR</td>
<td>1600</td>
<td>EUR</td>
<td>1</td>
<td>Source: DOAJ</td>
<td>1651-2057</td>
<td>0001-5555</td>
</tr>
<tr>
<td>1</td>
<td>0001-625X</td>
<td>2353-074X</td>
<td>400 EUR</td>
<td>400</td>
<td>EUR</td>
<td>1</td>
<td>Source: DOAJ</td>
<td>2353-074X</td>
<td>0001-625X</td>
</tr>
<tr>
<td>2</td>
<td>0001-6918</td>
<td>1873-6297</td>
<td>1500 USD</td>
<td>1500</td>
<td>USD</td>
<td>1</td>
<td>Source: DOAJ</td>
<td>1873-6297</td>
<td>0001-6918</td>
</tr>
<tr>
<td>3</td>
<td>0001-6977</td>
<td>2083-9480</td>
<td>520 EUR</td>
<td>520</td>
<td>EUR</td>
<td>1</td>
<td>Source: DOAJ</td>
<td>2083-9480</td>
<td>0001-6977</td>
</tr>
<tr>
<td>4</td>
<td>0003-1062</td>
<td>2327-9788</td>
<td>3500 USD</td>
<td>3500</td>
<td>USD</td>
<td>1</td>
<td>Source: DOAJ</td>
<td>2327-9788</td>
<td>0003-1062</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>16024</td>
<td>NaN</td>
<td>2765-8031</td>
<td>0</td>
<td>0</td>
<td></td>
<td>1</td>
<td>Source: DOAJ</td>
<td>2765-8031</td>
<td>NaN</td>
</tr>
<tr>
<td>16025</td>
<td>NaN</td>
<td>2766-1652</td>
<td>0</td>
<td>0</td>
<td></td>
<td>1</td>
<td>Source: DOAJ</td>
<td>2766-1652</td>
<td>NaN</td>
</tr>
<tr>
<td>16026</td>
<td>2887-3458</td>
<td>2587-3466</td>
<td>0</td>
<td>0</td>
<td></td>
<td>1</td>
<td>Source: DOAJ</td>
<td>2587-3466</td>
<td>NaN</td>
</tr>
<tr>
<td>16027</td>
<td>8254-8002</td>
<td>2645-5072</td>
<td>0</td>
<td>0</td>
<td></td>
<td>1</td>
<td>Source: DOAJ</td>
<td>2645-5072</td>
<td>NaN</td>
</tr>
<tr>
<td>16028</td>
<td>8755-6839</td>
<td>NaN</td>
<td>0</td>
<td>0</td>
<td></td>
<td>1</td>
<td>Source: DOAJ</td>
<td>8755-6839</td>
<td>8755-6839</td>
</tr>
</tbody>
</table>
<p>16029 rows × 9 columns</p>
</div>
```python
# renommer les colonnes
doaj_apc = doaj_apc.rename(columns={'issnl' : 'issn_link'})
doaj_apc
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>issn_print</th>
<th>issn_electronic</th>
<th>APC amount</th>
<th>amount</th>
<th>symbol</th>
<th>cost_factor_type</th>
<th>comment</th>
<th>issn</th>
<th>issn_link</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0001-5555</td>
<td>1651-2057</td>
<td>1600 EUR</td>
<td>1600</td>
<td>EUR</td>
<td>1</td>
<td>Source: DOAJ</td>
<td>1651-2057</td>
<td>0001-5555</td>
</tr>
<tr>
<td>1</td>
<td>0001-625X</td>
<td>2353-074X</td>
<td>400 EUR</td>
<td>400</td>
<td>EUR</td>
<td>1</td>
<td>Source: DOAJ</td>
<td>2353-074X</td>
<td>0001-625X</td>
</tr>
<tr>
<td>2</td>
<td>0001-6918</td>
<td>1873-6297</td>
<td>1500 USD</td>
<td>1500</td>
<td>USD</td>
<td>1</td>
<td>Source: DOAJ</td>
<td>1873-6297</td>
<td>0001-6918</td>
</tr>
<tr>
<td>3</td>
<td>0001-6977</td>
<td>2083-9480</td>
<td>520 EUR</td>
<td>520</td>
<td>EUR</td>
<td>1</td>
<td>Source: DOAJ</td>
<td>2083-9480</td>
<td>0001-6977</td>
</tr>
<tr>
<td>4</td>
<td>0003-1062</td>
<td>2327-9788</td>
<td>3500 USD</td>
<td>3500</td>
<td>USD</td>
<td>1</td>
<td>Source: DOAJ</td>
<td>2327-9788</td>
<td>0003-1062</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>16024</td>
<td>NaN</td>
<td>2765-8031</td>
<td>0</td>
<td>0</td>
<td></td>
<td>1</td>
<td>Source: DOAJ</td>
<td>2765-8031</td>
<td>NaN</td>
</tr>
<tr>
<td>16025</td>
<td>NaN</td>
<td>2766-1652</td>
<td>0</td>
<td>0</td>
<td></td>
<td>1</td>
<td>Source: DOAJ</td>
<td>2766-1652</td>
<td>NaN</td>
</tr>
<tr>
<td>16026</td>
<td>2887-3458</td>
<td>2587-3466</td>
<td>0</td>
<td>0</td>
<td></td>
<td>1</td>
<td>Source: DOAJ</td>
<td>2587-3466</td>
<td>NaN</td>
</tr>
<tr>
<td>16027</td>
<td>8254-8002</td>
<td>2645-5072</td>
<td>0</td>
<td>0</td>
<td></td>
<td>1</td>
<td>Source: DOAJ</td>
<td>2645-5072</td>
<td>NaN</td>
</tr>
<tr>
<td>16028</td>
<td>8755-6839</td>
<td>NaN</td>
<td>0</td>
<td>0</td>
<td></td>
<td>1</td>
<td>Source: DOAJ</td>
<td>8755-6839</td>
<td>8755-6839</td>
</tr>
</tbody>
</table>
<p>16029 rows × 9 columns</p>
</div>
### Ajout des APCs depuis la base Journal Database (Zurich Open Repository and Archive)
https://www.jdb.uzh.ch/
```python
# JDB base de Zurich
jdb = pd.read_csv('zora/jdb_apcs.tsv', encoding='utf-8', header=0, sep='\t')
jdb
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id</th>
<th>issn_print</th>
<th>issn_electronic</th>
<th>issn_link</th>
<th>apc_fee</th>
<th>apc_currency</th>
<th>apc_date</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>10001</td>
<td>1662-5161</td>
<td>1662-5161</td>
<td>1662-5161</td>
<td>2490</td>
<td>USD</td>
<td>2018</td>
</tr>
<tr>
<td>1</td>
<td>10001</td>
<td>1662-5161</td>
<td>1662-5161</td>
<td>1662-5161</td>
<td>2950</td>
<td>USD</td>
<td>2020</td>
</tr>
<tr>
<td>2</td>
<td>10002</td>
<td>0952-3383</td>
<td>1467-8578</td>
<td>0952-3383</td>
<td>2500</td>
<td>EUR</td>
<td>2017</td>
</tr>
<tr>
<td>3</td>
<td>10005</td>
<td>1179-7258</td>
<td>1179-7258</td>
<td>1179-7258</td>
<td>1958</td>
<td>USD</td>
<td>2018</td>
</tr>
<tr>
<td>4</td>
<td>10005</td>
<td>1179-7258</td>
<td>1179-7258</td>
<td>1179-7258</td>
<td>1958</td>
<td>USD</td>
<td>2020</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>11575</td>
<td>9986</td>
<td>1549-9634</td>
<td>1549-9642</td>
<td>1549-9634</td>
<td>3000</td>
<td>USD</td>
<td>2015</td>
</tr>
<tr>
<td>11576</td>
<td>9986</td>
<td>1549-9634</td>
<td>1549-9642</td>
<td>1549-9634</td>
<td>3550</td>
<td>USD</td>
<td>2016</td>
</tr>
<tr>
<td>11577</td>
<td>9986</td>
<td>1549-9634</td>
<td>1549-9642</td>
<td>1549-9634</td>
<td>3550</td>
<td>USD</td>
<td>2017</td>
</tr>
<tr>
<td>11578</td>
<td>9986</td>
<td>1549-9634</td>
<td>1549-9642</td>
<td>1549-9634</td>
<td>3750</td>
<td>USD</td>
<td>2018</td>
</tr>
<tr>
<td>11579</td>
<td>9995</td>
<td>0816-4649</td>
<td>1465-3303</td>
<td>0816-4649</td>
<td>2950</td>
<td>USD</td>
<td>2017</td>
</tr>
</tbody>
</table>
<p>11580 rows × 7 columns</p>
</div>
```python
# renommer l'id
jdb = jdb.rename(columns = {'id' : 'jdb_id'})
jdb
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>jdb_id</th>
<th>issn_print</th>
<th>issn_electronic</th>
<th>issn_link</th>
<th>apc_fee</th>
<th>apc_currency</th>
<th>apc_date</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>10001</td>
<td>1662-5161</td>
<td>1662-5161</td>
<td>1662-5161</td>
<td>2490</td>
<td>USD</td>
<td>2018</td>
</tr>
<tr>
<td>1</td>
<td>10001</td>
<td>1662-5161</td>
<td>1662-5161</td>
<td>1662-5161</td>
<td>2950</td>
<td>USD</td>
<td>2020</td>
</tr>
<tr>
<td>2</td>
<td>10002</td>
<td>0952-3383</td>
<td>1467-8578</td>
<td>0952-3383</td>
<td>2500</td>
<td>EUR</td>
<td>2017</td>
</tr>
<tr>
<td>3</td>
<td>10005</td>
<td>1179-7258</td>
<td>1179-7258</td>
<td>1179-7258</td>
<td>1958</td>
<td>USD</td>
<td>2018</td>
</tr>
<tr>
<td>4</td>
<td>10005</td>
<td>1179-7258</td>
<td>1179-7258</td>
<td>1179-7258</td>
<td>1958</td>
<td>USD</td>
<td>2020</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>11575</td>
<td>9986</td>
<td>1549-9634</td>
<td>1549-9642</td>
<td>1549-9634</td>
<td>3000</td>
<td>USD</td>
<td>2015</td>
</tr>
<tr>
<td>11576</td>
<td>9986</td>
<td>1549-9634</td>
<td>1549-9642</td>
<td>1549-9634</td>
<td>3550</td>
<td>USD</td>
<td>2016</td>
</tr>
<tr>
<td>11577</td>
<td>9986</td>
<td>1549-9634</td>
<td>1549-9642</td>
<td>1549-9634</td>
<td>3550</td>
<td>USD</td>
<td>2017</td>
</tr>
<tr>
<td>11578</td>
<td>9986</td>
<td>1549-9634</td>
<td>1549-9642</td>
<td>1549-9634</td>
<td>3750</td>
<td>USD</td>
<td>2018</td>
</tr>
<tr>
<td>11579</td>
<td>9995</td>
<td>0816-4649</td>
<td>1465-3303</td>
<td>0816-4649</td>
<td>2950</td>
<td>USD</td>
<td>2017</td>
</tr>
</tbody>
</table>
<p>11580 rows × 7 columns</p>
</div>
```python
# ajouter les champs manquants
jdb['cost_factor_type'] = 1
jdb['comment'] = 'Source: JDB (' + jdb['apc_date'].astype(str) + ')'
jdb
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>jdb_id</th>
<th>issn_print</th>
<th>issn_electronic</th>
<th>issn_link</th>
<th>apc_fee</th>
<th>apc_currency</th>
<th>apc_date</th>
<th>cost_factor_type</th>
<th>comment</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>10001</td>
<td>1662-5161</td>
<td>1662-5161</td>
<td>1662-5161</td>
<td>2490</td>
<td>USD</td>
<td>2018</td>
<td>1</td>
<td>Source: JDB (2018)</td>
</tr>
<tr>
<td>1</td>
<td>10001</td>
<td>1662-5161</td>
<td>1662-5161</td>
<td>1662-5161</td>
<td>2950</td>
<td>USD</td>
<td>2020</td>
<td>1</td>
<td>Source: JDB (2020)</td>
</tr>
<tr>
<td>2</td>
<td>10002</td>
<td>0952-3383</td>
<td>1467-8578</td>
<td>0952-3383</td>
<td>2500</td>
<td>EUR</td>
<td>2017</td>
<td>1</td>
<td>Source: JDB (2017)</td>
</tr>
<tr>
<td>3</td>
<td>10005</td>
<td>1179-7258</td>
<td>1179-7258</td>
<td>1179-7258</td>
<td>1958</td>
<td>USD</td>
<td>2018</td>
<td>1</td>
<td>Source: JDB (2018)</td>
</tr>
<tr>
<td>4</td>
<td>10005</td>
<td>1179-7258</td>
<td>1179-7258</td>
<td>1179-7258</td>
<td>1958</td>
<td>USD</td>
<td>2020</td>
<td>1</td>
<td>Source: JDB (2020)</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>11575</td>
<td>9986</td>
<td>1549-9634</td>
<td>1549-9642</td>
<td>1549-9634</td>
<td>3000</td>
<td>USD</td>
<td>2015</td>
<td>1</td>
<td>Source: JDB (2015)</td>
</tr>
<tr>
<td>11576</td>
<td>9986</td>
<td>1549-9634</td>
<td>1549-9642</td>
<td>1549-9634</td>
<td>3550</td>
<td>USD</td>
<td>2016</td>
<td>1</td>
<td>Source: JDB (2016)</td>
</tr>
<tr>
<td>11577</td>
<td>9986</td>
<td>1549-9634</td>
<td>1549-9642</td>
<td>1549-9634</td>
<td>3550</td>
<td>USD</td>
<td>2017</td>
<td>1</td>
<td>Source: JDB (2017)</td>
</tr>
<tr>
<td>11578</td>
<td>9986</td>
<td>1549-9634</td>
<td>1549-9642</td>
<td>1549-9634</td>
<td>3750</td>
<td>USD</td>
<td>2018</td>
<td>1</td>
<td>Source: JDB (2018)</td>
</tr>
<tr>
<td>11579</td>
<td>9995</td>
<td>0816-4649</td>
<td>1465-3303</td>
<td>0816-4649</td>
<td>2950</td>
<td>USD</td>
<td>2017</td>
<td>1</td>
<td>Source: JDB (2017)</td>
</tr>
</tbody>
</table>
<p>11580 rows × 9 columns</p>
</div>
```python
# renommer les champs
jdb = jdb.rename(columns = {'apc_fee' : 'amount', 'apc_currency' : 'symbol'})
jdb
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>jdb_id</th>
<th>issn_print</th>
<th>issn_electronic</th>
<th>issn_link</th>
<th>amount</th>
<th>symbol</th>
<th>apc_date</th>
<th>cost_factor_type</th>
<th>comment</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>10001</td>
<td>1662-5161</td>
<td>1662-5161</td>
<td>1662-5161</td>
<td>2490</td>
<td>USD</td>
<td>2018</td>
<td>1</td>
<td>Source: JDB (2018)</td>
</tr>
<tr>
<td>1</td>
<td>10001</td>
<td>1662-5161</td>
<td>1662-5161</td>
<td>1662-5161</td>
<td>2950</td>
<td>USD</td>
<td>2020</td>
<td>1</td>
<td>Source: JDB (2020)</td>
</tr>
<tr>
<td>2</td>
<td>10002</td>
<td>0952-3383</td>
<td>1467-8578</td>
<td>0952-3383</td>
<td>2500</td>
<td>EUR</td>
<td>2017</td>
<td>1</td>
<td>Source: JDB (2017)</td>
</tr>
<tr>
<td>3</td>
<td>10005</td>
<td>1179-7258</td>
<td>1179-7258</td>
<td>1179-7258</td>
<td>1958</td>
<td>USD</td>
<td>2018</td>
<td>1</td>
<td>Source: JDB (2018)</td>
</tr>
<tr>
<td>4</td>
<td>10005</td>
<td>1179-7258</td>
<td>1179-7258</td>
<td>1179-7258</td>
<td>1958</td>
<td>USD</td>
<td>2020</td>
<td>1</td>
<td>Source: JDB (2020)</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>11575</td>
<td>9986</td>
<td>1549-9634</td>
<td>1549-9642</td>
<td>1549-9634</td>
<td>3000</td>
<td>USD</td>
<td>2015</td>
<td>1</td>
<td>Source: JDB (2015)</td>
</tr>
<tr>
<td>11576</td>
<td>9986</td>
<td>1549-9634</td>
<td>1549-9642</td>
<td>1549-9634</td>
<td>3550</td>
<td>USD</td>
<td>2016</td>
<td>1</td>
<td>Source: JDB (2016)</td>
</tr>
<tr>
<td>11577</td>
<td>9986</td>
<td>1549-9634</td>
<td>1549-9642</td>
<td>1549-9634</td>
<td>3550</td>
<td>USD</td>
<td>2017</td>
<td>1</td>
<td>Source: JDB (2017)</td>
</tr>
<tr>
<td>11578</td>
<td>9986</td>
<td>1549-9634</td>
<td>1549-9642</td>
<td>1549-9634</td>
<td>3750</td>
<td>USD</td>
<td>2018</td>
<td>1</td>
<td>Source: JDB (2018)</td>
</tr>
<tr>
<td>11579</td>
<td>9995</td>
<td>0816-4649</td>
<td>1465-3303</td>
<td>0816-4649</td>
<td>2950</td>
<td>USD</td>
<td>2017</td>
<td>1</td>
<td>Source: JDB (2017)</td>
</tr>
</tbody>
</table>
<p>11580 rows × 9 columns</p>
</div>
```python
jdb = jdb.drop_duplicates(subset='jdb_id', keep='last')
```
```python
# import openapc avec les valeurs max
openapc = pd.read_csv('openapc/open_apc_max.tsv', encoding='utf-8', header=0, sep='\t')
openapc
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>period</th>
<th>euro</th>
<th>issn</th>
<th>issn_print</th>
<th>issn_electronic</th>
<th>issn_l</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>2018</td>
<td>1385.36</td>
<td>0001-0782</td>
<td>0001-0782</td>
<td>NaN</td>
<td>0001-0782</td>
</tr>
<tr>
<td>1</td>
<td>2018</td>
<td>1811.88</td>
<td>0001-1452</td>
<td>0001-1452</td>
<td>1533-385X</td>
<td>0001-1452</td>
</tr>
<tr>
<td>2</td>
<td>2020</td>
<td>1826.49</td>
<td>0001-1452</td>
<td>0001-1452</td>
<td>1533-385X</td>
<td>0001-1452</td>
</tr>
<tr>
<td>3</td>
<td>2013</td>
<td>2238.76</td>
<td>0001-1541</td>
<td>NaN</td>
<td>NaN</td>
<td>0001-1541</td>
</tr>
<tr>
<td>4</td>
<td>2014</td>
<td>1887.86</td>
<td>0001-1541</td>
<td>NaN</td>
<td>NaN</td>
<td>0001-1541</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>23793</td>
<td>2013</td>
<td>2400.00</td>
<td>8756-7938</td>
<td>NaN</td>
<td>NaN</td>
<td>1520-6033</td>
</tr>
<tr>
<td>23794</td>
<td>2014</td>
<td>1822.49</td>
<td>8756-7938</td>
<td>NaN</td>
<td>NaN</td>
<td>1520-6033</td>
</tr>
<tr>
<td>23795</td>
<td>2016</td>
<td>1762.69</td>
<td>8756-7938</td>
<td>NaN</td>
<td>NaN</td>
<td>1520-6033</td>
</tr>
<tr>
<td>23796</td>
<td>2017</td>
<td>3248.31</td>
<td>8756-7938</td>
<td>NaN</td>
<td>NaN</td>
<td>1520-6033</td>
</tr>
<tr>
<td>23797</td>
<td>2019</td>
<td>2913.11</td>
<td>8756-7938</td>
<td>NaN</td>
<td>NaN</td>
<td>1520-6033</td>
</tr>
</tbody>
</table>
<p>23798 rows × 6 columns</p>
</div>
```python
# renommer les champs
openapc = openapc.rename(columns = {'period' : 'apc_date', 'issn_l' : 'issn_link', 'euro' : 'amount'})
openapc
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>apc_date</th>
<th>amount</th>
<th>issn</th>
<th>issn_print</th>
<th>issn_electronic</th>
<th>issn_link</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>2018</td>
<td>1385.36</td>
<td>0001-0782</td>
<td>0001-0782</td>
<td>NaN</td>
<td>0001-0782</td>
</tr>
<tr>
<td>1</td>
<td>2018</td>
<td>1811.88</td>
<td>0001-1452</td>
<td>0001-1452</td>
<td>1533-385X</td>
<td>0001-1452</td>
</tr>
<tr>
<td>2</td>
<td>2020</td>
<td>1826.49</td>
<td>0001-1452</td>
<td>0001-1452</td>
<td>1533-385X</td>
<td>0001-1452</td>
</tr>
<tr>
<td>3</td>
<td>2013</td>
<td>2238.76</td>
<td>0001-1541</td>
<td>NaN</td>
<td>NaN</td>
<td>0001-1541</td>
</tr>
<tr>
<td>4</td>
<td>2014</td>
<td>1887.86</td>
<td>0001-1541</td>
<td>NaN</td>
<td>NaN</td>
<td>0001-1541</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>23793</td>
<td>2013</td>
<td>2400.00</td>
<td>8756-7938</td>
<td>NaN</td>
<td>NaN</td>
<td>1520-6033</td>
</tr>
<tr>
<td>23794</td>
<td>2014</td>
<td>1822.49</td>
<td>8756-7938</td>
<td>NaN</td>
<td>NaN</td>
<td>1520-6033</td>
</tr>
<tr>
<td>23795</td>
<td>2016</td>
<td>1762.69</td>
<td>8756-7938</td>
<td>NaN</td>
<td>NaN</td>
<td>1520-6033</td>
</tr>
<tr>
<td>23796</td>
<td>2017</td>
<td>3248.31</td>
<td>8756-7938</td>
<td>NaN</td>
<td>NaN</td>
<td>1520-6033</td>
</tr>
<tr>
<td>23797</td>
<td>2019</td>
<td>2913.11</td>
<td>8756-7938</td>
<td>NaN</td>
<td>NaN</td>
<td>1520-6033</td>
</tr>
</tbody>
</table>
<p>23798 rows × 6 columns</p>
</div>
```python
# ajouter le lien avec le type et le symbole
openapc['cost_factor_type'] = 1
openapc['jdb_id'] = np.nan
openapc['symbol'] = 'EUR'
openapc['comment'] = 'Source: OpenAPC (' + openapc['apc_date'].astype(str) + ')'
openapc
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>apc_date</th>
<th>amount</th>
<th>issn</th>
<th>issn_print</th>
<th>issn_electronic</th>
<th>issn_link</th>
<th>cost_factor_type</th>
<th>jdb_id</th>
<th>symbol</th>
<th>comment</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>2018</td>
<td>1385.36</td>
<td>0001-0782</td>
<td>0001-0782</td>
<td>NaN</td>
<td>0001-0782</td>
<td>1</td>
<td>NaN</td>
<td>EUR</td>
<td>Source: OpenAPC (2018)</td>
</tr>
<tr>
<td>1</td>
<td>2018</td>
<td>1811.88</td>
<td>0001-1452</td>
<td>0001-1452</td>
<td>1533-385X</td>
<td>0001-1452</td>
<td>1</td>
<td>NaN</td>
<td>EUR</td>
<td>Source: OpenAPC (2018)</td>
</tr>
<tr>
<td>2</td>
<td>2020</td>
<td>1826.49</td>
<td>0001-1452</td>
<td>0001-1452</td>
<td>1533-385X</td>
<td>0001-1452</td>
<td>1</td>
<td>NaN</td>
<td>EUR</td>
<td>Source: OpenAPC (2020)</td>
</tr>
<tr>
<td>3</td>
<td>2013</td>
<td>2238.76</td>
<td>0001-1541</td>
<td>NaN</td>
<td>NaN</td>
<td>0001-1541</td>
<td>1</td>
<td>NaN</td>
<td>EUR</td>
<td>Source: OpenAPC (2013)</td>
</tr>
<tr>
<td>4</td>
<td>2014</td>
<td>1887.86</td>
<td>0001-1541</td>
<td>NaN</td>
<td>NaN</td>
<td>0001-1541</td>
<td>1</td>
<td>NaN</td>
<td>EUR</td>
<td>Source: OpenAPC (2014)</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>23793</td>
<td>2013</td>
<td>2400.00</td>
<td>8756-7938</td>
<td>NaN</td>
<td>NaN</td>
<td>1520-6033</td>
<td>1</td>
<td>NaN</td>
<td>EUR</td>
<td>Source: OpenAPC (2013)</td>
</tr>
<tr>
<td>23794</td>
<td>2014</td>
<td>1822.49</td>
<td>8756-7938</td>
<td>NaN</td>
<td>NaN</td>
<td>1520-6033</td>
<td>1</td>
<td>NaN</td>
<td>EUR</td>
<td>Source: OpenAPC (2014)</td>
</tr>
<tr>
<td>23795</td>
<td>2016</td>
<td>1762.69</td>
<td>8756-7938</td>
<td>NaN</td>
<td>NaN</td>
<td>1520-6033</td>
<td>1</td>
<td>NaN</td>
<td>EUR</td>
<td>Source: OpenAPC (2016)</td>
</tr>
<tr>
<td>23796</td>
<td>2017</td>
<td>3248.31</td>
<td>8756-7938</td>
<td>NaN</td>
<td>NaN</td>
<td>1520-6033</td>
<td>1</td>
<td>NaN</td>
<td>EUR</td>
<td>Source: OpenAPC (2017)</td>
</tr>
<tr>
<td>23797</td>
<td>2019</td>
<td>2913.11</td>
<td>8756-7938</td>
<td>NaN</td>
<td>NaN</td>
<td>1520-6033</td>
<td>1</td>
<td>NaN</td>
<td>EUR</td>
<td>Source: OpenAPC (2019)</td>
</tr>
</tbody>
</table>
<p>23798 rows × 10 columns</p>
</div>
```python
# ajout des lignes de openapc
jdb = jdb.append(openapc, ignore_index=True)
jdb
```
C:\Users\iriarte\AppData\Local\Continuum\anaconda3\lib\site-packages\pandas\core\frame.py:7123: FutureWarning: Sorting because non-concatenation axis is not aligned. A future version
of pandas will change to not sort by default.
To accept the future behavior, pass 'sort=False'.
To retain the current behavior and silence the warning, pass 'sort=True'.
sort=sort,
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>amount</th>
<th>apc_date</th>
<th>comment</th>
<th>cost_factor_type</th>
<th>issn</th>
<th>issn_electronic</th>
<th>issn_link</th>
<th>issn_print</th>
<th>jdb_id</th>
<th>symbol</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>2950.00</td>
<td>2020</td>
<td>Source: JDB (2020)</td>
<td>1</td>
<td>NaN</td>
<td>1662-5161</td>
<td>1662-5161</td>
<td>1662-5161</td>
<td>10001.0</td>
<td>USD</td>
</tr>
<tr>
<td>1</td>
<td>2500.00</td>
<td>2017</td>
<td>Source: JDB (2017)</td>
<td>1</td>
<td>NaN</td>
<td>1467-8578</td>
<td>0952-3383</td>
<td>0952-3383</td>
<td>10002.0</td>
<td>EUR</td>
</tr>
<tr>
<td>2</td>
<td>1958.00</td>
<td>2020</td>
<td>Source: JDB (2020)</td>
<td>1</td>
<td>NaN</td>
<td>1179-7258</td>
<td>1179-7258</td>
<td>1179-7258</td>
<td>10005.0</td>
<td>USD</td>
</tr>
<tr>
<td>3</td>
<td>1370.00</td>
<td>2020</td>
<td>Source: JDB (2020)</td>
<td>1</td>
<td>NaN</td>
<td>1479-5876</td>
<td>1479-5876</td>
<td>NaN</td>
<td>10015.0</td>
<td>GBP</td>
</tr>
<tr>
<td>4</td>
<td>2200.00</td>
<td>2017</td>
<td>Source: JDB (2017)</td>
<td>1</td>
<td>NaN</td>
<td>1572-8552</td>
<td>1383-4924</td>
<td>1383-4924</td>
<td>10023.0</td>
<td>EUR</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>29947</td>
<td>2400.00</td>
<td>2013</td>
<td>Source: OpenAPC (2013)</td>
<td>1</td>
<td>8756-7938</td>
<td>NaN</td>
<td>1520-6033</td>
<td>NaN</td>
<td>NaN</td>
<td>EUR</td>
</tr>
<tr>
<td>29948</td>
<td>1822.49</td>
<td>2014</td>
<td>Source: OpenAPC (2014)</td>
<td>1</td>
<td>8756-7938</td>
<td>NaN</td>
<td>1520-6033</td>
<td>NaN</td>
<td>NaN</td>
<td>EUR</td>
</tr>
<tr>
<td>29949</td>
<td>1762.69</td>
<td>2016</td>
<td>Source: OpenAPC (2016)</td>
<td>1</td>
<td>8756-7938</td>
<td>NaN</td>
<td>1520-6033</td>
<td>NaN</td>
<td>NaN</td>
<td>EUR</td>
</tr>
<tr>
<td>29950</td>
<td>3248.31</td>
<td>2017</td>
<td>Source: OpenAPC (2017)</td>
<td>1</td>
<td>8756-7938</td>
<td>NaN</td>
<td>1520-6033</td>
<td>NaN</td>
<td>NaN</td>
<td>EUR</td>
</tr>
<tr>
<td>29951</td>
<td>2913.11</td>
<td>2019</td>
<td>Source: OpenAPC (2019)</td>
<td>1</td>
<td>8756-7938</td>
<td>NaN</td>
<td>1520-6033</td>
<td>NaN</td>
<td>NaN</td>
<td>EUR</td>
</tr>
</tbody>
</table>
<p>29952 rows × 10 columns</p>
</div>
```python
# supprimer les doublons par issnl et date
jdb = jdb.drop_duplicates(subset=['issn_link', 'apc_date'], keep='first')
jdb
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>amount</th>
<th>apc_date</th>
<th>comment</th>
<th>cost_factor_type</th>
<th>issn</th>
<th>issn_electronic</th>
<th>issn_link</th>
<th>issn_print</th>
<th>jdb_id</th>
<th>symbol</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>2950.00</td>
<td>2020</td>
<td>Source: JDB (2020)</td>
<td>1</td>
<td>NaN</td>
<td>1662-5161</td>
<td>1662-5161</td>
<td>1662-5161</td>
<td>10001.0</td>
<td>USD</td>
</tr>
<tr>
<td>1</td>
<td>2500.00</td>
<td>2017</td>
<td>Source: JDB (2017)</td>
<td>1</td>
<td>NaN</td>
<td>1467-8578</td>
<td>0952-3383</td>
<td>0952-3383</td>
<td>10002.0</td>
<td>EUR</td>
</tr>
<tr>
<td>2</td>
<td>1958.00</td>
<td>2020</td>
<td>Source: JDB (2020)</td>
<td>1</td>
<td>NaN</td>
<td>1179-7258</td>
<td>1179-7258</td>
<td>1179-7258</td>
<td>10005.0</td>
<td>USD</td>
</tr>
<tr>
<td>3</td>
<td>1370.00</td>
<td>2020</td>
<td>Source: JDB (2020)</td>
<td>1</td>
<td>NaN</td>
<td>1479-5876</td>
<td>1479-5876</td>
<td>NaN</td>
<td>10015.0</td>
<td>GBP</td>
</tr>
<tr>
<td>4</td>
<td>2200.00</td>
<td>2017</td>
<td>Source: JDB (2017)</td>
<td>1</td>
<td>NaN</td>
<td>1572-8552</td>
<td>1383-4924</td>
<td>1383-4924</td>
<td>10023.0</td>
<td>EUR</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>29947</td>
<td>2400.00</td>
<td>2013</td>
<td>Source: OpenAPC (2013)</td>
<td>1</td>
<td>8756-7938</td>
<td>NaN</td>
<td>1520-6033</td>
<td>NaN</td>
<td>NaN</td>
<td>EUR</td>
</tr>
<tr>
<td>29948</td>
<td>1822.49</td>
<td>2014</td>
<td>Source: OpenAPC (2014)</td>
<td>1</td>
<td>8756-7938</td>
<td>NaN</td>
<td>1520-6033</td>
<td>NaN</td>
<td>NaN</td>
<td>EUR</td>
</tr>
<tr>
<td>29949</td>
<td>1762.69</td>
<td>2016</td>
<td>Source: OpenAPC (2016)</td>
<td>1</td>
<td>8756-7938</td>
<td>NaN</td>
<td>1520-6033</td>
<td>NaN</td>
<td>NaN</td>
<td>EUR</td>
</tr>
<tr>
<td>29950</td>
<td>3248.31</td>
<td>2017</td>
<td>Source: OpenAPC (2017)</td>
<td>1</td>
<td>8756-7938</td>
<td>NaN</td>
<td>1520-6033</td>
<td>NaN</td>
<td>NaN</td>
<td>EUR</td>
</tr>
<tr>
<td>29951</td>
<td>2913.11</td>
<td>2019</td>
<td>Source: OpenAPC (2019)</td>
<td>1</td>
<td>8756-7938</td>
<td>NaN</td>
<td>1520-6033</td>
<td>NaN</td>
<td>NaN</td>
<td>EUR</td>
</tr>
</tbody>
</table>
<p>29478 rows × 10 columns</p>
</div>
```python
# ajout de DOAJ
cost_factor = doaj_apc.append(jdb, ignore_index=True)
cost_factor
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>APC amount</th>
<th>amount</th>
<th>apc_date</th>
<th>comment</th>
<th>cost_factor_type</th>
<th>issn</th>
<th>issn_electronic</th>
<th>issn_link</th>
<th>issn_print</th>
<th>jdb_id</th>
<th>symbol</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1600 EUR</td>
<td>1600</td>
<td>NaN</td>
<td>Source: DOAJ</td>
<td>1</td>
<td>1651-2057</td>
<td>1651-2057</td>
<td>0001-5555</td>
<td>0001-5555</td>
<td>NaN</td>
<td>EUR</td>
</tr>
<tr>
<td>1</td>
<td>400 EUR</td>
<td>400</td>
<td>NaN</td>
<td>Source: DOAJ</td>
<td>1</td>
<td>2353-074X</td>
<td>2353-074X</td>
<td>0001-625X</td>
<td>0001-625X</td>
<td>NaN</td>
<td>EUR</td>
</tr>
<tr>
<td>2</td>
<td>1500 USD</td>
<td>1500</td>
<td>NaN</td>
<td>Source: DOAJ</td>
<td>1</td>
<td>1873-6297</td>
<td>1873-6297</td>
<td>0001-6918</td>
<td>0001-6918</td>
<td>NaN</td>
<td>USD</td>
</tr>
<tr>
<td>3</td>
<td>520 EUR</td>
<td>520</td>
<td>NaN</td>
<td>Source: DOAJ</td>
<td>1</td>
<td>2083-9480</td>
<td>2083-9480</td>
<td>0001-6977</td>
<td>0001-6977</td>
<td>NaN</td>
<td>EUR</td>
</tr>
<tr>
<td>4</td>
<td>3500 USD</td>
<td>3500</td>
<td>NaN</td>
<td>Source: DOAJ</td>
<td>1</td>
<td>2327-9788</td>
<td>2327-9788</td>
<td>0003-1062</td>
<td>0003-1062</td>
<td>NaN</td>
<td>USD</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>45502</td>
<td>NaN</td>
<td>2400</td>
<td>2013</td>
<td>Source: OpenAPC (2013)</td>
<td>1</td>
<td>8756-7938</td>
<td>NaN</td>
<td>1520-6033</td>
<td>NaN</td>
<td>NaN</td>
<td>EUR</td>
</tr>
<tr>
<td>45503</td>
<td>NaN</td>
<td>1822.49</td>
<td>2014</td>
<td>Source: OpenAPC (2014)</td>
<td>1</td>
<td>8756-7938</td>
<td>NaN</td>
<td>1520-6033</td>
<td>NaN</td>
<td>NaN</td>
<td>EUR</td>
</tr>
<tr>
<td>45504</td>
<td>NaN</td>
<td>1762.69</td>
<td>2016</td>
<td>Source: OpenAPC (2016)</td>
<td>1</td>
<td>8756-7938</td>
<td>NaN</td>
<td>1520-6033</td>
<td>NaN</td>
<td>NaN</td>
<td>EUR</td>
</tr>
<tr>
<td>45505</td>
<td>NaN</td>
<td>3248.31</td>
<td>2017</td>
<td>Source: OpenAPC (2017)</td>
<td>1</td>
<td>8756-7938</td>
<td>NaN</td>
<td>1520-6033</td>
<td>NaN</td>
<td>NaN</td>
<td>EUR</td>
</tr>
<tr>
<td>45506</td>
<td>NaN</td>
<td>2913.11</td>
<td>2019</td>
<td>Source: OpenAPC (2019)</td>
<td>1</td>
<td>8756-7938</td>
<td>NaN</td>
<td>1520-6033</td>
<td>NaN</td>
<td>NaN</td>
<td>EUR</td>
</tr>
</tbody>
</table>
<p>45507 rows × 11 columns</p>
</div>
```python
# test issnl
cost_factor.loc[cost_factor['issn_link'].isna()]
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>APC amount</th>
<th>amount</th>
<th>apc_date</th>
<th>comment</th>
<th>cost_factor_type</th>
<th>issn</th>
<th>issn_electronic</th>
<th>issn_link</th>
<th>issn_print</th>
<th>jdb_id</th>
<th>symbol</th>
</tr>
</thead>
<tbody>
<tr>
<td>13</td>
<td>540 PLN</td>
<td>540</td>
<td>NaN</td>
<td>Source: DOAJ</td>
<td>1</td>
<td>2544-8552</td>
<td>2544-8552</td>
<td>NaN</td>
<td>0014-8261</td>
<td>NaN</td>
<td>PLN</td>
</tr>
<tr>
<td>62</td>
<td>100 USD</td>
<td>100</td>
<td>NaN</td>
<td>Source: DOAJ</td>
<td>1</td>
<td>2545-3149</td>
<td>2545-3149</td>
<td>NaN</td>
<td>0079-4252</td>
<td>NaN</td>
<td>USD</td>
</tr>
<tr>
<td>129</td>
<td>423 EUR</td>
<td>423</td>
<td>NaN</td>
<td>Source: DOAJ</td>
<td>1</td>
<td>2605-3322</td>
<td>2605-3322</td>
<td>NaN</td>
<td>0212-9426</td>
<td>NaN</td>
<td>EUR</td>
</tr>
<tr>
<td>133</td>
<td>200 EUR</td>
<td>200</td>
<td>NaN</td>
<td>Source: DOAJ</td>
<td>1</td>
<td>2603-5987</td>
<td>2603-5987</td>
<td>NaN</td>
<td>0214-9877</td>
<td>NaN</td>
<td>EUR</td>
</tr>
<tr>
<td>140</td>
<td>800000 IDR</td>
<td>800000</td>
<td>NaN</td>
<td>Source: DOAJ</td>
<td>1</td>
<td>2621-1122</td>
<td>2621-1122</td>
<td>NaN</td>
<td>0216-3438</td>
<td>NaN</td>
<td>IDR</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>26703</td>
<td>NaN</td>
<td>3873.61</td>
<td>2016</td>
<td>Source: OpenAPC (2016)</td>
<td>1</td>
<td>0263-8762</td>
<td>NaN</td>
<td>NaN</td>
<td>0263-8762</td>
<td>NaN</td>
<td>EUR</td>
</tr>
<tr>
<td>26704</td>
<td>NaN</td>
<td>2557.73</td>
<td>2017</td>
<td>Source: OpenAPC (2017)</td>
<td>1</td>
<td>0263-8762</td>
<td>NaN</td>
<td>NaN</td>
<td>0263-8762</td>
<td>NaN</td>
<td>EUR</td>
</tr>
<tr>
<td>26705</td>
<td>NaN</td>
<td>3564.25</td>
<td>2018</td>
<td>Source: OpenAPC (2018)</td>
<td>1</td>
<td>0263-8762</td>
<td>NaN</td>
<td>NaN</td>
<td>0263-8762</td>
<td>NaN</td>
<td>EUR</td>
</tr>
<tr>
<td>27923</td>
<td>NaN</td>
<td>1130.5</td>
<td>2019</td>
<td>Source: OpenAPC (2019)</td>
<td>1</td>
<td>0342-183X</td>
<td>NaN</td>
<td>NaN</td>
<td>0342-183X</td>
<td>NaN</td>
<td>EUR</td>
</tr>
<tr>
<td>45474</td>
<td>NaN</td>
<td>1690</td>
<td>2020</td>
<td>Source: OpenAPC (2020)</td>
<td>1</td>
<td>2691-9478</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>EUR</td>
</tr>
</tbody>
</table>
<p>2500 rows × 11 columns</p>
</div>
```python
# merge avec issnl
cost_factor = pd.merge(cost_factor, issns, on='issn', how='left')
cost_factor
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>APC amount</th>
<th>amount</th>
<th>apc_date</th>
<th>comment</th>
<th>cost_factor_type</th>
<th>issn</th>
<th>issn_electronic</th>
<th>issn_link</th>
<th>issn_print</th>
<th>jdb_id</th>
<th>symbol</th>
<th>issnl</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1600 EUR</td>
<td>1600</td>
<td>NaN</td>
<td>Source: DOAJ</td>
<td>1</td>
<td>1651-2057</td>
<td>1651-2057</td>
<td>0001-5555</td>
<td>0001-5555</td>
<td>NaN</td>
<td>EUR</td>
<td>0001-5555</td>
</tr>
<tr>
<td>1</td>
<td>400 EUR</td>
<td>400</td>
<td>NaN</td>
<td>Source: DOAJ</td>
<td>1</td>
<td>2353-074X</td>
<td>2353-074X</td>
<td>0001-625X</td>
<td>0001-625X</td>
<td>NaN</td>
<td>EUR</td>
<td>0001-625X</td>
</tr>
<tr>
<td>2</td>
<td>1500 USD</td>
<td>1500</td>
<td>NaN</td>
<td>Source: DOAJ</td>
<td>1</td>
<td>1873-6297</td>
<td>1873-6297</td>
<td>0001-6918</td>
<td>0001-6918</td>
<td>NaN</td>
<td>USD</td>
<td>0001-6918</td>
</tr>
<tr>
<td>3</td>
<td>520 EUR</td>
<td>520</td>
<td>NaN</td>
<td>Source: DOAJ</td>
<td>1</td>
<td>2083-9480</td>
<td>2083-9480</td>
<td>0001-6977</td>
<td>0001-6977</td>
<td>NaN</td>
<td>EUR</td>
<td>0001-6977</td>
</tr>
<tr>
<td>4</td>
<td>3500 USD</td>
<td>3500</td>
<td>NaN</td>
<td>Source: DOAJ</td>
<td>1</td>
<td>2327-9788</td>
<td>2327-9788</td>
<td>0003-1062</td>
<td>0003-1062</td>
<td>NaN</td>
<td>USD</td>
<td>0003-1062</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>45502</td>
<td>NaN</td>
<td>2400</td>
<td>2013</td>
<td>Source: OpenAPC (2013)</td>
<td>1</td>
<td>8756-7938</td>
<td>NaN</td>
<td>1520-6033</td>
<td>NaN</td>
<td>NaN</td>
<td>EUR</td>
<td>1520-6033</td>
</tr>
<tr>
<td>45503</td>
<td>NaN</td>
<td>1822.49</td>
<td>2014</td>
<td>Source: OpenAPC (2014)</td>
<td>1</td>
<td>8756-7938</td>
<td>NaN</td>
<td>1520-6033</td>
<td>NaN</td>
<td>NaN</td>
<td>EUR</td>
<td>1520-6033</td>
</tr>
<tr>
<td>45504</td>
<td>NaN</td>
<td>1762.69</td>
<td>2016</td>
<td>Source: OpenAPC (2016)</td>
<td>1</td>
<td>8756-7938</td>
<td>NaN</td>
<td>1520-6033</td>
<td>NaN</td>
<td>NaN</td>
<td>EUR</td>
<td>1520-6033</td>
</tr>
<tr>
<td>45505</td>
<td>NaN</td>
<td>3248.31</td>
<td>2017</td>
<td>Source: OpenAPC (2017)</td>
<td>1</td>
<td>8756-7938</td>
<td>NaN</td>
<td>1520-6033</td>
<td>NaN</td>
<td>NaN</td>
<td>EUR</td>
<td>1520-6033</td>
</tr>
<tr>
<td>45506</td>
<td>NaN</td>
<td>2913.11</td>
<td>2019</td>
<td>Source: OpenAPC (2019)</td>
<td>1</td>
<td>8756-7938</td>
<td>NaN</td>
<td>1520-6033</td>
<td>NaN</td>
<td>NaN</td>
<td>EUR</td>
<td>1520-6033</td>
</tr>
</tbody>
</table>
<p>45507 rows × 12 columns</p>
</div>
```python
# test issnl
cost_factor.loc[cost_factor['issnl'].isna()]
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>APC amount</th>
<th>amount</th>
<th>apc_date</th>
<th>comment</th>
<th>cost_factor_type</th>
<th>issn</th>
<th>issn_electronic</th>
<th>issn_link</th>
<th>issn_print</th>
<th>jdb_id</th>
<th>symbol</th>
<th>issnl</th>
</tr>
</thead>
<tbody>
<tr>
<td>13</td>
<td>540 PLN</td>
<td>540</td>
<td>NaN</td>
<td>Source: DOAJ</td>
<td>1</td>
<td>2544-8552</td>
<td>2544-8552</td>
<td>NaN</td>
<td>0014-8261</td>
<td>NaN</td>
<td>PLN</td>
<td>NaN</td>
</tr>
<tr>
<td>62</td>
<td>100 USD</td>
<td>100</td>
<td>NaN</td>
<td>Source: DOAJ</td>
<td>1</td>
<td>2545-3149</td>
<td>2545-3149</td>
<td>NaN</td>
<td>0079-4252</td>
<td>NaN</td>
<td>USD</td>
<td>NaN</td>
</tr>
<tr>
<td>129</td>
<td>423 EUR</td>
<td>423</td>
<td>NaN</td>
<td>Source: DOAJ</td>
<td>1</td>
<td>2605-3322</td>
<td>2605-3322</td>
<td>NaN</td>
<td>0212-9426</td>
<td>NaN</td>
<td>EUR</td>
<td>NaN</td>
</tr>
<tr>
<td>133</td>
<td>200 EUR</td>
<td>200</td>
<td>NaN</td>
<td>Source: DOAJ</td>
<td>1</td>
<td>2603-5987</td>
<td>2603-5987</td>
<td>NaN</td>
<td>0214-9877</td>
<td>NaN</td>
<td>EUR</td>
<td>NaN</td>
</tr>
<tr>
<td>140</td>
<td>800000 IDR</td>
<td>800000</td>
<td>NaN</td>
<td>Source: DOAJ</td>
<td>1</td>
<td>2621-1122</td>
<td>2621-1122</td>
<td>NaN</td>
<td>0216-3438</td>
<td>NaN</td>
<td>IDR</td>
<td>NaN</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>45472</td>
<td>NaN</td>
<td>698.65</td>
<td>2019</td>
<td>Source: OpenAPC (2019)</td>
<td>1</td>
<td>2690-0009</td>
<td>2690-0009</td>
<td>2690-0009</td>
<td>NaN</td>
<td>NaN</td>
<td>EUR</td>
<td>NaN</td>
</tr>
<tr>
<td>45473</td>
<td>NaN</td>
<td>754.67</td>
<td>2019</td>
<td>Source: OpenAPC (2019)</td>
<td>1</td>
<td>2690-3202</td>
<td>NaN</td>
<td>2690-3202</td>
<td>NaN</td>
<td>NaN</td>
<td>EUR</td>
<td>NaN</td>
</tr>
<tr>
<td>45474</td>
<td>NaN</td>
<td>1690</td>
<td>2020</td>
<td>Source: OpenAPC (2020)</td>
<td>1</td>
<td>2691-9478</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>EUR</td>
<td>NaN</td>
</tr>
<tr>
<td>45475</td>
<td>NaN</td>
<td>1523.2</td>
<td>2020</td>
<td>Source: OpenAPC (2020)</td>
<td>1</td>
<td>2699-0016</td>
<td>2699-0016</td>
<td>2699-0016</td>
<td>NaN</td>
<td>NaN</td>
<td>EUR</td>
<td>NaN</td>
</tr>
<tr>
<td>45476</td>
<td>NaN</td>
<td>305</td>
<td>2020</td>
<td>Source: OpenAPC (2020)</td>
<td>1</td>
<td>2704-6192</td>
<td>2704-6192</td>
<td>2280-1855</td>
<td>2280-1855</td>
<td>NaN</td>
<td>EUR</td>
<td>NaN</td>
</tr>
</tbody>
</table>
<p>8935 rows × 12 columns</p>
</div>
```python
#ajout des issn quand ça manque
cost_factor.loc[cost_factor['issn'].isna(), 'issn'] = cost_factor['issn_print']
cost_factor.loc[cost_factor['issn'].isna(), 'issn'] = cost_factor['issn_electronic']
cost_factor.loc[cost_factor['issn'].isna(), 'issn'] = cost_factor['issn_link']
cost_factor.loc[cost_factor['issn'].isna()]
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>APC amount</th>
<th>amount</th>
<th>apc_date</th>
<th>comment</th>
<th>cost_factor_type</th>
<th>issn</th>
<th>issn_electronic</th>
<th>issn_link</th>
<th>issn_print</th>
<th>jdb_id</th>
<th>symbol</th>
<th>issnl</th>
</tr>
</thead>
<tbody>
</tbody>
</table>
</div>
```python
#ajout des issnl quand ça manque
cost_factor.loc[cost_factor['issnl'].isna(), 'issnl'] = cost_factor['issn_link']
cost_factor.loc[cost_factor['issnl'].isna(), 'issnl'] = cost_factor['issn_print']
cost_factor.loc[cost_factor['issnl'].isna(), 'issnl'] = cost_factor['issn_electronic']
cost_factor.loc[cost_factor['issnl'].isna(), 'issnl'] = cost_factor['issn']
cost_factor.loc[cost_factor['issnl'].isna()]
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>APC amount</th>
<th>amount</th>
<th>apc_date</th>
<th>comment</th>
<th>cost_factor_type</th>
<th>issn</th>
<th>issn_electronic</th>
<th>issn_link</th>
<th>issn_print</th>
<th>jdb_id</th>
<th>symbol</th>
<th>issnl</th>
</tr>
</thead>
<tbody>
</tbody>
</table>
</div>
```python
# prendre les ids pour le merge
cost_factor_ids = cost_factor[['issn', 'issnl', 'cost_factor_type', 'amount', 'symbol', 'comment']]
# cost_factor_ids_1 = cost_factor_ids_1.rename(columns = {'issn_link' : 'issn'})
# cost_factor_ids_2 = cost_factor.loc[cost_factor['issn_electronic'].notna()][['issn_electronic', 'cost_factor_type', 'amount', 'symbol', 'comment']]
# cost_factor_ids_2 = cost_factor_ids_2.rename(columns = {'issn_electronic' : 'issn'})
# cost_factor_ids_3 = cost_factor.loc[cost_factor['issn_print'].notna()][['issn_print', 'cost_factor_type', 'amount', 'symbol', 'comment']]
# cost_factor_ids_3 = cost_factor_ids_3.rename(columns = {'issn_print' : 'issn'})
# cost_factor_ids_4 = cost_factor.loc[cost_factor['issn'].notna()][['issn', 'cost_factor_type', 'amount', 'symbol', 'comment']]
# cost_factor_ids = cost_factor_ids_1.append(cost_factor_ids_2)
# cost_factor_ids = cost_factor_ids.append(cost_factor_ids_3)
# cost_factor_ids = cost_factor_ids.append(cost_factor_ids_4)
cost_factor_ids
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>issn</th>
<th>issnl</th>
<th>cost_factor_type</th>
<th>amount</th>
<th>symbol</th>
<th>comment</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1651-2057</td>
<td>0001-5555</td>
<td>1</td>
<td>1600</td>
<td>EUR</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>1</td>
<td>2353-074X</td>
<td>0001-625X</td>
<td>1</td>
<td>400</td>
<td>EUR</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>2</td>
<td>1873-6297</td>
<td>0001-6918</td>
<td>1</td>
<td>1500</td>
<td>USD</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>3</td>
<td>2083-9480</td>
<td>0001-6977</td>
<td>1</td>
<td>520</td>
<td>EUR</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>4</td>
<td>2327-9788</td>
<td>0003-1062</td>
<td>1</td>
<td>3500</td>
<td>USD</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>45502</td>
<td>8756-7938</td>
<td>1520-6033</td>
<td>1</td>
<td>2400</td>
<td>EUR</td>
<td>Source: OpenAPC (2013)</td>
</tr>
<tr>
<td>45503</td>
<td>8756-7938</td>
<td>1520-6033</td>
<td>1</td>
<td>1822.49</td>
<td>EUR</td>
<td>Source: OpenAPC (2014)</td>
</tr>
<tr>
<td>45504</td>
<td>8756-7938</td>
<td>1520-6033</td>
<td>1</td>
<td>1762.69</td>
<td>EUR</td>
<td>Source: OpenAPC (2016)</td>
</tr>
<tr>
<td>45505</td>
<td>8756-7938</td>
<td>1520-6033</td>
<td>1</td>
<td>3248.31</td>
<td>EUR</td>
<td>Source: OpenAPC (2017)</td>
</tr>
<tr>
<td>45506</td>
<td>8756-7938</td>
<td>1520-6033</td>
<td>1</td>
<td>2913.11</td>
<td>EUR</td>
<td>Source: OpenAPC (2019)</td>
</tr>
</tbody>
</table>
<p>45507 rows × 6 columns</p>
</div>
```python
# supprimer les doublons et les vides
cost_factor_ids = cost_factor_ids.drop_duplicates(subset=['issnl'])
cost_factor_ids
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>issn</th>
<th>issnl</th>
<th>cost_factor_type</th>
<th>amount</th>
<th>symbol</th>
<th>comment</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1651-2057</td>
<td>0001-5555</td>
<td>1</td>
<td>1600</td>
<td>EUR</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>1</td>
<td>2353-074X</td>
<td>0001-625X</td>
<td>1</td>
<td>400</td>
<td>EUR</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>2</td>
<td>1873-6297</td>
<td>0001-6918</td>
<td>1</td>
<td>1500</td>
<td>USD</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>3</td>
<td>2083-9480</td>
<td>0001-6977</td>
<td>1</td>
<td>520</td>
<td>EUR</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>4</td>
<td>2327-9788</td>
<td>0003-1062</td>
<td>1</td>
<td>3500</td>
<td>USD</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>45473</td>
<td>2690-3202</td>
<td>2690-3202</td>
<td>1</td>
<td>754.67</td>
<td>EUR</td>
<td>Source: OpenAPC (2019)</td>
</tr>
<tr>
<td>45474</td>
<td>2691-9478</td>
<td>2691-9478</td>
<td>1</td>
<td>1690</td>
<td>EUR</td>
<td>Source: OpenAPC (2020)</td>
</tr>
<tr>
<td>45477</td>
<td>8750-7587</td>
<td>1522-1601</td>
<td>1</td>
<td>2355.13</td>
<td>EUR</td>
<td>Source: OpenAPC (2016)</td>
</tr>
<tr>
<td>45481</td>
<td>8755-1209</td>
<td>1944-9208</td>
<td>1</td>
<td>2627.74</td>
<td>EUR</td>
<td>Source: OpenAPC (2013)</td>
</tr>
<tr>
<td>45498</td>
<td>8756-758X</td>
<td>1460-2695</td>
<td>1</td>
<td>2725.08</td>
<td>EUR</td>
<td>Source: OpenAPC (2014)</td>
</tr>
</tbody>
</table>
<p>24018 rows × 6 columns</p>
</div>
```python
# merge dans l'autre sens pour garder que les lignes du fichier
cost_factor_ids = pd.merge(cost_factor_ids, sherpa[['id', 'issnl']], on='issnl', how='left')
cost_factor_ids
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>issn</th>
<th>issnl</th>
<th>cost_factor_type</th>
<th>amount</th>
<th>symbol</th>
<th>comment</th>
<th>id</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1651-2057</td>
<td>0001-5555</td>
<td>1</td>
<td>1600</td>
<td>EUR</td>
<td>Source: DOAJ</td>
<td>NaN</td>
</tr>
<tr>
<td>1</td>
<td>2353-074X</td>
<td>0001-625X</td>
<td>1</td>
<td>400</td>
<td>EUR</td>
<td>Source: DOAJ</td>
<td>NaN</td>
</tr>
<tr>
<td>2</td>
<td>1873-6297</td>
<td>0001-6918</td>
<td>1</td>
<td>1500</td>
<td>USD</td>
<td>Source: DOAJ</td>
<td>NaN</td>
</tr>
<tr>
<td>3</td>
<td>2083-9480</td>
<td>0001-6977</td>
<td>1</td>
<td>520</td>
<td>EUR</td>
<td>Source: DOAJ</td>
<td>NaN</td>
</tr>
<tr>
<td>4</td>
<td>2327-9788</td>
<td>0003-1062</td>
<td>1</td>
<td>3500</td>
<td>USD</td>
<td>Source: DOAJ</td>
<td>NaN</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>31397</td>
<td>2690-3202</td>
<td>2690-3202</td>
<td>1</td>
<td>754.67</td>
<td>EUR</td>
<td>Source: OpenAPC (2019)</td>
<td>NaN</td>
</tr>
<tr>
<td>31398</td>
<td>2691-9478</td>
<td>2691-9478</td>
<td>1</td>
<td>1690</td>
<td>EUR</td>
<td>Source: OpenAPC (2020)</td>
<td>NaN</td>
</tr>
<tr>
<td>31399</td>
<td>8750-7587</td>
<td>1522-1601</td>
<td>1</td>
<td>2355.13</td>
<td>EUR</td>
<td>Source: OpenAPC (2016)</td>
<td>NaN</td>
</tr>
<tr>
<td>31400</td>
<td>8755-1209</td>
<td>1944-9208</td>
<td>1</td>
<td>2627.74</td>
<td>EUR</td>
<td>Source: OpenAPC (2013)</td>
<td>NaN</td>
</tr>
<tr>
<td>31401</td>
<td>8756-758X</td>
<td>1460-2695</td>
<td>1</td>
<td>2725.08</td>
<td>EUR</td>
<td>Source: OpenAPC (2014)</td>
<td>NaN</td>
</tr>
</tbody>
</table>
<p>31402 rows × 7 columns</p>
</div>
```python
# garder les lignes avec merge
cost_factor_ids_all = cost_factor_ids.loc[cost_factor_ids['id'].notnull()]
cost_factor_ids_all
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>issn</th>
<th>issnl</th>
<th>cost_factor_type</th>
<th>amount</th>
<th>symbol</th>
<th>comment</th>
<th>id</th>
</tr>
</thead>
<tbody>
<tr>
<td>23</td>
<td>1083-351X</td>
<td>0021-9258</td>
<td>1</td>
<td>2500</td>
<td>USD</td>
<td>Source: DOAJ</td>
<td>1369.0</td>
</tr>
<tr>
<td>24</td>
<td>1083-351X</td>
<td>0021-9258</td>
<td>1</td>
<td>2500</td>
<td>USD</td>
<td>Source: DOAJ</td>
<td>1370.0</td>
</tr>
<tr>
<td>25</td>
<td>1083-351X</td>
<td>0021-9258</td>
<td>1</td>
<td>2500</td>
<td>USD</td>
<td>Source: DOAJ</td>
<td>1371.0</td>
</tr>
<tr>
<td>26</td>
<td>1083-351X</td>
<td>0021-9258</td>
<td>1</td>
<td>2500</td>
<td>USD</td>
<td>Source: DOAJ</td>
<td>1372.0</td>
</tr>
<tr>
<td>31</td>
<td>1536-5964</td>
<td>0025-7974</td>
<td>1</td>
<td>1950</td>
<td>USD</td>
<td>Source: DOAJ</td>
<td>2147.0</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>31297</td>
<td>2475-9953</td>
<td>2475-9953</td>
<td>1</td>
<td>2023.37</td>
<td>EUR</td>
<td>Source: OpenAPC (2017)</td>
<td>8591.0</td>
</tr>
<tr>
<td>31298</td>
<td>2475-9953</td>
<td>2475-9953</td>
<td>1</td>
<td>2023.37</td>
<td>EUR</td>
<td>Source: OpenAPC (2017)</td>
<td>8592.0</td>
</tr>
<tr>
<td>31299</td>
<td>2475-9953</td>
<td>2475-9953</td>
<td>1</td>
<td>2023.37</td>
<td>EUR</td>
<td>Source: OpenAPC (2017)</td>
<td>8593.0</td>
</tr>
<tr>
<td>31300</td>
<td>2475-9953</td>
<td>2475-9953</td>
<td>1</td>
<td>2023.37</td>
<td>EUR</td>
<td>Source: OpenAPC (2017)</td>
<td>8594.0</td>
</tr>
<tr>
<td>31301</td>
<td>2475-9953</td>
<td>2475-9953</td>
<td>1</td>
<td>2023.37</td>
<td>EUR</td>
<td>Source: OpenAPC (2017)</td>
<td>8595.0</td>
</tr>
</tbody>
</table>
<p>7964 rows × 7 columns</p>
</div>
```python
# supprimer les doublons
cost_factor_ids_all = cost_factor_ids_all.drop_duplicates(subset=['id'])
cost_factor_ids_all
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>issn</th>
<th>issnl</th>
<th>cost_factor_type</th>
<th>amount</th>
<th>symbol</th>
<th>comment</th>
<th>id</th>
</tr>
</thead>
<tbody>
<tr>
<td>23</td>
<td>1083-351X</td>
<td>0021-9258</td>
<td>1</td>
<td>2500</td>
<td>USD</td>
<td>Source: DOAJ</td>
<td>1369.0</td>
</tr>
<tr>
<td>24</td>
<td>1083-351X</td>
<td>0021-9258</td>
<td>1</td>
<td>2500</td>
<td>USD</td>
<td>Source: DOAJ</td>
<td>1370.0</td>
</tr>
<tr>
<td>25</td>
<td>1083-351X</td>
<td>0021-9258</td>
<td>1</td>
<td>2500</td>
<td>USD</td>
<td>Source: DOAJ</td>
<td>1371.0</td>
</tr>
<tr>
<td>26</td>
<td>1083-351X</td>
<td>0021-9258</td>
<td>1</td>
<td>2500</td>
<td>USD</td>
<td>Source: DOAJ</td>
<td>1372.0</td>
</tr>
<tr>
<td>31</td>
<td>1536-5964</td>
<td>0025-7974</td>
<td>1</td>
<td>1950</td>
<td>USD</td>
<td>Source: DOAJ</td>
<td>2147.0</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>31297</td>
<td>2475-9953</td>
<td>2475-9953</td>
<td>1</td>
<td>2023.37</td>
<td>EUR</td>
<td>Source: OpenAPC (2017)</td>
<td>8591.0</td>
</tr>
<tr>
<td>31298</td>
<td>2475-9953</td>
<td>2475-9953</td>
<td>1</td>
<td>2023.37</td>
<td>EUR</td>
<td>Source: OpenAPC (2017)</td>
<td>8592.0</td>
</tr>
<tr>
<td>31299</td>
<td>2475-9953</td>
<td>2475-9953</td>
<td>1</td>
<td>2023.37</td>
<td>EUR</td>
<td>Source: OpenAPC (2017)</td>
<td>8593.0</td>
</tr>
<tr>
<td>31300</td>
<td>2475-9953</td>
<td>2475-9953</td>
<td>1</td>
<td>2023.37</td>
<td>EUR</td>
<td>Source: OpenAPC (2017)</td>
<td>8594.0</td>
</tr>
<tr>
<td>31301</td>
<td>2475-9953</td>
<td>2475-9953</td>
<td>1</td>
<td>2023.37</td>
<td>EUR</td>
<td>Source: OpenAPC (2017)</td>
<td>8595.0</td>
</tr>
</tbody>
</table>
<p>7964 rows × 7 columns</p>
</div>
```python
# supprimer les doublons par issnl
cost_factor_ids_all = cost_factor_ids_all.drop_duplicates(subset=['issnl'])
del cost_factor_ids_all['id']
cost_factor_ids_all
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>issn</th>
<th>issnl</th>
<th>cost_factor_type</th>
<th>amount</th>
<th>symbol</th>
<th>comment</th>
</tr>
</thead>
<tbody>
<tr>
<td>23</td>
<td>1083-351X</td>
<td>0021-9258</td>
<td>1</td>
<td>2500</td>
<td>USD</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>31</td>
<td>1536-5964</td>
<td>0025-7974</td>
<td>1</td>
<td>1950</td>
<td>USD</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>222</td>
<td>1592-8721</td>
<td>0390-6078</td>
<td>1</td>
<td>2000</td>
<td>EUR</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>303</td>
<td>1555-3892</td>
<td>0963-6897</td>
<td>1</td>
<td>2750</td>
<td>USD</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>402</td>
<td>1095-9572</td>
<td>1053-8119</td>
<td>1</td>
<td>3000</td>
<td>USD</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>31237</td>
<td>2469-9926</td>
<td>2469-9926</td>
<td>1</td>
<td>2156.51</td>
<td>EUR</td>
<td>Source: OpenAPC (2015)</td>
</tr>
<tr>
<td>31242</td>
<td>2469-9950</td>
<td>2469-9950</td>
<td>1</td>
<td>2143.51</td>
<td>EUR</td>
<td>Source: OpenAPC (2016)</td>
</tr>
<tr>
<td>31248</td>
<td>2470-0010</td>
<td>2470-0010</td>
<td>1</td>
<td>1763.13</td>
<td>EUR</td>
<td>Source: OpenAPC (2016)</td>
</tr>
<tr>
<td>31253</td>
<td>2470-0045</td>
<td>2470-0045</td>
<td>1</td>
<td>1211.45</td>
<td>EUR</td>
<td>Source: OpenAPC (2016)</td>
</tr>
<tr>
<td>31297</td>
<td>2475-9953</td>
<td>2475-9953</td>
<td>1</td>
<td>2023.37</td>
<td>EUR</td>
<td>Source: OpenAPC (2017)</td>
</tr>
</tbody>
</table>
<p>580 rows × 6 columns</p>
</div>
```python
# convertir l'index en id
cost_factor_ids_all = cost_factor_ids_all.reset_index()
# ajout de l'id avec l'index + 1
cost_factor_ids_all['cost_factor'] = cost_factor_ids_all['index'] + id_start
del cost_factor_ids_all['index']
# convertir l'index en id
cost_factor_ids_all = cost_factor_ids_all.reset_index()
# ajout de l'id avec l'index + 1
cost_factor_ids_all['cost_factor'] = cost_factor_ids_all['index'] + id_start
del cost_factor_ids_all['index']
cost_factor_ids_all
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>issn</th>
<th>issnl</th>
<th>cost_factor_type</th>
<th>amount</th>
<th>symbol</th>
<th>comment</th>
<th>cost_factor</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1083-351X</td>
<td>0021-9258</td>
<td>1</td>
<td>2500</td>
<td>USD</td>
<td>Source: DOAJ</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>1536-5964</td>
<td>0025-7974</td>
<td>1</td>
<td>1950</td>
<td>USD</td>
<td>Source: DOAJ</td>
<td>2</td>
</tr>
<tr>
<td>2</td>
<td>1592-8721</td>
<td>0390-6078</td>
<td>1</td>
<td>2000</td>
<td>EUR</td>
<td>Source: DOAJ</td>
<td>3</td>
</tr>
<tr>
<td>3</td>
<td>1555-3892</td>
<td>0963-6897</td>
<td>1</td>
<td>2750</td>
<td>USD</td>
<td>Source: DOAJ</td>
<td>4</td>
</tr>
<tr>
<td>4</td>
<td>1095-9572</td>
<td>1053-8119</td>
<td>1</td>
<td>3000</td>
<td>USD</td>
<td>Source: DOAJ</td>
<td>5</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>575</td>
<td>2469-9926</td>
<td>2469-9926</td>
<td>1</td>
<td>2156.51</td>
<td>EUR</td>
<td>Source: OpenAPC (2015)</td>
<td>576</td>
</tr>
<tr>
<td>576</td>
<td>2469-9950</td>
<td>2469-9950</td>
<td>1</td>
<td>2143.51</td>
<td>EUR</td>
<td>Source: OpenAPC (2016)</td>
<td>577</td>
</tr>
<tr>
<td>577</td>
<td>2470-0010</td>
<td>2470-0010</td>
<td>1</td>
<td>1763.13</td>
<td>EUR</td>
<td>Source: OpenAPC (2016)</td>
<td>578</td>
</tr>
<tr>
<td>578</td>
<td>2470-0045</td>
<td>2470-0045</td>
<td>1</td>
<td>1211.45</td>
<td>EUR</td>
<td>Source: OpenAPC (2016)</td>
<td>579</td>
</tr>
<tr>
<td>579</td>
<td>2475-9953</td>
<td>2475-9953</td>
<td>1</td>
<td>2023.37</td>
<td>EUR</td>
<td>Source: OpenAPC (2017)</td>
<td>580</td>
</tr>
</tbody>
</table>
<p>580 rows × 7 columns</p>
</div>
```python
# merge avec la table sherpa
sherpa = pd.merge(sherpa, cost_factor_ids_all[['issnl', 'cost_factor']], on='issnl', how='left')
sherpa
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>journal</th>
<th>issn</th>
<th>sherpa_id</th>
<th>sherpa_uri</th>
<th>open_access_prohibited</th>
<th>additional_oa_fee</th>
<th>article_version</th>
<th>sherpa_code</th>
<th>embargo</th>
<th>prerequisites</th>
<th>prerequisite_funders</th>
<th>prerequisite_funders_name</th>
<th>prerequisite_funders_fundref</th>
<th>prerequisite_funders_ror</th>
<th>prerequisite_funders_country</th>
<th>prerequisite_funders_url</th>
<th>prerequisite_funders_sherpa_id</th>
<th>prerequisite_subjects</th>
<th>location</th>
<th>locations_ir</th>
<th>locations_not_ir</th>
<th>named_repository</th>
<th>named_academic_social_network</th>
<th>copyright_owner</th>
<th>publisher_deposit</th>
<th>archiving</th>
<th>conditions</th>
<th>public_notes</th>
<th>id</th>
<th>issnl</th>
<th>version</th>
<th>licence</th>
<th>cost_factor</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/2050</td>
<td>no</td>
<td>no</td>
<td>submitted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; named_repository ; non_comm...</td>
<td>Non-Commercial Institutional Repository</td>
<td>Author's Homepage ; arXiv ; AgEcon ; PhilPaper...</td>
<td>arXiv ; AgEcon ; PhilPapers ; PubMed Central ;...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must acknowledge acceptance for publication ; ...</td>
<td>NaN</td>
<td>1</td>
<td>0001-2815</td>
<td>1</td>
<td>NaN</td>
<td>355.0</td>
</tr>
<tr>
<td>1</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/2050</td>
<td>no</td>
<td>no</td>
<td>accepted</td>
<td>NaN</td>
<td>12</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; named_repository ; non_comm...</td>
<td>Non-Commercial Institutional Repository</td>
<td>Author's Homepage ; arXiv ; AgEcon ; PhilPaper...</td>
<td>arXiv ; AgEcon ; PhilPapers ; PubMed Central ;...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Publisher source must be acknowledged with cit...</td>
<td>NaN</td>
<td>2</td>
<td>0001-2815</td>
<td>2</td>
<td>NaN</td>
<td>355.0</td>
</tr>
<tr>
<td>2</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/3315</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_website ; institutional_repository ; named...</td>
<td>Any Website ; Institutional Repository</td>
<td>PubMed Central ; Subject Repository ; Journal ...</td>
<td>PubMed Central</td>
<td>NaN</td>
<td>authors</td>
<td>disciplinary (PubMed Central) ;</td>
<td>True</td>
<td>Published source must be acknowledged</td>
<td>NaN</td>
<td>3</td>
<td>0001-2815</td>
<td>3</td>
<td>1.0</td>
<td>355.0</td>
</tr>
<tr>
<td>3</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/3315</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by_nc_nd</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_website ; named_repository ; non_commercia...</td>
<td>Any Website ; Non-Commercial Institutional Rep...</td>
<td>PubMed Central ; Non-Commercial Subject Reposi...</td>
<td>PubMed Central</td>
<td>NaN</td>
<td>authors</td>
<td>disciplinary (PubMed Central) ;</td>
<td>True</td>
<td>Published source must be acknowledged</td>
<td>NaN</td>
<td>4</td>
<td>0001-2815</td>
<td>3</td>
<td>2.0</td>
<td>355.0</td>
</tr>
<tr>
<td>4</td>
<td>498</td>
<td>0001-4842</td>
<td>7760</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/4</td>
<td>no</td>
<td>no</td>
<td>submitted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>named_repository ; preprint_repository ; subje...</td>
<td>NaN</td>
<td>ChemRxiv ; bioRxiv ; arXiv ; Preprint Reposito...</td>
<td>ChemRxiv ; bioRxiv ; arXiv</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>False</td>
<td>Must not violate ACS ethical Guidelines ; Must...</td>
<td>NaN</td>
<td>5</td>
<td>0001-4842</td>
<td>1</td>
<td>NaN</td>
<td>356.0</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>8590</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>no</td>
<td>submitted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; institutional_repository ; ...</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>8591</td>
<td>2475-9953</td>
<td>1</td>
<td>NaN</td>
<td>580.0</td>
</tr>
<tr>
<td>8591</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>no</td>
<td>accepted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; institutional_repository ; ...</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>8592</td>
<td>2475-9953</td>
<td>2</td>
<td>NaN</td>
<td>580.0</td>
</tr>
<tr>
<td>8592</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>no</td>
<td>published</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; institutional_repository ; ...</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>8593</td>
<td>2475-9953</td>
<td>3</td>
<td>NaN</td>
<td>580.0</td>
</tr>
<tr>
<td>8593</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_repository ; this_journal</td>
<td>Any Repository</td>
<td>Journal Website</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>NaN</td>
<td>NaN</td>
<td>8594</td>
<td>2475-9953</td>
<td>3</td>
<td>1.0</td>
<td>580.0</td>
</tr>
<tr>
<td>8594</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_repository ; this_journal</td>
<td>Any Repository</td>
<td>Journal Website</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>NaN</td>
<td>NaN</td>
<td>8595</td>
<td>2475-9953</td>
<td>3</td>
<td>1.0</td>
<td>580.0</td>
</tr>
</tbody>
</table>
<p>8595 rows × 33 columns</p>
</div>
```python
sherpa.loc[sherpa['cost_factor'].isna()]
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>journal</th>
<th>issn</th>
<th>sherpa_id</th>
<th>sherpa_uri</th>
<th>open_access_prohibited</th>
<th>additional_oa_fee</th>
<th>article_version</th>
<th>sherpa_code</th>
<th>embargo</th>
<th>prerequisites</th>
<th>prerequisite_funders</th>
<th>prerequisite_funders_name</th>
<th>prerequisite_funders_fundref</th>
<th>prerequisite_funders_ror</th>
<th>prerequisite_funders_country</th>
<th>prerequisite_funders_url</th>
<th>prerequisite_funders_sherpa_id</th>
<th>prerequisite_subjects</th>
<th>location</th>
<th>locations_ir</th>
<th>locations_not_ir</th>
<th>named_repository</th>
<th>named_academic_social_network</th>
<th>copyright_owner</th>
<th>publisher_deposit</th>
<th>archiving</th>
<th>conditions</th>
<th>public_notes</th>
<th>id</th>
<th>issnl</th>
<th>version</th>
<th>licence</th>
<th>cost_factor</th>
</tr>
</thead>
<tbody>
<tr>
<td>93</td>
<td>787</td>
<td>0002-9513</td>
<td>7391</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/11</td>
<td>no</td>
<td>no</td>
<td>submitted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>named_repository ; preprint_repository</td>
<td>NaN</td>
<td>arXiv ; bioRxiv ; Preprint Repository</td>
<td>arXiv ; bioRxiv</td>
<td>NaN</td>
<td>authors</td>
<td>NaN</td>
<td>False</td>
<td>Must be assigned a DOI</td>
<td>Can not be deposited after submission to journal</td>
<td>94</td>
<td>0002-9513</td>
<td>1</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>94</td>
<td>787</td>
<td>0002-9513</td>
<td>7391</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/11</td>
<td>no</td>
<td>no</td>
<td>accepted</td>
<td>NaN</td>
<td>12</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>institutional_repository</td>
<td>Institutional Repository</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>publishers</td>
<td>NaN</td>
<td>True</td>
<td>Must link to publisher version with DOI</td>
<td>NaN</td>
<td>95</td>
<td>0002-9513</td>
<td>2</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>95</td>
<td>787</td>
<td>0002-9513</td>
<td>7391</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/11</td>
<td>no</td>
<td>no</td>
<td>published</td>
<td>NaN</td>
<td>12</td>
<td>NaN</td>
<td>True</td>
<td>National Institutes of Health (NIH)</td>
<td>http://dx.doi.org/10.13039/100000002</td>
<td>https://ror.org/01cwqze88</td>
<td>us</td>
<td>http://www.nih.gov/</td>
<td>9.0</td>
<td>NaN</td>
<td>named_repository</td>
<td>NaN</td>
<td>PubMed Central</td>
<td>PubMed Central</td>
<td>NaN</td>
<td>publishers</td>
<td>disciplinary (PubMed Central) ;</td>
<td>False</td>
<td>Must link to publisher version with DOI</td>
<td>NaN</td>
<td>96</td>
<td>0002-9513</td>
<td>3</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>96</td>
<td>787</td>
<td>0002-9513</td>
<td>7391</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/11</td>
<td>no</td>
<td>no</td>
<td>published</td>
<td>NaN</td>
<td>12</td>
<td>NaN</td>
<td>True</td>
<td>Wellcome Trust</td>
<td>http://dx.doi.org/10.13039/100004440</td>
<td>https://ror.org/029chgv08</td>
<td>gb</td>
<td>http://www.wellcome.ac.uk/</td>
<td>695.0</td>
<td>NaN</td>
<td>named_repository</td>
<td>NaN</td>
<td>PubMed Central</td>
<td>PubMed Central</td>
<td>NaN</td>
<td>publishers</td>
<td>disciplinary (PubMed Central) ;</td>
<td>False</td>
<td>Must link to publisher version with DOI</td>
<td>NaN</td>
<td>97</td>
<td>0002-9513</td>
<td>3</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>97</td>
<td>787</td>
<td>0002-9513</td>
<td>7391</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/11</td>
<td>no</td>
<td>no</td>
<td>published</td>
<td>NaN</td>
<td>12</td>
<td>NaN</td>
<td>True</td>
<td>Medical Research Council (MRC)</td>
<td>http://dx.doi.org/10.13039/501100000265</td>
<td>https://ror.org/03x94j517</td>
<td>gb</td>
<td>http://www.mrc.ac.uk/index.htm</td>
<td>705.0</td>
<td>NaN</td>
<td>named_repository</td>
<td>NaN</td>
<td>PubMed Central</td>
<td>PubMed Central</td>
<td>NaN</td>
<td>publishers</td>
<td>disciplinary (PubMed Central) ;</td>
<td>False</td>
<td>Must link to publisher version with DOI</td>
<td>NaN</td>
<td>98</td>
<td>0002-9513</td>
<td>3</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>8199</td>
<td>565</td>
<td>1661-8157</td>
<td>8459</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/3494</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by_nd</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_website ; named_repository ; subject_repos...</td>
<td>Any Website</td>
<td>PubMed Central ; Subject Repository ; Journal ...</td>
<td>PubMed Central</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Published source must be acknowledged with cit...</td>
<td>NaN</td>
<td>8200</td>
<td>1661-8157</td>
<td>3</td>
<td>8.0</td>
<td>NaN</td>
</tr>
<tr>
<td>8200</td>
<td>565</td>
<td>1661-8157</td>
<td>8459</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/3494</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by_nc_nd</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_website ; named_repository ; subject_repos...</td>
<td>Any Website</td>
<td>PubMed Central ; Subject Repository ; Journal ...</td>
<td>PubMed Central</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Published source must be acknowledged with cit...</td>
<td>NaN</td>
<td>8201</td>
<td>1661-8157</td>
<td>3</td>
<td>2.0</td>
<td>NaN</td>
</tr>
<tr>
<td>8373</td>
<td>530</td>
<td>1946-6234</td>
<td>11116</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/3</td>
<td>no</td>
<td>no</td>
<td>submitted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>named_repository ; preprint_repository</td>
<td>NaN</td>
<td>arXiv ; bioRxiv ; Preprint Repository</td>
<td>arXiv ; bioRxiv</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>False</td>
<td>May be considered prior publication, contact j...</td>
<td>NaN</td>
<td>8374</td>
<td>1946-6234</td>
<td>1</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>8374</td>
<td>530</td>
<td>1946-6234</td>
<td>11116</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/3</td>
<td>no</td>
<td>no</td>
<td>accepted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; institutional_repository</td>
<td>Institutional Repository</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>NaN</td>
<td>authors</td>
<td>NaN</td>
<td>True</td>
<td>Published source must be acknowledged with DOI...</td>
<td>NaN</td>
<td>8375</td>
<td>1946-6234</td>
<td>2</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>8375</td>
<td>530</td>
<td>1946-6234</td>
<td>11116</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/3</td>
<td>no</td>
<td>no</td>
<td>accepted</td>
<td>NaN</td>
<td>6</td>
<td>when_required_by_funder</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>funder_designated_location ; named_repository</td>
<td>NaN</td>
<td>Funder Designated Location ; PubMed Central</td>
<td>PubMed Central</td>
<td>NaN</td>
<td>authors</td>
<td>NaN</td>
<td>False</td>
<td>Must state on submission Funding agency requir...</td>
<td>NaN</td>
<td>8376</td>
<td>1946-6234</td>
<td>2</td>
<td>NaN</td>
<td>NaN</td>
</tr>
</tbody>
</table>
<p>631 rows × 33 columns</p>
</div>
```python
# garder les APCs pour la version published
sherpa.loc[sherpa['article_version'] != 'published', 'cost_factor'] = np.nan
sherpa.loc[sherpa['cost_factor'].notna()]
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>journal</th>
<th>issn</th>
<th>sherpa_id</th>
<th>sherpa_uri</th>
<th>open_access_prohibited</th>
<th>additional_oa_fee</th>
<th>article_version</th>
<th>sherpa_code</th>
<th>embargo</th>
<th>prerequisites</th>
<th>prerequisite_funders</th>
<th>prerequisite_funders_name</th>
<th>prerequisite_funders_fundref</th>
<th>prerequisite_funders_ror</th>
<th>prerequisite_funders_country</th>
<th>prerequisite_funders_url</th>
<th>prerequisite_funders_sherpa_id</th>
<th>prerequisite_subjects</th>
<th>location</th>
<th>locations_ir</th>
<th>locations_not_ir</th>
<th>named_repository</th>
<th>named_academic_social_network</th>
<th>copyright_owner</th>
<th>publisher_deposit</th>
<th>archiving</th>
<th>conditions</th>
<th>public_notes</th>
<th>id</th>
<th>issnl</th>
<th>version</th>
<th>licence</th>
<th>cost_factor</th>
</tr>
</thead>
<tbody>
<tr>
<td>2</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/3315</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_website ; institutional_repository ; named...</td>
<td>Any Website ; Institutional Repository</td>
<td>PubMed Central ; Subject Repository ; Journal ...</td>
<td>PubMed Central</td>
<td>NaN</td>
<td>authors</td>
<td>disciplinary (PubMed Central) ;</td>
<td>True</td>
<td>Published source must be acknowledged</td>
<td>NaN</td>
<td>3</td>
<td>0001-2815</td>
<td>3</td>
<td>1.0</td>
<td>355.0</td>
</tr>
<tr>
<td>3</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/3315</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by_nc_nd</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_website ; named_repository ; non_commercia...</td>
<td>Any Website ; Non-Commercial Institutional Rep...</td>
<td>PubMed Central ; Non-Commercial Subject Reposi...</td>
<td>PubMed Central</td>
<td>NaN</td>
<td>authors</td>
<td>disciplinary (PubMed Central) ;</td>
<td>True</td>
<td>Published source must be acknowledged</td>
<td>NaN</td>
<td>4</td>
<td>0001-2815</td>
<td>3</td>
<td>2.0</td>
<td>355.0</td>
</tr>
<tr>
<td>6</td>
<td>498</td>
<td>0001-4842</td>
<td>7760</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/4</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>funder_designated_location ; named_repository ...</td>
<td>NaN</td>
<td>Funder Designated Location ; PubMed Central ; ...</td>
<td>PubMed Central</td>
<td>NaN</td>
<td>publishers</td>
<td>disciplinary (PubMed Central) ;</td>
<td>False</td>
<td>NaN</td>
<td>NaN</td>
<td>7</td>
<td>0001-4842</td>
<td>3</td>
<td>1.0</td>
<td>356.0</td>
</tr>
<tr>
<td>7</td>
<td>498</td>
<td>0001-4842</td>
<td>7760</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/4</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by_nc_nd</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>funder_designated_location ; named_repository ...</td>
<td>NaN</td>
<td>Funder Designated Location ; PubMed Central ; ...</td>
<td>PubMed Central</td>
<td>NaN</td>
<td>publishers</td>
<td>disciplinary (PubMed Central) ;</td>
<td>False</td>
<td>NaN</td>
<td>NaN</td>
<td>8</td>
<td>0001-4842</td>
<td>3</td>
<td>2.0</td>
<td>356.0</td>
</tr>
<tr>
<td>8</td>
<td>498</td>
<td>0001-4842</td>
<td>7760</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/4</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>bespoke_license</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>funder_designated_location ; named_repository ...</td>
<td>NaN</td>
<td>Funder Designated Location ; PubMed Central ; ...</td>
<td>PubMed Central</td>
<td>NaN</td>
<td>publishers</td>
<td>disciplinary (PubMed Central) ;</td>
<td>False</td>
<td>NaN</td>
<td>NaN</td>
<td>9</td>
<td>0001-4842</td>
<td>3</td>
<td>3.0</td>
<td>356.0</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>8588</td>
<td>533</td>
<td>2470-0045</td>
<td>31531</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_repository ; this_journal</td>
<td>Any Repository</td>
<td>Journal Website</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>NaN</td>
<td>NaN</td>
<td>8589</td>
<td>2470-0045</td>
<td>3</td>
<td>1.0</td>
<td>579.0</td>
</tr>
<tr>
<td>8589</td>
<td>533</td>
<td>2470-0045</td>
<td>31531</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_repository ; this_journal</td>
<td>Any Repository</td>
<td>Journal Website</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>NaN</td>
<td>NaN</td>
<td>8590</td>
<td>2470-0045</td>
<td>3</td>
<td>1.0</td>
<td>579.0</td>
</tr>
<tr>
<td>8592</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>no</td>
<td>published</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; institutional_repository ; ...</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>8593</td>
<td>2475-9953</td>
<td>3</td>
<td>NaN</td>
<td>580.0</td>
</tr>
<tr>
<td>8593</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_repository ; this_journal</td>
<td>Any Repository</td>
<td>Journal Website</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>NaN</td>
<td>NaN</td>
<td>8594</td>
<td>2475-9953</td>
<td>3</td>
<td>1.0</td>
<td>580.0</td>
</tr>
<tr>
<td>8594</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_repository ; this_journal</td>
<td>Any Repository</td>
<td>Journal Website</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>NaN</td>
<td>NaN</td>
<td>8595</td>
<td>2475-9953</td>
<td>3</td>
<td>1.0</td>
<td>580.0</td>
</tr>
</tbody>
</table>
<p>4462 rows × 33 columns</p>
</div>
```python
# renommer l'id du fichier sherpa brut
# cost_factor_ids_all = cost_factor_ids_all.rename(columns = {'id' : 'id_sherpa'})
cost_factor_ids_all = cost_factor_ids_all.rename(columns = {'cost_factor' : 'id'})
cost_factor_ids_all
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>issn</th>
<th>issnl</th>
<th>cost_factor_type</th>
<th>amount</th>
<th>symbol</th>
<th>comment</th>
<th>id</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1083-351X</td>
<td>0021-9258</td>
<td>1</td>
<td>2500</td>
<td>USD</td>
<td>Source: DOAJ</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>1536-5964</td>
<td>0025-7974</td>
<td>1</td>
<td>1950</td>
<td>USD</td>
<td>Source: DOAJ</td>
<td>2</td>
</tr>
<tr>
<td>2</td>
<td>1592-8721</td>
<td>0390-6078</td>
<td>1</td>
<td>2000</td>
<td>EUR</td>
<td>Source: DOAJ</td>
<td>3</td>
</tr>
<tr>
<td>3</td>
<td>1555-3892</td>
<td>0963-6897</td>
<td>1</td>
<td>2750</td>
<td>USD</td>
<td>Source: DOAJ</td>
<td>4</td>
</tr>
<tr>
<td>4</td>
<td>1095-9572</td>
<td>1053-8119</td>
<td>1</td>
<td>3000</td>
<td>USD</td>
<td>Source: DOAJ</td>
<td>5</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>575</td>
<td>2469-9926</td>
<td>2469-9926</td>
<td>1</td>
<td>2156.51</td>
<td>EUR</td>
<td>Source: OpenAPC (2015)</td>
<td>576</td>
</tr>
<tr>
<td>576</td>
<td>2469-9950</td>
<td>2469-9950</td>
<td>1</td>
<td>2143.51</td>
<td>EUR</td>
<td>Source: OpenAPC (2016)</td>
<td>577</td>
</tr>
<tr>
<td>577</td>
<td>2470-0010</td>
<td>2470-0010</td>
<td>1</td>
<td>1763.13</td>
<td>EUR</td>
<td>Source: OpenAPC (2016)</td>
<td>578</td>
</tr>
<tr>
<td>578</td>
<td>2470-0045</td>
<td>2470-0045</td>
<td>1</td>
<td>1211.45</td>
<td>EUR</td>
<td>Source: OpenAPC (2016)</td>
<td>579</td>
</tr>
<tr>
<td>579</td>
<td>2475-9953</td>
<td>2475-9953</td>
<td>1</td>
<td>2023.37</td>
<td>EUR</td>
<td>Source: OpenAPC (2017)</td>
<td>580</td>
</tr>
</tbody>
</table>
<p>580 rows × 7 columns</p>
</div>
```python
cost_factor_ids_all['id'] = cost_factor_ids_all['id'].astype(int)
```
```python
cost_factor_ids_all
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>issn</th>
<th>issnl</th>
<th>cost_factor_type</th>
<th>amount</th>
<th>symbol</th>
<th>comment</th>
<th>id</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1083-351X</td>
<td>0021-9258</td>
<td>1</td>
<td>2500</td>
<td>USD</td>
<td>Source: DOAJ</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>1536-5964</td>
<td>0025-7974</td>
<td>1</td>
<td>1950</td>
<td>USD</td>
<td>Source: DOAJ</td>
<td>2</td>
</tr>
<tr>
<td>2</td>
<td>1592-8721</td>
<td>0390-6078</td>
<td>1</td>
<td>2000</td>
<td>EUR</td>
<td>Source: DOAJ</td>
<td>3</td>
</tr>
<tr>
<td>3</td>
<td>1555-3892</td>
<td>0963-6897</td>
<td>1</td>
<td>2750</td>
<td>USD</td>
<td>Source: DOAJ</td>
<td>4</td>
</tr>
<tr>
<td>4</td>
<td>1095-9572</td>
<td>1053-8119</td>
<td>1</td>
<td>3000</td>
<td>USD</td>
<td>Source: DOAJ</td>
<td>5</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>575</td>
<td>2469-9926</td>
<td>2469-9926</td>
<td>1</td>
<td>2156.51</td>
<td>EUR</td>
<td>Source: OpenAPC (2015)</td>
<td>576</td>
</tr>
<tr>
<td>576</td>
<td>2469-9950</td>
<td>2469-9950</td>
<td>1</td>
<td>2143.51</td>
<td>EUR</td>
<td>Source: OpenAPC (2016)</td>
<td>577</td>
</tr>
<tr>
<td>577</td>
<td>2470-0010</td>
<td>2470-0010</td>
<td>1</td>
<td>1763.13</td>
<td>EUR</td>
<td>Source: OpenAPC (2016)</td>
<td>578</td>
</tr>
<tr>
<td>578</td>
<td>2470-0045</td>
<td>2470-0045</td>
<td>1</td>
<td>1211.45</td>
<td>EUR</td>
<td>Source: OpenAPC (2016)</td>
<td>579</td>
</tr>
<tr>
<td>579</td>
<td>2475-9953</td>
<td>2475-9953</td>
<td>1</td>
<td>2023.37</td>
<td>EUR</td>
<td>Source: OpenAPC (2017)</td>
<td>580</td>
</tr>
</tbody>
</table>
<p>580 rows × 7 columns</p>
</div>
```python
cost_factor_export = cost_factor_ids_all[['id', 'cost_factor_type', 'amount', 'symbol', 'comment']]
cost_factor_export
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id</th>
<th>cost_factor_type</th>
<th>amount</th>
<th>symbol</th>
<th>comment</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>1</td>
<td>2500</td>
<td>USD</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>1</td>
<td>2</td>
<td>1</td>
<td>1950</td>
<td>USD</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>2</td>
<td>3</td>
<td>1</td>
<td>2000</td>
<td>EUR</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>3</td>
<td>4</td>
<td>1</td>
<td>2750</td>
<td>USD</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>4</td>
<td>5</td>
<td>1</td>
<td>3000</td>
<td>USD</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>575</td>
<td>576</td>
<td>1</td>
<td>2156.51</td>
<td>EUR</td>
<td>Source: OpenAPC (2015)</td>
</tr>
<tr>
<td>576</td>
<td>577</td>
<td>1</td>
<td>2143.51</td>
<td>EUR</td>
<td>Source: OpenAPC (2016)</td>
</tr>
<tr>
<td>577</td>
<td>578</td>
<td>1</td>
<td>1763.13</td>
<td>EUR</td>
<td>Source: OpenAPC (2016)</td>
</tr>
<tr>
<td>578</td>
<td>579</td>
<td>1</td>
<td>1211.45</td>
<td>EUR</td>
<td>Source: OpenAPC (2016)</td>
</tr>
<tr>
<td>579</td>
<td>580</td>
<td>1</td>
<td>2023.37</td>
<td>EUR</td>
<td>Source: OpenAPC (2017)</td>
</tr>
</tbody>
</table>
<p>580 rows × 5 columns</p>
</div>
```python
cost_factor_export.shape[0]
```
580
```python
# ajout de la valeur Rabais 100% pour les licences Read & Publish
rpid = cost_factor_export.shape[0] + 1
cost_factor_export = cost_factor_export.append({'id' : rpid, 'cost_factor_type' : 2, 'amount' : 100, 'symbol' : '%', 'comment' : 'Read & Publish agreement'}, ignore_index=True)
cost_factor_export
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id</th>
<th>cost_factor_type</th>
<th>amount</th>
<th>symbol</th>
<th>comment</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>1</td>
<td>2500</td>
<td>USD</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>1</td>
<td>2</td>
<td>1</td>
<td>1950</td>
<td>USD</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>2</td>
<td>3</td>
<td>1</td>
<td>2000</td>
<td>EUR</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>3</td>
<td>4</td>
<td>1</td>
<td>2750</td>
<td>USD</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>4</td>
<td>5</td>
<td>1</td>
<td>3000</td>
<td>USD</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>576</td>
<td>577</td>
<td>1</td>
<td>2143.51</td>
<td>EUR</td>
<td>Source: OpenAPC (2016)</td>
</tr>
<tr>
<td>577</td>
<td>578</td>
<td>1</td>
<td>1763.13</td>
<td>EUR</td>
<td>Source: OpenAPC (2016)</td>
</tr>
<tr>
<td>578</td>
<td>579</td>
<td>1</td>
<td>1211.45</td>
<td>EUR</td>
<td>Source: OpenAPC (2016)</td>
</tr>
<tr>
<td>579</td>
<td>580</td>
<td>1</td>
<td>2023.37</td>
<td>EUR</td>
<td>Source: OpenAPC (2017)</td>
</tr>
<tr>
<td>580</td>
<td>581</td>
<td>2</td>
<td>100</td>
<td>%</td>
<td>Read &amp; Publish agreement</td>
</tr>
</tbody>
</table>
<p>581 rows × 5 columns</p>
</div>
```python
# ajout de l'id dans la table read & publish
rp['cost_factor'] = rpid
rp
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>issn</th>
<th>title</th>
<th>archiving</th>
<th>article_version</th>
<th>embargo_months</th>
<th>sherpa_code</th>
<th>valid_from</th>
<th>valid_until</th>
<th>issnl</th>
<th>ror</th>
<th>journal</th>
<th>rp_id</th>
<th>rp_publisher</th>
<th>version</th>
<th>licence</th>
<th>cost_factor</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/04d8ztx87</td>
<td>899.0</td>
<td>1</td>
<td>Elsevier</td>
<td>3</td>
<td>1</td>
<td>581</td>
</tr>
<tr>
<td>1</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/02bnkt322</td>
<td>899.0</td>
<td>2</td>
<td>Elsevier</td>
<td>3</td>
<td>1</td>
<td>581</td>
</tr>
<tr>
<td>2</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/00zg4za48</td>
<td>899.0</td>
<td>3</td>
<td>Elsevier</td>
<td>3</td>
<td>1</td>
<td>581</td>
</tr>
<tr>
<td>3</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/02s376052</td>
<td>899.0</td>
<td>4</td>
<td>Elsevier</td>
<td>3</td>
<td>1</td>
<td>581</td>
</tr>
<tr>
<td>4</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/05a28rw58</td>
<td>899.0</td>
<td>5</td>
<td>Elsevier</td>
<td>3</td>
<td>1</td>
<td>581</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>40078</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/01swzsf04</td>
<td>592.0</td>
<td>40079</td>
<td>CUP</td>
<td>3</td>
<td>5</td>
<td>581</td>
</tr>
<tr>
<td>40079</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/019whta54</td>
<td>592.0</td>
<td>40080</td>
<td>CUP</td>
<td>3</td>
<td>5</td>
<td>581</td>
</tr>
<tr>
<td>40080</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/00vasag41</td>
<td>592.0</td>
<td>40081</td>
<td>CUP</td>
<td>3</td>
<td>5</td>
<td>581</td>
</tr>
<tr>
<td>40081</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/05r0ap620</td>
<td>592.0</td>
<td>40082</td>
<td>CUP</td>
<td>3</td>
<td>5</td>
<td>581</td>
</tr>
<tr>
<td>40082</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/05pmsvm27</td>
<td>592.0</td>
<td>40083</td>
<td>CUP</td>
<td>3</td>
<td>5</td>
<td>581</td>
</tr>
</tbody>
</table>
<p>40083 rows × 16 columns</p>
</div>
```python
# ajout de la valeur UNKNOWN
cost_factor_export = cost_factor_export.append({'id' : 999999, 'cost_factor_type' : 999999, 'amount' : 0, 'symbol' : '', 'comment' : 'UNKNOWN'}, ignore_index=True)
cost_factor_export
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id</th>
<th>cost_factor_type</th>
<th>amount</th>
<th>symbol</th>
<th>comment</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>1</td>
<td>2500</td>
<td>USD</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>1</td>
<td>2</td>
<td>1</td>
<td>1950</td>
<td>USD</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>2</td>
<td>3</td>
<td>1</td>
<td>2000</td>
<td>EUR</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>3</td>
<td>4</td>
<td>1</td>
<td>2750</td>
<td>USD</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>4</td>
<td>5</td>
<td>1</td>
<td>3000</td>
<td>USD</td>
<td>Source: DOAJ</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>577</td>
<td>578</td>
<td>1</td>
<td>1763.13</td>
<td>EUR</td>
<td>Source: OpenAPC (2016)</td>
</tr>
<tr>
<td>578</td>
<td>579</td>
<td>1</td>
<td>1211.45</td>
<td>EUR</td>
<td>Source: OpenAPC (2016)</td>
</tr>
<tr>
<td>579</td>
<td>580</td>
<td>1</td>
<td>2023.37</td>
<td>EUR</td>
<td>Source: OpenAPC (2017)</td>
</tr>
<tr>
<td>580</td>
<td>581</td>
<td>2</td>
<td>100</td>
<td>%</td>
<td>Read &amp; Publish agreement</td>
</tr>
<tr>
<td>581</td>
<td>999999</td>
<td>999999</td>
<td>0</td>
<td></td>
<td>UNKNOWN</td>
</tr>
</tbody>
</table>
<p>582 rows × 5 columns</p>
</div>
```python
# export de la table
result = cost_factor_export.to_json(orient='records', force_ascii=False)
parsed = json.loads(result)
with open('sample/cost_factor.json', 'w', encoding='utf-8') as file:
json.dump(parsed, file, indent=2, ensure_ascii=False)
```
```python
# export csv
cost_factor_export.to_csv('sample/cost_factor.tsv', index=False)
```
```python
# export excel
cost_factor_export.to_excel('sample/cost_factor.xlsx', index=False)
```
## Table term
```python
sherpa
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>journal</th>
<th>issn</th>
<th>sherpa_id</th>
<th>sherpa_uri</th>
<th>open_access_prohibited</th>
<th>additional_oa_fee</th>
<th>article_version</th>
<th>sherpa_code</th>
<th>embargo</th>
<th>prerequisites</th>
<th>prerequisite_funders</th>
<th>prerequisite_funders_name</th>
<th>prerequisite_funders_fundref</th>
<th>prerequisite_funders_ror</th>
<th>prerequisite_funders_country</th>
<th>prerequisite_funders_url</th>
<th>prerequisite_funders_sherpa_id</th>
<th>prerequisite_subjects</th>
<th>location</th>
<th>locations_ir</th>
<th>locations_not_ir</th>
<th>named_repository</th>
<th>named_academic_social_network</th>
<th>copyright_owner</th>
<th>publisher_deposit</th>
<th>archiving</th>
<th>conditions</th>
<th>public_notes</th>
<th>id</th>
<th>issnl</th>
<th>version</th>
<th>licence</th>
<th>cost_factor</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/2050</td>
<td>no</td>
<td>no</td>
<td>submitted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; named_repository ; non_comm...</td>
<td>Non-Commercial Institutional Repository</td>
<td>Author's Homepage ; arXiv ; AgEcon ; PhilPaper...</td>
<td>arXiv ; AgEcon ; PhilPapers ; PubMed Central ;...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must acknowledge acceptance for publication ; ...</td>
<td>NaN</td>
<td>1</td>
<td>0001-2815</td>
<td>1</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>1</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/2050</td>
<td>no</td>
<td>no</td>
<td>accepted</td>
<td>NaN</td>
<td>12</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; named_repository ; non_comm...</td>
<td>Non-Commercial Institutional Repository</td>
<td>Author's Homepage ; arXiv ; AgEcon ; PhilPaper...</td>
<td>arXiv ; AgEcon ; PhilPapers ; PubMed Central ;...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Publisher source must be acknowledged with cit...</td>
<td>NaN</td>
<td>2</td>
<td>0001-2815</td>
<td>2</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>2</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/3315</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_website ; institutional_repository ; named...</td>
<td>Any Website ; Institutional Repository</td>
<td>PubMed Central ; Subject Repository ; Journal ...</td>
<td>PubMed Central</td>
<td>NaN</td>
<td>authors</td>
<td>disciplinary (PubMed Central) ;</td>
<td>True</td>
<td>Published source must be acknowledged</td>
<td>NaN</td>
<td>3</td>
<td>0001-2815</td>
<td>3</td>
<td>1.0</td>
<td>355.0</td>
</tr>
<tr>
<td>3</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/3315</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by_nc_nd</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_website ; named_repository ; non_commercia...</td>
<td>Any Website ; Non-Commercial Institutional Rep...</td>
<td>PubMed Central ; Non-Commercial Subject Reposi...</td>
<td>PubMed Central</td>
<td>NaN</td>
<td>authors</td>
<td>disciplinary (PubMed Central) ;</td>
<td>True</td>
<td>Published source must be acknowledged</td>
<td>NaN</td>
<td>4</td>
<td>0001-2815</td>
<td>3</td>
<td>2.0</td>
<td>355.0</td>
</tr>
<tr>
<td>4</td>
<td>498</td>
<td>0001-4842</td>
<td>7760</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/4</td>
<td>no</td>
<td>no</td>
<td>submitted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>named_repository ; preprint_repository ; subje...</td>
<td>NaN</td>
<td>ChemRxiv ; bioRxiv ; arXiv ; Preprint Reposito...</td>
<td>ChemRxiv ; bioRxiv ; arXiv</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>False</td>
<td>Must not violate ACS ethical Guidelines ; Must...</td>
<td>NaN</td>
<td>5</td>
<td>0001-4842</td>
<td>1</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>8590</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>no</td>
<td>submitted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; institutional_repository ; ...</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>8591</td>
<td>2475-9953</td>
<td>1</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>8591</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>no</td>
<td>accepted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; institutional_repository ; ...</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>8592</td>
<td>2475-9953</td>
<td>2</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>8592</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>no</td>
<td>published</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; institutional_repository ; ...</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>8593</td>
<td>2475-9953</td>
<td>3</td>
<td>NaN</td>
<td>580.0</td>
</tr>
<tr>
<td>8593</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_repository ; this_journal</td>
<td>Any Repository</td>
<td>Journal Website</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>NaN</td>
<td>NaN</td>
<td>8594</td>
<td>2475-9953</td>
<td>3</td>
<td>1.0</td>
<td>580.0</td>
</tr>
<tr>
<td>8594</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_repository ; this_journal</td>
<td>Any Repository</td>
<td>Journal Website</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>NaN</td>
<td>NaN</td>
<td>8595</td>
<td>2475-9953</td>
<td>3</td>
<td>1.0</td>
<td>580.0</td>
</tr>
</tbody>
</table>
<p>8595 rows × 33 columns</p>
</div>
```python
# col_names = ['id', 'applicable_version', 'cost_factor', 'embargo', 'archiving']
term_sherpa = sherpa[['id', 'version', 'cost_factor', 'embargo', 'archiving', 'locations_ir', 'locations_not_ir', 'licence', 'journal', 'conditions', 'public_notes', 'prerequisite_funders', 'prerequisite_funders_ror']]
term_sherpa
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id</th>
<th>version</th>
<th>cost_factor</th>
<th>embargo</th>
<th>archiving</th>
<th>locations_ir</th>
<th>locations_not_ir</th>
<th>licence</th>
<th>journal</th>
<th>conditions</th>
<th>public_notes</th>
<th>prerequisite_funders</th>
<th>prerequisite_funders_ror</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>1</td>
<td>NaN</td>
<td>0</td>
<td>True</td>
<td>Non-Commercial Institutional Repository</td>
<td>Author's Homepage ; arXiv ; AgEcon ; PhilPaper...</td>
<td>NaN</td>
<td>532</td>
<td>Must acknowledge acceptance for publication ; ...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>1</td>
<td>2</td>
<td>2</td>
<td>NaN</td>
<td>12</td>
<td>True</td>
<td>Non-Commercial Institutional Repository</td>
<td>Author's Homepage ; arXiv ; AgEcon ; PhilPaper...</td>
<td>NaN</td>
<td>532</td>
<td>Publisher source must be acknowledged with cit...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>2</td>
<td>3</td>
<td>3</td>
<td>355.0</td>
<td>0</td>
<td>True</td>
<td>Any Website ; Institutional Repository</td>
<td>PubMed Central ; Subject Repository ; Journal ...</td>
<td>1.0</td>
<td>532</td>
<td>Published source must be acknowledged</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>3</td>
<td>4</td>
<td>3</td>
<td>355.0</td>
<td>0</td>
<td>True</td>
<td>Any Website ; Non-Commercial Institutional Rep...</td>
<td>PubMed Central ; Non-Commercial Subject Reposi...</td>
<td>2.0</td>
<td>532</td>
<td>Published source must be acknowledged</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>4</td>
<td>5</td>
<td>1</td>
<td>NaN</td>
<td>0</td>
<td>False</td>
<td>NaN</td>
<td>ChemRxiv ; bioRxiv ; arXiv ; Preprint Reposito...</td>
<td>NaN</td>
<td>498</td>
<td>Must not violate ACS ethical Guidelines ; Must...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>8590</td>
<td>8591</td>
<td>1</td>
<td>NaN</td>
<td>0</td>
<td>True</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>608</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>8591</td>
<td>8592</td>
<td>2</td>
<td>NaN</td>
<td>0</td>
<td>True</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>608</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>8592</td>
<td>8593</td>
<td>3</td>
<td>580.0</td>
<td>0</td>
<td>True</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>608</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>8593</td>
<td>8594</td>
<td>3</td>
<td>580.0</td>
<td>0</td>
<td>True</td>
<td>Any Repository</td>
<td>Journal Website</td>
<td>1.0</td>
<td>608</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>8594</td>
<td>8595</td>
<td>3</td>
<td>580.0</td>
<td>0</td>
<td>True</td>
<td>Any Repository</td>
<td>Journal Website</td>
<td>1.0</td>
<td>608</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
</tr>
</tbody>
</table>
<p>8595 rows × 13 columns</p>
</div>
```python
# renommer les champs
term_sherpa = term_sherpa.rename(columns = {'id' : 'id_sherpa', 'embargo' : 'embargo_months', 'prerequisite_funders_ror' : 'ror'})
term_sherpa
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id_sherpa</th>
<th>version</th>
<th>cost_factor</th>
<th>embargo_months</th>
<th>archiving</th>
<th>locations_ir</th>
<th>locations_not_ir</th>
<th>licence</th>
<th>journal</th>
<th>conditions</th>
<th>public_notes</th>
<th>prerequisite_funders</th>
<th>ror</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>1</td>
<td>NaN</td>
<td>0</td>
<td>True</td>
<td>Non-Commercial Institutional Repository</td>
<td>Author's Homepage ; arXiv ; AgEcon ; PhilPaper...</td>
<td>NaN</td>
<td>532</td>
<td>Must acknowledge acceptance for publication ; ...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>1</td>
<td>2</td>
<td>2</td>
<td>NaN</td>
<td>12</td>
<td>True</td>
<td>Non-Commercial Institutional Repository</td>
<td>Author's Homepage ; arXiv ; AgEcon ; PhilPaper...</td>
<td>NaN</td>
<td>532</td>
<td>Publisher source must be acknowledged with cit...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>2</td>
<td>3</td>
<td>3</td>
<td>355.0</td>
<td>0</td>
<td>True</td>
<td>Any Website ; Institutional Repository</td>
<td>PubMed Central ; Subject Repository ; Journal ...</td>
<td>1.0</td>
<td>532</td>
<td>Published source must be acknowledged</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>3</td>
<td>4</td>
<td>3</td>
<td>355.0</td>
<td>0</td>
<td>True</td>
<td>Any Website ; Non-Commercial Institutional Rep...</td>
<td>PubMed Central ; Non-Commercial Subject Reposi...</td>
<td>2.0</td>
<td>532</td>
<td>Published source must be acknowledged</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>4</td>
<td>5</td>
<td>1</td>
<td>NaN</td>
<td>0</td>
<td>False</td>
<td>NaN</td>
<td>ChemRxiv ; bioRxiv ; arXiv ; Preprint Reposito...</td>
<td>NaN</td>
<td>498</td>
<td>Must not violate ACS ethical Guidelines ; Must...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>8590</td>
<td>8591</td>
<td>1</td>
<td>NaN</td>
<td>0</td>
<td>True</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>608</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>8591</td>
<td>8592</td>
<td>2</td>
<td>NaN</td>
<td>0</td>
<td>True</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>608</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>8592</td>
<td>8593</td>
<td>3</td>
<td>580.0</td>
<td>0</td>
<td>True</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>608</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>8593</td>
<td>8594</td>
<td>3</td>
<td>580.0</td>
<td>0</td>
<td>True</td>
<td>Any Repository</td>
<td>Journal Website</td>
<td>1.0</td>
<td>608</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>8594</td>
<td>8595</td>
<td>3</td>
<td>580.0</td>
<td>0</td>
<td>True</td>
<td>Any Repository</td>
<td>Journal Website</td>
<td>1.0</td>
<td>608</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
</tr>
</tbody>
</table>
<p>8595 rows × 13 columns</p>
</div>
```python
# merge des champs dans le comment : conditions, public_notes, locations_not_ir
term_sherpa['conditions'] = term_sherpa['conditions'].fillna('')
term_sherpa['public_notes'] = term_sherpa['public_notes'].fillna('')
term_sherpa['locations_not_ir'] = term_sherpa['locations_not_ir'].fillna('')
term_sherpa['locations_ir'] = term_sherpa['locations_ir'].fillna('')
term_sherpa.loc[term_sherpa['locations_not_ir'] != '', 'locations_not_ir'] = 'Non institutional archiving locations: ' + term_sherpa['locations_not_ir']
term_sherpa.loc[term_sherpa['locations_ir'] != '', 'locations_ir'] = 'Institutional archiving locations: ' + term_sherpa['locations_ir']
term_sherpa.loc[term_sherpa['archiving'] == False, 'comment'] = term_sherpa['locations_not_ir']
term_sherpa.loc[term_sherpa['archiving'] == True, 'comment'] = term_sherpa['locations_ir']
term_sherpa.loc[term_sherpa['comment'] == '', 'comment'] = 'Conditions: ' + term_sherpa['conditions']
term_sherpa.loc[(term_sherpa['comment'] != '') & (term_sherpa['conditions'] != ''), 'comment'] = term_sherpa['comment'] + ' ; Conditions: ' + term_sherpa['conditions']
term_sherpa.loc[(term_sherpa['public_notes'] != '') & (term_sherpa['public_notes'] != term_sherpa['comment']), 'comment'] = term_sherpa['comment'] + ' ; Public notes: ' + term_sherpa['public_notes']
term_sherpa.loc[(term_sherpa['public_notes'] != '') & (term_sherpa['comment'] == ''), 'comment'] = 'Public notes: ' + term_sherpa['public_notes']
term_sherpa
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id_sherpa</th>
<th>version</th>
<th>cost_factor</th>
<th>embargo_months</th>
<th>archiving</th>
<th>locations_ir</th>
<th>locations_not_ir</th>
<th>licence</th>
<th>journal</th>
<th>conditions</th>
<th>public_notes</th>
<th>prerequisite_funders</th>
<th>ror</th>
<th>comment</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>1</td>
<td>NaN</td>
<td>0</td>
<td>True</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>Non institutional archiving locations: Author'...</td>
<td>NaN</td>
<td>532</td>
<td>Must acknowledge acceptance for publication ; ...</td>
<td></td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Non-Commerc...</td>
</tr>
<tr>
<td>1</td>
<td>2</td>
<td>2</td>
<td>NaN</td>
<td>12</td>
<td>True</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>Non institutional archiving locations: Author'...</td>
<td>NaN</td>
<td>532</td>
<td>Publisher source must be acknowledged with cit...</td>
<td></td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Non-Commerc...</td>
</tr>
<tr>
<td>2</td>
<td>3</td>
<td>3</td>
<td>355.0</td>
<td>0</td>
<td>True</td>
<td>Institutional archiving locations: Any Website...</td>
<td>Non institutional archiving locations: PubMed ...</td>
<td>1.0</td>
<td>532</td>
<td>Published source must be acknowledged</td>
<td></td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Any Website...</td>
</tr>
<tr>
<td>3</td>
<td>4</td>
<td>3</td>
<td>355.0</td>
<td>0</td>
<td>True</td>
<td>Institutional archiving locations: Any Website...</td>
<td>Non institutional archiving locations: PubMed ...</td>
<td>2.0</td>
<td>532</td>
<td>Published source must be acknowledged</td>
<td></td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Any Website...</td>
</tr>
<tr>
<td>4</td>
<td>5</td>
<td>1</td>
<td>NaN</td>
<td>0</td>
<td>False</td>
<td></td>
<td>Non institutional archiving locations: ChemRxi...</td>
<td>NaN</td>
<td>498</td>
<td>Must not violate ACS ethical Guidelines ; Must...</td>
<td></td>
<td>NaN</td>
<td>NaN</td>
<td>Non institutional archiving locations: ChemRxi...</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>8590</td>
<td>8591</td>
<td>1</td>
<td>NaN</td>
<td>0</td>
<td>True</td>
<td>Institutional archiving locations: Institution...</td>
<td>Non institutional archiving locations: Author'...</td>
<td>NaN</td>
<td>608</td>
<td>Must link to published article ; Publisher cop...</td>
<td></td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Institution...</td>
</tr>
<tr>
<td>8591</td>
<td>8592</td>
<td>2</td>
<td>NaN</td>
<td>0</td>
<td>True</td>
<td>Institutional archiving locations: Institution...</td>
<td>Non institutional archiving locations: Author'...</td>
<td>NaN</td>
<td>608</td>
<td>Must link to published article ; Publisher cop...</td>
<td></td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Institution...</td>
</tr>
<tr>
<td>8592</td>
<td>8593</td>
<td>3</td>
<td>580.0</td>
<td>0</td>
<td>True</td>
<td>Institutional archiving locations: Institution...</td>
<td>Non institutional archiving locations: Author'...</td>
<td>NaN</td>
<td>608</td>
<td>Must link to published article ; Publisher cop...</td>
<td></td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Institution...</td>
</tr>
<tr>
<td>8593</td>
<td>8594</td>
<td>3</td>
<td>580.0</td>
<td>0</td>
<td>True</td>
<td>Institutional archiving locations: Any Repository</td>
<td>Non institutional archiving locations: Journal...</td>
<td>1.0</td>
<td>608</td>
<td></td>
<td></td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Any Repository</td>
</tr>
<tr>
<td>8594</td>
<td>8595</td>
<td>3</td>
<td>580.0</td>
<td>0</td>
<td>True</td>
<td>Institutional archiving locations: Any Repository</td>
<td>Non institutional archiving locations: Journal...</td>
<td>1.0</td>
<td>608</td>
<td></td>
<td></td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Any Repository</td>
</tr>
</tbody>
</table>
<p>8595 rows × 14 columns</p>
</div>
```python
term_sherpa['prerequisite_funders'].value_counts()
```
True 5585
Name: prerequisite_funders, dtype: int64
```python
rp
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>issn</th>
<th>title</th>
<th>archiving</th>
<th>article_version</th>
<th>embargo_months</th>
<th>sherpa_code</th>
<th>valid_from</th>
<th>valid_until</th>
<th>issnl</th>
<th>ror</th>
<th>journal</th>
<th>rp_id</th>
<th>rp_publisher</th>
<th>version</th>
<th>licence</th>
<th>cost_factor</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/04d8ztx87</td>
<td>899.0</td>
<td>1</td>
<td>Elsevier</td>
<td>3</td>
<td>1</td>
<td>581</td>
</tr>
<tr>
<td>1</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/02bnkt322</td>
<td>899.0</td>
<td>2</td>
<td>Elsevier</td>
<td>3</td>
<td>1</td>
<td>581</td>
</tr>
<tr>
<td>2</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/00zg4za48</td>
<td>899.0</td>
<td>3</td>
<td>Elsevier</td>
<td>3</td>
<td>1</td>
<td>581</td>
</tr>
<tr>
<td>3</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/02s376052</td>
<td>899.0</td>
<td>4</td>
<td>Elsevier</td>
<td>3</td>
<td>1</td>
<td>581</td>
</tr>
<tr>
<td>4</td>
<td>1742-7061</td>
<td>Acta Biomaterialia</td>
<td>True</td>
<td>published</td>
<td>0</td>
<td>cc_by</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1742-7061</td>
<td>https://ror.org/05a28rw58</td>
<td>899.0</td>
<td>5</td>
<td>Elsevier</td>
<td>3</td>
<td>1</td>
<td>581</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>40078</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/01swzsf04</td>
<td>592.0</td>
<td>40079</td>
<td>CUP</td>
<td>3</td>
<td>5</td>
<td>581</td>
</tr>
<tr>
<td>40079</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/019whta54</td>
<td>592.0</td>
<td>40080</td>
<td>CUP</td>
<td>3</td>
<td>5</td>
<td>581</td>
</tr>
<tr>
<td>40080</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/00vasag41</td>
<td>592.0</td>
<td>40081</td>
<td>CUP</td>
<td>3</td>
<td>5</td>
<td>581</td>
</tr>
<tr>
<td>40081</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/05r0ap620</td>
<td>592.0</td>
<td>40082</td>
<td>CUP</td>
<td>3</td>
<td>5</td>
<td>581</td>
</tr>
<tr>
<td>40082</td>
<td>1435-8115</td>
<td>Microscopy and Microanalysis</td>
<td>True</td>
<td>published</td>
<td>60</td>
<td>cc_by_nc_sa</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1431-9276</td>
<td>https://ror.org/05pmsvm27</td>
<td>592.0</td>
<td>40083</td>
<td>CUP</td>
<td>3</td>
<td>5</td>
<td>581</td>
</tr>
</tbody>
</table>
<p>40083 rows × 16 columns</p>
</div>
```python
term_rp = rp[['rp_id', 'version', 'archiving', 'embargo_months', 'cost_factor', 'licence', 'journal', 'rp_publisher', 'ror', 'valid_from', 'valid_until']]
term_rp
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>rp_id</th>
<th>version</th>
<th>archiving</th>
<th>embargo_months</th>
<th>cost_factor</th>
<th>licence</th>
<th>journal</th>
<th>rp_publisher</th>
<th>ror</th>
<th>valid_from</th>
<th>valid_until</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>3</td>
<td>True</td>
<td>0</td>
<td>581</td>
<td>1</td>
<td>899.0</td>
<td>Elsevier</td>
<td>https://ror.org/04d8ztx87</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>1</td>
<td>2</td>
<td>3</td>
<td>True</td>
<td>0</td>
<td>581</td>
<td>1</td>
<td>899.0</td>
<td>Elsevier</td>
<td>https://ror.org/02bnkt322</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>2</td>
<td>3</td>
<td>3</td>
<td>True</td>
<td>0</td>
<td>581</td>
<td>1</td>
<td>899.0</td>
<td>Elsevier</td>
<td>https://ror.org/00zg4za48</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>3</td>
<td>4</td>
<td>3</td>
<td>True</td>
<td>0</td>
<td>581</td>
<td>1</td>
<td>899.0</td>
<td>Elsevier</td>
<td>https://ror.org/02s376052</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>4</td>
<td>5</td>
<td>3</td>
<td>True</td>
<td>0</td>
<td>581</td>
<td>1</td>
<td>899.0</td>
<td>Elsevier</td>
<td>https://ror.org/05a28rw58</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>40078</td>
<td>40079</td>
<td>3</td>
<td>True</td>
<td>60</td>
<td>581</td>
<td>5</td>
<td>592.0</td>
<td>CUP</td>
<td>https://ror.org/01swzsf04</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>40079</td>
<td>40080</td>
<td>3</td>
<td>True</td>
<td>60</td>
<td>581</td>
<td>5</td>
<td>592.0</td>
<td>CUP</td>
<td>https://ror.org/019whta54</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>40080</td>
<td>40081</td>
<td>3</td>
<td>True</td>
<td>60</td>
<td>581</td>
<td>5</td>
<td>592.0</td>
<td>CUP</td>
<td>https://ror.org/00vasag41</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>40081</td>
<td>40082</td>
<td>3</td>
<td>True</td>
<td>60</td>
<td>581</td>
<td>5</td>
<td>592.0</td>
<td>CUP</td>
<td>https://ror.org/05r0ap620</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>40082</td>
<td>40083</td>
<td>3</td>
<td>True</td>
<td>60</td>
<td>581</td>
<td>5</td>
<td>592.0</td>
<td>CUP</td>
<td>https://ror.org/05pmsvm27</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
</tbody>
</table>
<p>40083 rows × 11 columns</p>
</div>
```python
term_rp['rp_publisher'].value_counts()
```
Elsevier 18128
Wiley 13905
Springer Nature 6716
CUP 920
TF 414
Name: rp_publisher, dtype: int64
```python
term_rp.loc[term_rp['rp_publisher'] == 'Elsevier', 'comment'] = 'Elsevier Read & Publish agreement'
term_rp.loc[term_rp['rp_publisher'] == 'Wiley', 'comment'] = 'Wiley Read & Publish agreement'
term_rp.loc[term_rp['rp_publisher'] == 'TF', 'comment'] = 'Taylor and Francis Read & Publish agreement'
term_rp.loc[term_rp['rp_publisher'] == 'Springer Nature ', 'comment'] = 'Springer Nature Read & Publish agreement'
term_rp.loc[term_rp['rp_publisher'] == 'CUP', 'comment'] = 'Cambridge University Press (CUP) Read & Publish agreement. Article types covered: Research Articles, Review Articles, Rapid Communication, Brief Reports and Case Reports'
del term_rp['rp_publisher']
term_rp
```
C:\Users\iriarte\AppData\Local\Continuum\anaconda3\lib\site-packages\pandas\core\indexing.py:376: SettingWithCopyWarning:
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead
See the caveats in the documentation: http://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
self.obj[key] = _infer_fill_value(value)
C:\Users\iriarte\AppData\Local\Continuum\anaconda3\lib\site-packages\pandas\core\indexing.py:494: SettingWithCopyWarning:
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead
See the caveats in the documentation: http://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
self.obj[item] = s
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>rp_id</th>
<th>version</th>
<th>archiving</th>
<th>embargo_months</th>
<th>cost_factor</th>
<th>licence</th>
<th>journal</th>
<th>ror</th>
<th>valid_from</th>
<th>valid_until</th>
<th>comment</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>3</td>
<td>True</td>
<td>0</td>
<td>581</td>
<td>1</td>
<td>899.0</td>
<td>https://ror.org/04d8ztx87</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>Elsevier Read &amp; Publish agreement</td>
</tr>
<tr>
<td>1</td>
<td>2</td>
<td>3</td>
<td>True</td>
<td>0</td>
<td>581</td>
<td>1</td>
<td>899.0</td>
<td>https://ror.org/02bnkt322</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>Elsevier Read &amp; Publish agreement</td>
</tr>
<tr>
<td>2</td>
<td>3</td>
<td>3</td>
<td>True</td>
<td>0</td>
<td>581</td>
<td>1</td>
<td>899.0</td>
<td>https://ror.org/00zg4za48</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>Elsevier Read &amp; Publish agreement</td>
</tr>
<tr>
<td>3</td>
<td>4</td>
<td>3</td>
<td>True</td>
<td>0</td>
<td>581</td>
<td>1</td>
<td>899.0</td>
<td>https://ror.org/02s376052</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>Elsevier Read &amp; Publish agreement</td>
</tr>
<tr>
<td>4</td>
<td>5</td>
<td>3</td>
<td>True</td>
<td>0</td>
<td>581</td>
<td>1</td>
<td>899.0</td>
<td>https://ror.org/05a28rw58</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>Elsevier Read &amp; Publish agreement</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>40078</td>
<td>40079</td>
<td>3</td>
<td>True</td>
<td>60</td>
<td>581</td>
<td>5</td>
<td>592.0</td>
<td>https://ror.org/01swzsf04</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
</tr>
<tr>
<td>40079</td>
<td>40080</td>
<td>3</td>
<td>True</td>
<td>60</td>
<td>581</td>
<td>5</td>
<td>592.0</td>
<td>https://ror.org/019whta54</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
</tr>
<tr>
<td>40080</td>
<td>40081</td>
<td>3</td>
<td>True</td>
<td>60</td>
<td>581</td>
<td>5</td>
<td>592.0</td>
<td>https://ror.org/00vasag41</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
</tr>
<tr>
<td>40081</td>
<td>40082</td>
<td>3</td>
<td>True</td>
<td>60</td>
<td>581</td>
<td>5</td>
<td>592.0</td>
<td>https://ror.org/05r0ap620</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
</tr>
<tr>
<td>40082</td>
<td>40083</td>
<td>3</td>
<td>True</td>
<td>60</td>
<td>581</td>
<td>5</td>
<td>592.0</td>
<td>https://ror.org/05pmsvm27</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
</tr>
</tbody>
</table>
<p>40083 rows × 11 columns</p>
</div>
```python
# cocnat de deux tables
term_orig = term_sherpa[['id_sherpa', 'version', 'cost_factor', 'embargo_months', 'archiving', 'licence', 'journal', 'prerequisite_funders', 'ror', 'comment']]
term_orig
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id_sherpa</th>
<th>version</th>
<th>cost_factor</th>
<th>embargo_months</th>
<th>archiving</th>
<th>licence</th>
<th>journal</th>
<th>prerequisite_funders</th>
<th>ror</th>
<th>comment</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>1</td>
<td>NaN</td>
<td>0</td>
<td>True</td>
<td>NaN</td>
<td>532</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Non-Commerc...</td>
</tr>
<tr>
<td>1</td>
<td>2</td>
<td>2</td>
<td>NaN</td>
<td>12</td>
<td>True</td>
<td>NaN</td>
<td>532</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Non-Commerc...</td>
</tr>
<tr>
<td>2</td>
<td>3</td>
<td>3</td>
<td>355.0</td>
<td>0</td>
<td>True</td>
<td>1.0</td>
<td>532</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Any Website...</td>
</tr>
<tr>
<td>3</td>
<td>4</td>
<td>3</td>
<td>355.0</td>
<td>0</td>
<td>True</td>
<td>2.0</td>
<td>532</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Any Website...</td>
</tr>
<tr>
<td>4</td>
<td>5</td>
<td>1</td>
<td>NaN</td>
<td>0</td>
<td>False</td>
<td>NaN</td>
<td>498</td>
<td>NaN</td>
<td>NaN</td>
<td>Non institutional archiving locations: ChemRxi...</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>8590</td>
<td>8591</td>
<td>1</td>
<td>NaN</td>
<td>0</td>
<td>True</td>
<td>NaN</td>
<td>608</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Institution...</td>
</tr>
<tr>
<td>8591</td>
<td>8592</td>
<td>2</td>
<td>NaN</td>
<td>0</td>
<td>True</td>
<td>NaN</td>
<td>608</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Institution...</td>
</tr>
<tr>
<td>8592</td>
<td>8593</td>
<td>3</td>
<td>580.0</td>
<td>0</td>
<td>True</td>
<td>NaN</td>
<td>608</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Institution...</td>
</tr>
<tr>
<td>8593</td>
<td>8594</td>
<td>3</td>
<td>580.0</td>
<td>0</td>
<td>True</td>
<td>1.0</td>
<td>608</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Any Repository</td>
</tr>
<tr>
<td>8594</td>
<td>8595</td>
<td>3</td>
<td>580.0</td>
<td>0</td>
<td>True</td>
<td>1.0</td>
<td>608</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Any Repository</td>
</tr>
</tbody>
</table>
<p>8595 rows × 10 columns</p>
</div>
```python
term_orig = term_orig.append(term_rp, ignore_index=True, sort=False)
term_orig
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id_sherpa</th>
<th>version</th>
<th>cost_factor</th>
<th>embargo_months</th>
<th>archiving</th>
<th>licence</th>
<th>journal</th>
<th>prerequisite_funders</th>
<th>ror</th>
<th>comment</th>
<th>rp_id</th>
<th>valid_from</th>
<th>valid_until</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1.0</td>
<td>1</td>
<td>NaN</td>
<td>0</td>
<td>True</td>
<td>NaN</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>1</td>
<td>2.0</td>
<td>2</td>
<td>NaN</td>
<td>12</td>
<td>True</td>
<td>NaN</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>2</td>
<td>3.0</td>
<td>3</td>
<td>355.0</td>
<td>0</td>
<td>True</td>
<td>1.0</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Any Website...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>3</td>
<td>4.0</td>
<td>3</td>
<td>355.0</td>
<td>0</td>
<td>True</td>
<td>2.0</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Any Website...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>4</td>
<td>5.0</td>
<td>1</td>
<td>NaN</td>
<td>0</td>
<td>False</td>
<td>NaN</td>
<td>498.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Non institutional archiving locations: ChemRxi...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>48673</td>
<td>NaN</td>
<td>3</td>
<td>581.0</td>
<td>60</td>
<td>True</td>
<td>5.0</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/01swzsf04</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40079.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>48674</td>
<td>NaN</td>
<td>3</td>
<td>581.0</td>
<td>60</td>
<td>True</td>
<td>5.0</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/019whta54</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40080.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>48675</td>
<td>NaN</td>
<td>3</td>
<td>581.0</td>
<td>60</td>
<td>True</td>
<td>5.0</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/00vasag41</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40081.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>48676</td>
<td>NaN</td>
<td>3</td>
<td>581.0</td>
<td>60</td>
<td>True</td>
<td>5.0</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/05r0ap620</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40082.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>48677</td>
<td>NaN</td>
<td>3</td>
<td>581.0</td>
<td>60</td>
<td>True</td>
<td>5.0</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/05pmsvm27</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40083.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
</tbody>
</table>
<p>48678 rows × 13 columns</p>
</div>
```python
# ajout d'un hash unique pour chaque variante
term_orig['id_content_hash'] = term_orig.apply(lambda x: hash(tuple(x[['version', 'cost_factor', 'embargo_months', 'archiving', 'comment']])), axis = 1)
term_orig['id_content_hash_licence'] = term_orig.apply(lambda x: hash(tuple(x[['version', 'cost_factor', 'embargo_months', 'archiving', 'licence', 'comment']])), axis = 1)
```
```python
term_orig.sort_values(by='id_content_hash')
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id_sherpa</th>
<th>version</th>
<th>cost_factor</th>
<th>embargo_months</th>
<th>archiving</th>
<th>licence</th>
<th>journal</th>
<th>prerequisite_funders</th>
<th>ror</th>
<th>comment</th>
<th>rp_id</th>
<th>valid_from</th>
<th>valid_until</th>
<th>id_content_hash</th>
<th>id_content_hash_licence</th>
</tr>
</thead>
<tbody>
<tr>
<td>6599</td>
<td>6600.0</td>
<td>2</td>
<td>NaN</td>
<td>12</td>
<td>True</td>
<td>NaN</td>
<td>923.0</td>
<td>True</td>
<td>https://ror.org/056y81r79</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-9213354388875732238</td>
<td>-5975042390572407328</td>
</tr>
<tr>
<td>6867</td>
<td>6868.0</td>
<td>2</td>
<td>NaN</td>
<td>12</td>
<td>True</td>
<td>NaN</td>
<td>957.0</td>
<td>True</td>
<td>https://ror.org/056bwcz71</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-9213354388875732238</td>
<td>-5975042390572407328</td>
</tr>
<tr>
<td>4750</td>
<td>4751.0</td>
<td>2</td>
<td>NaN</td>
<td>12</td>
<td>True</td>
<td>NaN</td>
<td>642.0</td>
<td>True</td>
<td>https://ror.org/05w9mt194</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-9213354388875732238</td>
<td>-5975042390572407328</td>
</tr>
<tr>
<td>8236</td>
<td>8237.0</td>
<td>2</td>
<td>NaN</td>
<td>12</td>
<td>True</td>
<td>NaN</td>
<td>640.0</td>
<td>True</td>
<td>https://ror.org/02wxr8x18</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-9213354388875732238</td>
<td>-5975042390572407328</td>
</tr>
<tr>
<td>8237</td>
<td>8238.0</td>
<td>2</td>
<td>NaN</td>
<td>12</td>
<td>True</td>
<td>NaN</td>
<td>640.0</td>
<td>True</td>
<td>https://ror.org/056y81r79</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-9213354388875732238</td>
<td>-5975042390572407328</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>6353</td>
<td>6354.0</td>
<td>3</td>
<td>222.0</td>
<td>0</td>
<td>True</td>
<td>1.0</td>
<td>190.0</td>
<td>True</td>
<td>https://ror.org/02wdwnk04</td>
<td>Institutional archiving locations: Institution...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>9219045216097074691</td>
<td>-8427874628140339220</td>
</tr>
<tr>
<td>6352</td>
<td>6353.0</td>
<td>3</td>
<td>222.0</td>
<td>0</td>
<td>True</td>
<td>1.0</td>
<td>190.0</td>
<td>True</td>
<td>https://ror.org/029chgv08</td>
<td>Institutional archiving locations: Institution...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>9219045216097074691</td>
<td>-8427874628140339220</td>
</tr>
<tr>
<td>6362</td>
<td>6363.0</td>
<td>3</td>
<td>222.0</td>
<td>0</td>
<td>True</td>
<td>1.0</td>
<td>190.0</td>
<td>True</td>
<td>https://ror.org/0472cxd90</td>
<td>Institutional archiving locations: Institution...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>9219045216097074691</td>
<td>-8427874628140339220</td>
</tr>
<tr>
<td>6357</td>
<td>6358.0</td>
<td>3</td>
<td>222.0</td>
<td>0</td>
<td>True</td>
<td>1.0</td>
<td>190.0</td>
<td>True</td>
<td>https://ror.org/0456r8d26</td>
<td>Institutional archiving locations: Institution...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>9219045216097074691</td>
<td>-8427874628140339220</td>
</tr>
<tr>
<td>6363</td>
<td>6364.0</td>
<td>3</td>
<td>222.0</td>
<td>0</td>
<td>True</td>
<td>1.0</td>
<td>190.0</td>
<td>True</td>
<td>https://ror.org/03x94j517</td>
<td>Institutional archiving locations: Institution...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>9219045216097074691</td>
<td>-8427874628140339220</td>
</tr>
</tbody>
</table>
<p>48678 rows × 15 columns</p>
</div>
```python
# doublons
term_orig.loc[term_orig.duplicated(subset='id_content_hash')].sort_values(by='id_content_hash')
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id_sherpa</th>
<th>version</th>
<th>cost_factor</th>
<th>embargo_months</th>
<th>archiving</th>
<th>licence</th>
<th>journal</th>
<th>prerequisite_funders</th>
<th>ror</th>
<th>comment</th>
<th>rp_id</th>
<th>valid_from</th>
<th>valid_until</th>
<th>id_content_hash</th>
<th>id_content_hash_licence</th>
</tr>
</thead>
<tbody>
<tr>
<td>6607</td>
<td>6608.0</td>
<td>2</td>
<td>NaN</td>
<td>12</td>
<td>True</td>
<td>NaN</td>
<td>175.0</td>
<td>True</td>
<td>https://ror.org/02wxr8x18</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-9213354388875732238</td>
<td>-5975042390572407328</td>
</tr>
<tr>
<td>6508</td>
<td>6509.0</td>
<td>2</td>
<td>NaN</td>
<td>12</td>
<td>True</td>
<td>NaN</td>
<td>64.0</td>
<td>True</td>
<td>https://ror.org/05w9mt194</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-9213354388875732238</td>
<td>-5975042390572407328</td>
</tr>
<tr>
<td>1294</td>
<td>1295.0</td>
<td>2</td>
<td>NaN</td>
<td>12</td>
<td>True</td>
<td>NaN</td>
<td>342.0</td>
<td>True</td>
<td>https://ror.org/056bwcz71</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-9213354388875732238</td>
<td>-5975042390572407328</td>
</tr>
<tr>
<td>5561</td>
<td>5562.0</td>
<td>2</td>
<td>NaN</td>
<td>12</td>
<td>True</td>
<td>NaN</td>
<td>27.0</td>
<td>True</td>
<td>https://ror.org/05w9mt194</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-9213354388875732238</td>
<td>-5975042390572407328</td>
</tr>
<tr>
<td>5559</td>
<td>5560.0</td>
<td>2</td>
<td>NaN</td>
<td>12</td>
<td>True</td>
<td>NaN</td>
<td>27.0</td>
<td>True</td>
<td>https://ror.org/056y81r79</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-9213354388875732238</td>
<td>-5975042390572407328</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>6355</td>
<td>6356.0</td>
<td>3</td>
<td>222.0</td>
<td>0</td>
<td>True</td>
<td>1.0</td>
<td>190.0</td>
<td>True</td>
<td>https://ror.org/00cwqg982</td>
<td>Institutional archiving locations: Institution...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>9219045216097074691</td>
<td>-8427874628140339220</td>
</tr>
<tr>
<td>6354</td>
<td>6355.0</td>
<td>3</td>
<td>222.0</td>
<td>0</td>
<td>True</td>
<td>1.0</td>
<td>190.0</td>
<td>True</td>
<td>https://ror.org/02jkpm469</td>
<td>Institutional archiving locations: Institution...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>9219045216097074691</td>
<td>-8427874628140339220</td>
</tr>
<tr>
<td>6353</td>
<td>6354.0</td>
<td>3</td>
<td>222.0</td>
<td>0</td>
<td>True</td>
<td>1.0</td>
<td>190.0</td>
<td>True</td>
<td>https://ror.org/02wdwnk04</td>
<td>Institutional archiving locations: Institution...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>9219045216097074691</td>
<td>-8427874628140339220</td>
</tr>
<tr>
<td>6364</td>
<td>6365.0</td>
<td>3</td>
<td>222.0</td>
<td>0</td>
<td>True</td>
<td>1.0</td>
<td>190.0</td>
<td>True</td>
<td>https://ror.org/02gq0fg61</td>
<td>Institutional archiving locations: Institution...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>9219045216097074691</td>
<td>-8427874628140339220</td>
</tr>
<tr>
<td>6359</td>
<td>6360.0</td>
<td>3</td>
<td>222.0</td>
<td>0</td>
<td>True</td>
<td>1.0</td>
<td>190.0</td>
<td>True</td>
<td>https://ror.org/01613vh25</td>
<td>Institutional archiving locations: Institution...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>9219045216097074691</td>
<td>-8427874628140339220</td>
</tr>
</tbody>
</table>
<p>47358 rows × 15 columns</p>
</div>
```python
term_orig['licence'] = term_orig['licence'].fillna(999999)
term_orig['licence'] = term_orig['licence'].astype(int)
term_orig['cost_factor'] = term_orig['cost_factor'].fillna(999999)
term_orig['cost_factor'] = term_orig['cost_factor'].astype(int)
# term_orig['embargo_months'] = term_orig['embargo_months'].fillna(0)
# term_orig['embargo_months'] = term_orig['embargo_months'].astype(int)
term_orig.loc[term_orig['archiving'] == True, 'ir_archiving'] = 1
term_orig.loc[term_orig['archiving'] == False, 'ir_archiving'] = 0
term_orig['ir_archiving'] = term_orig['ir_archiving'].fillna(0)
term_orig
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id_sherpa</th>
<th>version</th>
<th>cost_factor</th>
<th>embargo_months</th>
<th>archiving</th>
<th>licence</th>
<th>journal</th>
<th>prerequisite_funders</th>
<th>ror</th>
<th>comment</th>
<th>rp_id</th>
<th>valid_from</th>
<th>valid_until</th>
<th>id_content_hash</th>
<th>id_content_hash_licence</th>
<th>ir_archiving</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1.0</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>True</td>
<td>999999</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-5068777248818105392</td>
<td>-8194612545168817012</td>
<td>1.0</td>
</tr>
<tr>
<td>1</td>
<td>2.0</td>
<td>2</td>
<td>999999</td>
<td>12</td>
<td>True</td>
<td>999999</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-1187146317861229577</td>
<td>1080785657261440835</td>
<td>1.0</td>
</tr>
<tr>
<td>2</td>
<td>3.0</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>True</td>
<td>1</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Any Website...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-6827815856646016670</td>
<td>-4410614044147247907</td>
<td>1.0</td>
</tr>
<tr>
<td>3</td>
<td>4.0</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>True</td>
<td>2</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Any Website...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>5388365857945903435</td>
<td>-492868609330074007</td>
<td>1.0</td>
</tr>
<tr>
<td>4</td>
<td>5.0</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>False</td>
<td>999999</td>
<td>498.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Non institutional archiving locations: ChemRxi...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-2781821769548802966</td>
<td>935766765288137110</td>
<td>0.0</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>48673</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/01swzsf04</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40079.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1.0</td>
</tr>
<tr>
<td>48674</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/019whta54</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40080.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1.0</td>
</tr>
<tr>
<td>48675</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/00vasag41</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40081.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1.0</td>
</tr>
<tr>
<td>48676</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/05r0ap620</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40082.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1.0</td>
</tr>
<tr>
<td>48677</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/05pmsvm27</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40083.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1.0</td>
</tr>
</tbody>
</table>
<p>48678 rows × 16 columns</p>
</div>
```python
term_orig.loc[term_orig['ir_archiving'].isna()]
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id_sherpa</th>
<th>version</th>
<th>cost_factor</th>
<th>embargo_months</th>
<th>archiving</th>
<th>licence</th>
<th>journal</th>
<th>prerequisite_funders</th>
<th>ror</th>
<th>comment</th>
<th>rp_id</th>
<th>valid_from</th>
<th>valid_until</th>
<th>id_content_hash</th>
<th>id_content_hash_licence</th>
<th>ir_archiving</th>
</tr>
</thead>
<tbody>
</tbody>
</table>
</div>
```python
term_orig['ir_archiving'].value_counts()
```
1.0 47467
0.0 1211
Name: ir_archiving, dtype: int64
```python
term_orig['licence'] = term_orig['licence'].astype(int)
term_orig['ir_archiving'] = term_orig['ir_archiving'].astype(int)
term_orig['cost_factor'] = term_orig['cost_factor'].astype(int)
term_orig
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id_sherpa</th>
<th>version</th>
<th>cost_factor</th>
<th>embargo_months</th>
<th>archiving</th>
<th>licence</th>
<th>journal</th>
<th>prerequisite_funders</th>
<th>ror</th>
<th>comment</th>
<th>rp_id</th>
<th>valid_from</th>
<th>valid_until</th>
<th>id_content_hash</th>
<th>id_content_hash_licence</th>
<th>ir_archiving</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1.0</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>True</td>
<td>999999</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-5068777248818105392</td>
<td>-8194612545168817012</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>2.0</td>
<td>2</td>
<td>999999</td>
<td>12</td>
<td>True</td>
<td>999999</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-1187146317861229577</td>
<td>1080785657261440835</td>
<td>1</td>
</tr>
<tr>
<td>2</td>
<td>3.0</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>True</td>
<td>1</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Any Website...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-6827815856646016670</td>
<td>-4410614044147247907</td>
<td>1</td>
</tr>
<tr>
<td>3</td>
<td>4.0</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>True</td>
<td>2</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Any Website...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>5388365857945903435</td>
<td>-492868609330074007</td>
<td>1</td>
</tr>
<tr>
<td>4</td>
<td>5.0</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>False</td>
<td>999999</td>
<td>498.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Non institutional archiving locations: ChemRxi...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-2781821769548802966</td>
<td>935766765288137110</td>
<td>0</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>48673</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/01swzsf04</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40079.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1</td>
</tr>
<tr>
<td>48674</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/019whta54</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40080.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1</td>
</tr>
<tr>
<td>48675</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/00vasag41</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40081.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1</td>
</tr>
<tr>
<td>48676</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/05r0ap620</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40082.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1</td>
</tr>
<tr>
<td>48677</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/05pmsvm27</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40083.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1</td>
</tr>
</tbody>
</table>
<p>48678 rows × 16 columns</p>
</div>
```python
terms_export_dates = term_orig.loc[(term_orig['valid_from'].notna()) | (term_orig['valid_until'].notna())][['id_content_hash', 'ror', 'valid_from', 'valid_until']]
terms_export_dates
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id_content_hash</th>
<th>ror</th>
<th>valid_from</th>
<th>valid_until</th>
</tr>
</thead>
<tbody>
<tr>
<td>8595</td>
<td>-6020029623494903364</td>
<td>https://ror.org/04d8ztx87</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>8596</td>
<td>-6020029623494903364</td>
<td>https://ror.org/02bnkt322</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>8597</td>
<td>-6020029623494903364</td>
<td>https://ror.org/00zg4za48</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>8598</td>
<td>-6020029623494903364</td>
<td>https://ror.org/02s376052</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>8599</td>
<td>-6020029623494903364</td>
<td>https://ror.org/05a28rw58</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>48673</td>
<td>7687377827846095855</td>
<td>https://ror.org/01swzsf04</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>48674</td>
<td>7687377827846095855</td>
<td>https://ror.org/019whta54</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>48675</td>
<td>7687377827846095855</td>
<td>https://ror.org/00vasag41</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>48676</td>
<td>7687377827846095855</td>
<td>https://ror.org/05r0ap620</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>48677</td>
<td>7687377827846095855</td>
<td>https://ror.org/05pmsvm27</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
</tbody>
</table>
<p>40083 rows × 4 columns</p>
</div>
```python
terms_export = term_orig[['id_sherpa', 'rp_id', 'id_content_hash', 'id_content_hash_licence', 'version', 'cost_factor', 'embargo_months', 'ir_archiving', 'licence', 'comment']]
terms_export
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id_sherpa</th>
<th>rp_id</th>
<th>id_content_hash</th>
<th>id_content_hash_licence</th>
<th>version</th>
<th>cost_factor</th>
<th>embargo_months</th>
<th>ir_archiving</th>
<th>licence</th>
<th>comment</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1.0</td>
<td>NaN</td>
<td>-5068777248818105392</td>
<td>-8194612545168817012</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>1</td>
<td>999999</td>
<td>Institutional archiving locations: Non-Commerc...</td>
</tr>
<tr>
<td>1</td>
<td>2.0</td>
<td>NaN</td>
<td>-1187146317861229577</td>
<td>1080785657261440835</td>
<td>2</td>
<td>999999</td>
<td>12</td>
<td>1</td>
<td>999999</td>
<td>Institutional archiving locations: Non-Commerc...</td>
</tr>
<tr>
<td>2</td>
<td>3.0</td>
<td>NaN</td>
<td>-6827815856646016670</td>
<td>-4410614044147247907</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Institutional archiving locations: Any Website...</td>
</tr>
<tr>
<td>3</td>
<td>4.0</td>
<td>NaN</td>
<td>5388365857945903435</td>
<td>-492868609330074007</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>1</td>
<td>2</td>
<td>Institutional archiving locations: Any Website...</td>
</tr>
<tr>
<td>4</td>
<td>5.0</td>
<td>NaN</td>
<td>-2781821769548802966</td>
<td>935766765288137110</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>0</td>
<td>999999</td>
<td>Non institutional archiving locations: ChemRxi...</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>48673</td>
<td>NaN</td>
<td>40079.0</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>1</td>
<td>5</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
</tr>
<tr>
<td>48674</td>
<td>NaN</td>
<td>40080.0</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>1</td>
<td>5</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
</tr>
<tr>
<td>48675</td>
<td>NaN</td>
<td>40081.0</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>1</td>
<td>5</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
</tr>
<tr>
<td>48676</td>
<td>NaN</td>
<td>40082.0</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>1</td>
<td>5</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
</tr>
<tr>
<td>48677</td>
<td>NaN</td>
<td>40083.0</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>1</td>
<td>5</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
</tr>
</tbody>
</table>
<p>48678 rows × 10 columns</p>
</div>
```python
# test de doublons
terms_export.loc[terms_export.duplicated(subset='id_content_hash')].sort_values(by='id_content_hash')
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id_sherpa</th>
<th>rp_id</th>
<th>id_content_hash</th>
<th>id_content_hash_licence</th>
<th>version</th>
<th>cost_factor</th>
<th>embargo_months</th>
<th>ir_archiving</th>
<th>licence</th>
<th>comment</th>
</tr>
</thead>
<tbody>
<tr>
<td>6607</td>
<td>6608.0</td>
<td>NaN</td>
<td>-9213354388875732238</td>
<td>-5975042390572407328</td>
<td>2</td>
<td>999999</td>
<td>12</td>
<td>1</td>
<td>999999</td>
<td>Institutional archiving locations: Non-Commerc...</td>
</tr>
<tr>
<td>6508</td>
<td>6509.0</td>
<td>NaN</td>
<td>-9213354388875732238</td>
<td>-5975042390572407328</td>
<td>2</td>
<td>999999</td>
<td>12</td>
<td>1</td>
<td>999999</td>
<td>Institutional archiving locations: Non-Commerc...</td>
</tr>
<tr>
<td>1294</td>
<td>1295.0</td>
<td>NaN</td>
<td>-9213354388875732238</td>
<td>-5975042390572407328</td>
<td>2</td>
<td>999999</td>
<td>12</td>
<td>1</td>
<td>999999</td>
<td>Institutional archiving locations: Non-Commerc...</td>
</tr>
<tr>
<td>5561</td>
<td>5562.0</td>
<td>NaN</td>
<td>-9213354388875732238</td>
<td>-5975042390572407328</td>
<td>2</td>
<td>999999</td>
<td>12</td>
<td>1</td>
<td>999999</td>
<td>Institutional archiving locations: Non-Commerc...</td>
</tr>
<tr>
<td>5559</td>
<td>5560.0</td>
<td>NaN</td>
<td>-9213354388875732238</td>
<td>-5975042390572407328</td>
<td>2</td>
<td>999999</td>
<td>12</td>
<td>1</td>
<td>999999</td>
<td>Institutional archiving locations: Non-Commerc...</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>6355</td>
<td>6356.0</td>
<td>NaN</td>
<td>9219045216097074691</td>
<td>-8427874628140339220</td>
<td>3</td>
<td>222</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Institutional archiving locations: Institution...</td>
</tr>
<tr>
<td>6354</td>
<td>6355.0</td>
<td>NaN</td>
<td>9219045216097074691</td>
<td>-8427874628140339220</td>
<td>3</td>
<td>222</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Institutional archiving locations: Institution...</td>
</tr>
<tr>
<td>6353</td>
<td>6354.0</td>
<td>NaN</td>
<td>9219045216097074691</td>
<td>-8427874628140339220</td>
<td>3</td>
<td>222</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Institutional archiving locations: Institution...</td>
</tr>
<tr>
<td>6364</td>
<td>6365.0</td>
<td>NaN</td>
<td>9219045216097074691</td>
<td>-8427874628140339220</td>
<td>3</td>
<td>222</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Institutional archiving locations: Institution...</td>
</tr>
<tr>
<td>6359</td>
<td>6360.0</td>
<td>NaN</td>
<td>9219045216097074691</td>
<td>-8427874628140339220</td>
<td>3</td>
<td>222</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Institutional archiving locations: Institution...</td>
</tr>
</tbody>
</table>
<p>47358 rows × 10 columns</p>
</div>
```python
terms_export_dedup = terms_export.drop_duplicates(subset=['id_content_hash'])
terms_export_dedup
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id_sherpa</th>
<th>rp_id</th>
<th>id_content_hash</th>
<th>id_content_hash_licence</th>
<th>version</th>
<th>cost_factor</th>
<th>embargo_months</th>
<th>ir_archiving</th>
<th>licence</th>
<th>comment</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1.0</td>
<td>NaN</td>
<td>-5068777248818105392</td>
<td>-8194612545168817012</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>1</td>
<td>999999</td>
<td>Institutional archiving locations: Non-Commerc...</td>
</tr>
<tr>
<td>1</td>
<td>2.0</td>
<td>NaN</td>
<td>-1187146317861229577</td>
<td>1080785657261440835</td>
<td>2</td>
<td>999999</td>
<td>12</td>
<td>1</td>
<td>999999</td>
<td>Institutional archiving locations: Non-Commerc...</td>
</tr>
<tr>
<td>2</td>
<td>3.0</td>
<td>NaN</td>
<td>-6827815856646016670</td>
<td>-4410614044147247907</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Institutional archiving locations: Any Website...</td>
</tr>
<tr>
<td>3</td>
<td>4.0</td>
<td>NaN</td>
<td>5388365857945903435</td>
<td>-492868609330074007</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>1</td>
<td>2</td>
<td>Institutional archiving locations: Any Website...</td>
</tr>
<tr>
<td>4</td>
<td>5.0</td>
<td>NaN</td>
<td>-2781821769548802966</td>
<td>935766765288137110</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>0</td>
<td>999999</td>
<td>Non institutional archiving locations: ChemRxi...</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>8595</td>
<td>NaN</td>
<td>1.0</td>
<td>-6020029623494903364</td>
<td>-5435886237991661497</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Elsevier Read &amp; Publish agreement</td>
</tr>
<tr>
<td>26723</td>
<td>NaN</td>
<td>18129.0</td>
<td>-1955262099488276438</td>
<td>6359482801433181261</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>NaN</td>
</tr>
<tr>
<td>33439</td>
<td>NaN</td>
<td>24845.0</td>
<td>-681455397323083870</td>
<td>5265079689140421989</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Wiley Read &amp; Publish agreement</td>
</tr>
<tr>
<td>47344</td>
<td>NaN</td>
<td>38750.0</td>
<td>6747956201225830719</td>
<td>-4648758608429098534</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Taylor and Francis Read &amp; Publish agreement</td>
</tr>
<tr>
<td>47758</td>
<td>NaN</td>
<td>39164.0</td>
<td>7687377827846095855</td>
<td>2298488065455407402</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>1</td>
<td>1</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
</tr>
</tbody>
</table>
<p>1320 rows × 10 columns</p>
</div>
```python
terms_export_dedup_licence = terms_export.drop_duplicates(subset=['id_content_hash_licence'])
terms_export_dedup_licence
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id_sherpa</th>
<th>rp_id</th>
<th>id_content_hash</th>
<th>id_content_hash_licence</th>
<th>version</th>
<th>cost_factor</th>
<th>embargo_months</th>
<th>ir_archiving</th>
<th>licence</th>
<th>comment</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1.0</td>
<td>NaN</td>
<td>-5068777248818105392</td>
<td>-8194612545168817012</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>1</td>
<td>999999</td>
<td>Institutional archiving locations: Non-Commerc...</td>
</tr>
<tr>
<td>1</td>
<td>2.0</td>
<td>NaN</td>
<td>-1187146317861229577</td>
<td>1080785657261440835</td>
<td>2</td>
<td>999999</td>
<td>12</td>
<td>1</td>
<td>999999</td>
<td>Institutional archiving locations: Non-Commerc...</td>
</tr>
<tr>
<td>2</td>
<td>3.0</td>
<td>NaN</td>
<td>-6827815856646016670</td>
<td>-4410614044147247907</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Institutional archiving locations: Any Website...</td>
</tr>
<tr>
<td>3</td>
<td>4.0</td>
<td>NaN</td>
<td>5388365857945903435</td>
<td>-492868609330074007</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>1</td>
<td>2</td>
<td>Institutional archiving locations: Any Website...</td>
</tr>
<tr>
<td>4</td>
<td>5.0</td>
<td>NaN</td>
<td>-2781821769548802966</td>
<td>935766765288137110</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>0</td>
<td>999999</td>
<td>Non institutional archiving locations: ChemRxi...</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>47344</td>
<td>NaN</td>
<td>38750.0</td>
<td>6747956201225830719</td>
<td>-4648758608429098534</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Taylor and Francis Read &amp; Publish agreement</td>
</tr>
<tr>
<td>47758</td>
<td>NaN</td>
<td>39164.0</td>
<td>7687377827846095855</td>
<td>2298488065455407402</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>1</td>
<td>1</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
</tr>
<tr>
<td>47988</td>
<td>NaN</td>
<td>39394.0</td>
<td>7687377827846095855</td>
<td>2298497766188448059</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>1</td>
<td>4</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
</tr>
<tr>
<td>48218</td>
<td>NaN</td>
<td>39624.0</td>
<td>7687377827846095855</td>
<td>2298486079450211665</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>1</td>
<td>2</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
</tr>
<tr>
<td>48448</td>
<td>NaN</td>
<td>39854.0</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>1</td>
<td>5</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
</tr>
</tbody>
</table>
<p>1590 rows × 10 columns</p>
</div>
```python
# test de doublons
terms_export_dedup_licence.loc[terms_export_dedup_licence.duplicated(subset='id_content_hash')].sort_values(by='id_content_hash')
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id_sherpa</th>
<th>rp_id</th>
<th>id_content_hash</th>
<th>id_content_hash_licence</th>
<th>version</th>
<th>cost_factor</th>
<th>embargo_months</th>
<th>ir_archiving</th>
<th>licence</th>
<th>comment</th>
</tr>
</thead>
<tbody>
<tr>
<td>1569</td>
<td>1570.0</td>
<td>NaN</td>
<td>-9114006443623277513</td>
<td>-7273388776362060491</td>
<td>3</td>
<td>413</td>
<td>0</td>
<td>0</td>
<td>2</td>
<td>Non institutional archiving locations: PubMed ...</td>
</tr>
<tr>
<td>582</td>
<td>583.0</td>
<td>NaN</td>
<td>-9011072484834895623</td>
<td>-5911605112402338889</td>
<td>3</td>
<td>379</td>
<td>0</td>
<td>1</td>
<td>2</td>
<td>Institutional archiving locations: Any Reposit...</td>
</tr>
<tr>
<td>8553</td>
<td>8554.0</td>
<td>NaN</td>
<td>-8861630054613228454</td>
<td>7176773088076624015</td>
<td>3</td>
<td>573</td>
<td>0</td>
<td>0</td>
<td>3</td>
<td>Non institutional archiving locations: Funder ...</td>
</tr>
<tr>
<td>8552</td>
<td>8553.0</td>
<td>NaN</td>
<td>-8861630054613228454</td>
<td>7176773474396433690</td>
<td>3</td>
<td>573</td>
<td>0</td>
<td>0</td>
<td>2</td>
<td>Non institutional archiving locations: Funder ...</td>
</tr>
<tr>
<td>8264</td>
<td>8265.0</td>
<td>NaN</td>
<td>-8856152899298491735</td>
<td>-1219996111910161561</td>
<td>3</td>
<td>560</td>
<td>0</td>
<td>1</td>
<td>4</td>
<td>Institutional archiving locations: Non-Commerc...</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>8560</td>
<td>8561.0</td>
<td>NaN</td>
<td>8735446932641542951</td>
<td>4332046250364995695</td>
<td>3</td>
<td>574</td>
<td>0</td>
<td>0</td>
<td>2</td>
<td>Non institutional archiving locations: Funder ...</td>
</tr>
<tr>
<td>8561</td>
<td>8562.0</td>
<td>NaN</td>
<td>8735446932641542951</td>
<td>4332048117937865978</td>
<td>3</td>
<td>574</td>
<td>0</td>
<td>0</td>
<td>3</td>
<td>Non institutional archiving locations: Funder ...</td>
</tr>
<tr>
<td>2222</td>
<td>2223.0</td>
<td>NaN</td>
<td>8745253383893524719</td>
<td>521134702989893722</td>
<td>3</td>
<td>431</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Institutional archiving locations: Any Website...</td>
</tr>
<tr>
<td>4152</td>
<td>4153.0</td>
<td>NaN</td>
<td>8845243756736955098</td>
<td>6100113456095422831</td>
<td>3</td>
<td>464</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Institutional archiving locations: Any Website...</td>
</tr>
<tr>
<td>4351</td>
<td>4352.0</td>
<td>NaN</td>
<td>9036026380223066491</td>
<td>-1539490241665655036</td>
<td>3</td>
<td>470</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Institutional archiving locations: Institution...</td>
</tr>
</tbody>
</table>
<p>270 rows × 10 columns</p>
</div>
```python
# totaux pour les deux sources
terms_export_dedup.loc[terms_export_dedup['id_sherpa'].notna()].shape[0]
```
1315
```python
terms_export_dedup.loc[terms_export_dedup['rp_id'].notna()].shape[0]
```
5
```python
terms_export_dedup.loc[terms_export_dedup['rp_id'].notna()]
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id_sherpa</th>
<th>rp_id</th>
<th>id_content_hash</th>
<th>id_content_hash_licence</th>
<th>version</th>
<th>cost_factor</th>
<th>embargo_months</th>
<th>ir_archiving</th>
<th>licence</th>
<th>comment</th>
</tr>
</thead>
<tbody>
<tr>
<td>8595</td>
<td>NaN</td>
<td>1.0</td>
<td>-6020029623494903364</td>
<td>-5435886237991661497</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Elsevier Read &amp; Publish agreement</td>
</tr>
<tr>
<td>26723</td>
<td>NaN</td>
<td>18129.0</td>
<td>-1955262099488276438</td>
<td>6359482801433181261</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>NaN</td>
</tr>
<tr>
<td>33439</td>
<td>NaN</td>
<td>24845.0</td>
<td>-681455397323083870</td>
<td>5265079689140421989</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Wiley Read &amp; Publish agreement</td>
</tr>
<tr>
<td>47344</td>
<td>NaN</td>
<td>38750.0</td>
<td>6747956201225830719</td>
<td>-4648758608429098534</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Taylor and Francis Read &amp; Publish agreement</td>
</tr>
<tr>
<td>47758</td>
<td>NaN</td>
<td>39164.0</td>
<td>7687377827846095855</td>
<td>2298488065455407402</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>1</td>
<td>1</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
</tr>
</tbody>
</table>
</div>
```python
# convertir l'index en id
terms_export_dedup.reset_index(inplace=True)
del terms_export_dedup['index']
terms_export_dedup
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id_sherpa</th>
<th>rp_id</th>
<th>id_content_hash</th>
<th>id_content_hash_licence</th>
<th>version</th>
<th>cost_factor</th>
<th>embargo_months</th>
<th>ir_archiving</th>
<th>licence</th>
<th>comment</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1.0</td>
<td>NaN</td>
<td>-5068777248818105392</td>
<td>-8194612545168817012</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>1</td>
<td>999999</td>
<td>Institutional archiving locations: Non-Commerc...</td>
</tr>
<tr>
<td>1</td>
<td>2.0</td>
<td>NaN</td>
<td>-1187146317861229577</td>
<td>1080785657261440835</td>
<td>2</td>
<td>999999</td>
<td>12</td>
<td>1</td>
<td>999999</td>
<td>Institutional archiving locations: Non-Commerc...</td>
</tr>
<tr>
<td>2</td>
<td>3.0</td>
<td>NaN</td>
<td>-6827815856646016670</td>
<td>-4410614044147247907</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Institutional archiving locations: Any Website...</td>
</tr>
<tr>
<td>3</td>
<td>4.0</td>
<td>NaN</td>
<td>5388365857945903435</td>
<td>-492868609330074007</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>1</td>
<td>2</td>
<td>Institutional archiving locations: Any Website...</td>
</tr>
<tr>
<td>4</td>
<td>5.0</td>
<td>NaN</td>
<td>-2781821769548802966</td>
<td>935766765288137110</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>0</td>
<td>999999</td>
<td>Non institutional archiving locations: ChemRxi...</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>1315</td>
<td>NaN</td>
<td>1.0</td>
<td>-6020029623494903364</td>
<td>-5435886237991661497</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Elsevier Read &amp; Publish agreement</td>
</tr>
<tr>
<td>1316</td>
<td>NaN</td>
<td>18129.0</td>
<td>-1955262099488276438</td>
<td>6359482801433181261</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>NaN</td>
</tr>
<tr>
<td>1317</td>
<td>NaN</td>
<td>24845.0</td>
<td>-681455397323083870</td>
<td>5265079689140421989</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Wiley Read &amp; Publish agreement</td>
</tr>
<tr>
<td>1318</td>
<td>NaN</td>
<td>38750.0</td>
<td>6747956201225830719</td>
<td>-4648758608429098534</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Taylor and Francis Read &amp; Publish agreement</td>
</tr>
<tr>
<td>1319</td>
<td>NaN</td>
<td>39164.0</td>
<td>7687377827846095855</td>
<td>2298488065455407402</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>1</td>
<td>1</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
</tr>
</tbody>
</table>
<p>1320 rows × 10 columns</p>
</div>
```python
# ajout de l'id avec l'index + 1
terms_export_dedup['id'] = terms_export_dedup.index + 1
# del terms_export_dedup['index']
terms_export_dedup
```
C:\Users\iriarte\AppData\Local\Continuum\anaconda3\lib\site-packages\ipykernel_launcher.py:2: SettingWithCopyWarning:
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead
See the caveats in the documentation: http://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id_sherpa</th>
<th>rp_id</th>
<th>id_content_hash</th>
<th>id_content_hash_licence</th>
<th>version</th>
<th>cost_factor</th>
<th>embargo_months</th>
<th>ir_archiving</th>
<th>licence</th>
<th>comment</th>
<th>id</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1.0</td>
<td>NaN</td>
<td>-5068777248818105392</td>
<td>-8194612545168817012</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>1</td>
<td>999999</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>2.0</td>
<td>NaN</td>
<td>-1187146317861229577</td>
<td>1080785657261440835</td>
<td>2</td>
<td>999999</td>
<td>12</td>
<td>1</td>
<td>999999</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>2</td>
</tr>
<tr>
<td>2</td>
<td>3.0</td>
<td>NaN</td>
<td>-6827815856646016670</td>
<td>-4410614044147247907</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Institutional archiving locations: Any Website...</td>
<td>3</td>
</tr>
<tr>
<td>3</td>
<td>4.0</td>
<td>NaN</td>
<td>5388365857945903435</td>
<td>-492868609330074007</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>1</td>
<td>2</td>
<td>Institutional archiving locations: Any Website...</td>
<td>4</td>
</tr>
<tr>
<td>4</td>
<td>5.0</td>
<td>NaN</td>
<td>-2781821769548802966</td>
<td>935766765288137110</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>0</td>
<td>999999</td>
<td>Non institutional archiving locations: ChemRxi...</td>
<td>5</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>1315</td>
<td>NaN</td>
<td>1.0</td>
<td>-6020029623494903364</td>
<td>-5435886237991661497</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Elsevier Read &amp; Publish agreement</td>
<td>1316</td>
</tr>
<tr>
<td>1316</td>
<td>NaN</td>
<td>18129.0</td>
<td>-1955262099488276438</td>
<td>6359482801433181261</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>NaN</td>
<td>1317</td>
</tr>
<tr>
<td>1317</td>
<td>NaN</td>
<td>24845.0</td>
<td>-681455397323083870</td>
<td>5265079689140421989</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Wiley Read &amp; Publish agreement</td>
<td>1318</td>
</tr>
<tr>
<td>1318</td>
<td>NaN</td>
<td>38750.0</td>
<td>6747956201225830719</td>
<td>-4648758608429098534</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Taylor and Francis Read &amp; Publish agreement</td>
<td>1319</td>
</tr>
<tr>
<td>1319</td>
<td>NaN</td>
<td>39164.0</td>
<td>7687377827846095855</td>
<td>2298488065455407402</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>1</td>
<td>1</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>1320</td>
</tr>
</tbody>
</table>
<p>1320 rows × 11 columns</p>
</div>
```python
terms_export_dedup['source'] = ''
terms_export_dedup
```
C:\Users\iriarte\AppData\Local\Continuum\anaconda3\lib\site-packages\ipykernel_launcher.py:1: SettingWithCopyWarning:
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead
See the caveats in the documentation: http://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
"""Entry point for launching an IPython kernel.
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id_sherpa</th>
<th>rp_id</th>
<th>id_content_hash</th>
<th>id_content_hash_licence</th>
<th>version</th>
<th>cost_factor</th>
<th>embargo_months</th>
<th>ir_archiving</th>
<th>licence</th>
<th>comment</th>
<th>id</th>
<th>source</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1.0</td>
<td>NaN</td>
<td>-5068777248818105392</td>
<td>-8194612545168817012</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>1</td>
<td>999999</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>1</td>
<td></td>
</tr>
<tr>
<td>1</td>
<td>2.0</td>
<td>NaN</td>
<td>-1187146317861229577</td>
<td>1080785657261440835</td>
<td>2</td>
<td>999999</td>
<td>12</td>
<td>1</td>
<td>999999</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>2</td>
<td></td>
</tr>
<tr>
<td>2</td>
<td>3.0</td>
<td>NaN</td>
<td>-6827815856646016670</td>
<td>-4410614044147247907</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Institutional archiving locations: Any Website...</td>
<td>3</td>
<td></td>
</tr>
<tr>
<td>3</td>
<td>4.0</td>
<td>NaN</td>
<td>5388365857945903435</td>
<td>-492868609330074007</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>1</td>
<td>2</td>
<td>Institutional archiving locations: Any Website...</td>
<td>4</td>
<td></td>
</tr>
<tr>
<td>4</td>
<td>5.0</td>
<td>NaN</td>
<td>-2781821769548802966</td>
<td>935766765288137110</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>0</td>
<td>999999</td>
<td>Non institutional archiving locations: ChemRxi...</td>
<td>5</td>
<td></td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>1315</td>
<td>NaN</td>
<td>1.0</td>
<td>-6020029623494903364</td>
<td>-5435886237991661497</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Elsevier Read &amp; Publish agreement</td>
<td>1316</td>
<td></td>
</tr>
<tr>
<td>1316</td>
<td>NaN</td>
<td>18129.0</td>
<td>-1955262099488276438</td>
<td>6359482801433181261</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>NaN</td>
<td>1317</td>
<td></td>
</tr>
<tr>
<td>1317</td>
<td>NaN</td>
<td>24845.0</td>
<td>-681455397323083870</td>
<td>5265079689140421989</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Wiley Read &amp; Publish agreement</td>
<td>1318</td>
<td></td>
</tr>
<tr>
<td>1318</td>
<td>NaN</td>
<td>38750.0</td>
<td>6747956201225830719</td>
<td>-4648758608429098534</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Taylor and Francis Read &amp; Publish agreement</td>
<td>1319</td>
<td></td>
</tr>
<tr>
<td>1319</td>
<td>NaN</td>
<td>39164.0</td>
<td>7687377827846095855</td>
<td>2298488065455407402</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>1</td>
<td>1</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>1320</td>
<td></td>
</tr>
</tbody>
</table>
<p>1320 rows × 12 columns</p>
</div>
```python
# grouper par licence
terms_export_dedup_licences = terms_export_dedup_licence[['licence', 'id_content_hash']]
terms_export_dedup_licences
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>licence</th>
<th>id_content_hash</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>999999</td>
<td>-5068777248818105392</td>
</tr>
<tr>
<td>1</td>
<td>999999</td>
<td>-1187146317861229577</td>
</tr>
<tr>
<td>2</td>
<td>1</td>
<td>-6827815856646016670</td>
</tr>
<tr>
<td>3</td>
<td>2</td>
<td>5388365857945903435</td>
</tr>
<tr>
<td>4</td>
<td>999999</td>
<td>-2781821769548802966</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>47344</td>
<td>1</td>
<td>6747956201225830719</td>
</tr>
<tr>
<td>47758</td>
<td>1</td>
<td>7687377827846095855</td>
</tr>
<tr>
<td>47988</td>
<td>4</td>
<td>7687377827846095855</td>
</tr>
<tr>
<td>48218</td>
<td>2</td>
<td>7687377827846095855</td>
</tr>
<tr>
<td>48448</td>
<td>5</td>
<td>7687377827846095855</td>
</tr>
</tbody>
</table>
<p>1590 rows × 2 columns</p>
</div>
```python
# concat valeurs avec même id
terms_export_dedup_licences['licence'] = terms_export_dedup_licences['licence'].astype(str)
terms_export_dedup_licences = terms_export_dedup_licences.groupby('id_content_hash').agg({'licence': lambda x: ', '.join(x)})
terms_export_dedup_licences
```
C:\Users\iriarte\AppData\Local\Continuum\anaconda3\lib\site-packages\ipykernel_launcher.py:2: SettingWithCopyWarning:
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead
See the caveats in the documentation: http://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>licence</th>
</tr>
<tr>
<th>id_content_hash</th>
<th></th>
</tr>
</thead>
<tbody>
<tr>
<td>-9213354388875732238</td>
<td>999999</td>
</tr>
<tr>
<td>-9200070744422558377</td>
<td>999999</td>
</tr>
<tr>
<td>-9171783117023104395</td>
<td>1</td>
</tr>
<tr>
<td>-9134952646468948163</td>
<td>1</td>
</tr>
<tr>
<td>-9133013648751406289</td>
<td>1</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>9195001330432352893</td>
<td>1</td>
</tr>
<tr>
<td>9200466168345981543</td>
<td>1</td>
</tr>
<tr>
<td>9213878808178729253</td>
<td>2</td>
</tr>
<tr>
<td>9218389208912777882</td>
<td>2</td>
</tr>
<tr>
<td>9219045216097074691</td>
<td>1</td>
</tr>
</tbody>
</table>
<p>1320 rows × 1 columns</p>
</div>
```python
# test des valeur multiples
terms_export_dedup_licences.loc[terms_export_dedup_licences['licence'].str.contains(',')]
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>licence</th>
</tr>
<tr>
<th>id_content_hash</th>
<th></th>
</tr>
</thead>
<tbody>
<tr>
<td>-9114006443623277513</td>
<td>1, 2</td>
</tr>
<tr>
<td>-9011072484834895623</td>
<td>1, 2</td>
</tr>
<tr>
<td>-8861630054613228454</td>
<td>1, 2, 3</td>
</tr>
<tr>
<td>-8856152899298491735</td>
<td>1, 4</td>
</tr>
<tr>
<td>-8607167568720519189</td>
<td>1, 4</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>8712161777436385390</td>
<td>1, 4</td>
</tr>
<tr>
<td>8735446932641542951</td>
<td>1, 2, 3</td>
</tr>
<tr>
<td>8745253383893524719</td>
<td>2, 1</td>
</tr>
<tr>
<td>8845243756736955098</td>
<td>2, 1</td>
</tr>
<tr>
<td>9036026380223066491</td>
<td>2, 1</td>
</tr>
</tbody>
</table>
<p>185 rows × 1 columns</p>
</div>
```python
# ajout des licences groupées
terms_export_dedup_fin = pd.merge(terms_export_dedup, terms_export_dedup_licences, on='id_content_hash', how='left')
terms_export_dedup_fin
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id_sherpa</th>
<th>rp_id</th>
<th>id_content_hash</th>
<th>id_content_hash_licence</th>
<th>version</th>
<th>cost_factor</th>
<th>embargo_months</th>
<th>ir_archiving</th>
<th>licence_x</th>
<th>comment</th>
<th>id</th>
<th>source</th>
<th>licence_y</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1.0</td>
<td>NaN</td>
<td>-5068777248818105392</td>
<td>-8194612545168817012</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>1</td>
<td>999999</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>1</td>
<td></td>
<td>999999</td>
</tr>
<tr>
<td>1</td>
<td>2.0</td>
<td>NaN</td>
<td>-1187146317861229577</td>
<td>1080785657261440835</td>
<td>2</td>
<td>999999</td>
<td>12</td>
<td>1</td>
<td>999999</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>2</td>
<td></td>
<td>999999</td>
</tr>
<tr>
<td>2</td>
<td>3.0</td>
<td>NaN</td>
<td>-6827815856646016670</td>
<td>-4410614044147247907</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Institutional archiving locations: Any Website...</td>
<td>3</td>
<td></td>
<td>1</td>
</tr>
<tr>
<td>3</td>
<td>4.0</td>
<td>NaN</td>
<td>5388365857945903435</td>
<td>-492868609330074007</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>1</td>
<td>2</td>
<td>Institutional archiving locations: Any Website...</td>
<td>4</td>
<td></td>
<td>2</td>
</tr>
<tr>
<td>4</td>
<td>5.0</td>
<td>NaN</td>
<td>-2781821769548802966</td>
<td>935766765288137110</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>0</td>
<td>999999</td>
<td>Non institutional archiving locations: ChemRxi...</td>
<td>5</td>
<td></td>
<td>999999</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>1315</td>
<td>NaN</td>
<td>1.0</td>
<td>-6020029623494903364</td>
<td>-5435886237991661497</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Elsevier Read &amp; Publish agreement</td>
<td>1316</td>
<td></td>
<td>1, 2</td>
</tr>
<tr>
<td>1316</td>
<td>NaN</td>
<td>18129.0</td>
<td>-1955262099488276438</td>
<td>6359482801433181261</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>NaN</td>
<td>1317</td>
<td></td>
<td>1, 4</td>
</tr>
<tr>
<td>1317</td>
<td>NaN</td>
<td>24845.0</td>
<td>-681455397323083870</td>
<td>5265079689140421989</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Wiley Read &amp; Publish agreement</td>
<td>1318</td>
<td></td>
<td>1, 4, 2</td>
</tr>
<tr>
<td>1318</td>
<td>NaN</td>
<td>38750.0</td>
<td>6747956201225830719</td>
<td>-4648758608429098534</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Taylor and Francis Read &amp; Publish agreement</td>
<td>1319</td>
<td></td>
<td>1</td>
</tr>
<tr>
<td>1319</td>
<td>NaN</td>
<td>39164.0</td>
<td>7687377827846095855</td>
<td>2298488065455407402</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>1</td>
<td>1</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>1320</td>
<td></td>
<td>1, 4, 2, 5</td>
</tr>
</tbody>
</table>
<p>1320 rows × 13 columns</p>
</div>
```python
# merge avec les dates pour avoir les terms ids
terms_export_dates = pd.merge(terms_export_dates, terms_export_dedup_fin[['id_content_hash', 'id']], on='id_content_hash')
terms_export_dates = terms_export_dates.rename(columns = {'id' : 'term'})
terms_export_dates
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id_content_hash</th>
<th>ror</th>
<th>valid_from</th>
<th>valid_until</th>
<th>term</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>-6020029623494903364</td>
<td>https://ror.org/04d8ztx87</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1316</td>
</tr>
<tr>
<td>1</td>
<td>-6020029623494903364</td>
<td>https://ror.org/02bnkt322</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1316</td>
</tr>
<tr>
<td>2</td>
<td>-6020029623494903364</td>
<td>https://ror.org/00zg4za48</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1316</td>
</tr>
<tr>
<td>3</td>
<td>-6020029623494903364</td>
<td>https://ror.org/02s376052</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1316</td>
</tr>
<tr>
<td>4</td>
<td>-6020029623494903364</td>
<td>https://ror.org/05a28rw58</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1316</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>40078</td>
<td>7687377827846095855</td>
<td>https://ror.org/01swzsf04</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1320</td>
</tr>
<tr>
<td>40079</td>
<td>7687377827846095855</td>
<td>https://ror.org/019whta54</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1320</td>
</tr>
<tr>
<td>40080</td>
<td>7687377827846095855</td>
<td>https://ror.org/00vasag41</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1320</td>
</tr>
<tr>
<td>40081</td>
<td>7687377827846095855</td>
<td>https://ror.org/05r0ap620</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1320</td>
</tr>
<tr>
<td>40082</td>
<td>7687377827846095855</td>
<td>https://ror.org/05pmsvm27</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1320</td>
</tr>
</tbody>
</table>
<p>40083 rows × 5 columns</p>
</div>
```python
# renommer les champs de licence
del terms_export_dedup_fin['licence_x']
terms_export_dedup_fin = terms_export_dedup_fin.rename(columns = {'licence_y' : 'licence'})
```
```python
terms_export_fin = terms_export_dedup_fin[['version', 'cost_factor', 'embargo_months', 'ir_archiving', 'licence', 'comment', 'id', 'source']]
terms_export_fin
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>version</th>
<th>cost_factor</th>
<th>embargo_months</th>
<th>ir_archiving</th>
<th>licence</th>
<th>comment</th>
<th>id</th>
<th>source</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>1</td>
<td>999999</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>1</td>
<td></td>
</tr>
<tr>
<td>1</td>
<td>2</td>
<td>999999</td>
<td>12</td>
<td>1</td>
<td>999999</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>2</td>
<td></td>
</tr>
<tr>
<td>2</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Institutional archiving locations: Any Website...</td>
<td>3</td>
<td></td>
</tr>
<tr>
<td>3</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>1</td>
<td>2</td>
<td>Institutional archiving locations: Any Website...</td>
<td>4</td>
<td></td>
</tr>
<tr>
<td>4</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>0</td>
<td>999999</td>
<td>Non institutional archiving locations: ChemRxi...</td>
<td>5</td>
<td></td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>1315</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1, 2</td>
<td>Elsevier Read &amp; Publish agreement</td>
<td>1316</td>
<td></td>
</tr>
<tr>
<td>1316</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1, 4</td>
<td>NaN</td>
<td>1317</td>
<td></td>
</tr>
<tr>
<td>1317</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1, 4, 2</td>
<td>Wiley Read &amp; Publish agreement</td>
<td>1318</td>
<td></td>
</tr>
<tr>
<td>1318</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Taylor and Francis Read &amp; Publish agreement</td>
<td>1319</td>
<td></td>
</tr>
<tr>
<td>1319</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>1</td>
<td>1, 4, 2, 5</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>1320</td>
<td></td>
</tr>
</tbody>
</table>
<p>1320 rows × 8 columns</p>
</div>
```python
# export de la table
result = terms_export_fin.to_json(orient='records', force_ascii=False)
parsed = json.loads(result)
with open('sample/term.json', 'w', encoding='utf-8') as file:
json.dump(parsed, file, indent=2, ensure_ascii=False)
```
```python
# export csv
terms_export_fin.to_csv('sample/term.tsv', index=False)
```
```python
# export excel
terms_export_fin.to_excel('sample/term.xlsx', index=False)
```
## Table condition_type
```python
# Journal-only, Organization-only, Journal-organization agreement
col_names = ['id',
'condition_issuer'
]
condition_type = pd.DataFrame(columns = col_names)
condition_type = condition_type.append({'id' : 1, 'condition_issuer' : 'Journal-only'}, ignore_index=True)
condition_type = condition_type.append({'id' : 2, 'condition_issuer' : 'Organization-only'}, ignore_index=True)
condition_type = condition_type.append({'id' : 3, 'condition_issuer' : 'Journal-organization agreement'}, ignore_index=True)
condition_type
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id</th>
<th>condition_issuer</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>Journal-only</td>
</tr>
<tr>
<td>1</td>
<td>2</td>
<td>Organization-only</td>
</tr>
<tr>
<td>2</td>
<td>3</td>
<td>Journal-organization agreement</td>
</tr>
</tbody>
</table>
</div>
```python
# export de la table
result = condition_type.to_json(orient='records', force_ascii=False)
parsed = json.loads(result)
with open('sample/condition_type.json', 'w', encoding='utf-8') as file:
json.dump(parsed, file, indent=2, ensure_ascii=False)
```
```python
# export csv
condition_type.to_csv('sample/condition_type.tsv', index=False)
```
```python
# export excel
condition_type.to_excel('sample/condition_type.xlsx', index=False)
```
## Table organization
```python
# extraction des organizations (funders)
sherpa
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>journal</th>
<th>issn</th>
<th>sherpa_id</th>
<th>sherpa_uri</th>
<th>open_access_prohibited</th>
<th>additional_oa_fee</th>
<th>article_version</th>
<th>sherpa_code</th>
<th>embargo</th>
<th>prerequisites</th>
<th>prerequisite_funders</th>
<th>prerequisite_funders_name</th>
<th>prerequisite_funders_fundref</th>
<th>prerequisite_funders_ror</th>
<th>prerequisite_funders_country</th>
<th>prerequisite_funders_url</th>
<th>prerequisite_funders_sherpa_id</th>
<th>prerequisite_subjects</th>
<th>location</th>
<th>locations_ir</th>
<th>locations_not_ir</th>
<th>named_repository</th>
<th>named_academic_social_network</th>
<th>copyright_owner</th>
<th>publisher_deposit</th>
<th>archiving</th>
<th>conditions</th>
<th>public_notes</th>
<th>id</th>
<th>issnl</th>
<th>version</th>
<th>licence</th>
<th>cost_factor</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/2050</td>
<td>no</td>
<td>no</td>
<td>submitted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; named_repository ; non_comm...</td>
<td>Non-Commercial Institutional Repository</td>
<td>Author's Homepage ; arXiv ; AgEcon ; PhilPaper...</td>
<td>arXiv ; AgEcon ; PhilPapers ; PubMed Central ;...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must acknowledge acceptance for publication ; ...</td>
<td>NaN</td>
<td>1</td>
<td>0001-2815</td>
<td>1</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>1</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/2050</td>
<td>no</td>
<td>no</td>
<td>accepted</td>
<td>NaN</td>
<td>12</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; named_repository ; non_comm...</td>
<td>Non-Commercial Institutional Repository</td>
<td>Author's Homepage ; arXiv ; AgEcon ; PhilPaper...</td>
<td>arXiv ; AgEcon ; PhilPapers ; PubMed Central ;...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Publisher source must be acknowledged with cit...</td>
<td>NaN</td>
<td>2</td>
<td>0001-2815</td>
<td>2</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>2</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/3315</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_website ; institutional_repository ; named...</td>
<td>Any Website ; Institutional Repository</td>
<td>PubMed Central ; Subject Repository ; Journal ...</td>
<td>PubMed Central</td>
<td>NaN</td>
<td>authors</td>
<td>disciplinary (PubMed Central) ;</td>
<td>True</td>
<td>Published source must be acknowledged</td>
<td>NaN</td>
<td>3</td>
<td>0001-2815</td>
<td>3</td>
<td>1.0</td>
<td>355.0</td>
</tr>
<tr>
<td>3</td>
<td>532</td>
<td>0001-2815</td>
<td>11905</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/3315</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by_nc_nd</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_website ; named_repository ; non_commercia...</td>
<td>Any Website ; Non-Commercial Institutional Rep...</td>
<td>PubMed Central ; Non-Commercial Subject Reposi...</td>
<td>PubMed Central</td>
<td>NaN</td>
<td>authors</td>
<td>disciplinary (PubMed Central) ;</td>
<td>True</td>
<td>Published source must be acknowledged</td>
<td>NaN</td>
<td>4</td>
<td>0001-2815</td>
<td>3</td>
<td>2.0</td>
<td>355.0</td>
</tr>
<tr>
<td>4</td>
<td>498</td>
<td>0001-4842</td>
<td>7760</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/4</td>
<td>no</td>
<td>no</td>
<td>submitted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>named_repository ; preprint_repository ; subje...</td>
<td>NaN</td>
<td>ChemRxiv ; bioRxiv ; arXiv ; Preprint Reposito...</td>
<td>ChemRxiv ; bioRxiv ; arXiv</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>False</td>
<td>Must not violate ACS ethical Guidelines ; Must...</td>
<td>NaN</td>
<td>5</td>
<td>0001-4842</td>
<td>1</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>8590</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>no</td>
<td>submitted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; institutional_repository ; ...</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>8591</td>
<td>2475-9953</td>
<td>1</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>8591</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>no</td>
<td>accepted</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; institutional_repository ; ...</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>8592</td>
<td>2475-9953</td>
<td>2</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>8592</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>no</td>
<td>published</td>
<td>NaN</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>authors_homepage ; institutional_repository ; ...</td>
<td>Institutional Repository ; Institutional Website</td>
<td>Author's Homepage</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>Must link to published article ; Publisher cop...</td>
<td>NaN</td>
<td>8593</td>
<td>2475-9953</td>
<td>3</td>
<td>NaN</td>
<td>580.0</td>
</tr>
<tr>
<td>8593</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_repository ; this_journal</td>
<td>Any Repository</td>
<td>Journal Website</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>NaN</td>
<td>NaN</td>
<td>8594</td>
<td>2475-9953</td>
<td>3</td>
<td>1.0</td>
<td>580.0</td>
</tr>
<tr>
<td>8594</td>
<td>608</td>
<td>2475-9953</td>
<td>33503</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/10</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>any_repository ; this_journal</td>
<td>Any Repository</td>
<td>Journal Website</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>True</td>
<td>NaN</td>
<td>NaN</td>
<td>8595</td>
<td>2475-9953</td>
<td>3</td>
<td>1.0</td>
<td>580.0</td>
</tr>
</tbody>
</table>
<p>8595 rows × 33 columns</p>
</div>
```python
sherpa.loc[sherpa['prerequisite_funders'].notna()]
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>journal</th>
<th>issn</th>
<th>sherpa_id</th>
<th>sherpa_uri</th>
<th>open_access_prohibited</th>
<th>additional_oa_fee</th>
<th>article_version</th>
<th>sherpa_code</th>
<th>embargo</th>
<th>prerequisites</th>
<th>prerequisite_funders</th>
<th>prerequisite_funders_name</th>
<th>prerequisite_funders_fundref</th>
<th>prerequisite_funders_ror</th>
<th>prerequisite_funders_country</th>
<th>prerequisite_funders_url</th>
<th>prerequisite_funders_sherpa_id</th>
<th>prerequisite_subjects</th>
<th>location</th>
<th>locations_ir</th>
<th>locations_not_ir</th>
<th>named_repository</th>
<th>named_academic_social_network</th>
<th>copyright_owner</th>
<th>publisher_deposit</th>
<th>archiving</th>
<th>conditions</th>
<th>public_notes</th>
<th>id</th>
<th>issnl</th>
<th>version</th>
<th>licence</th>
<th>cost_factor</th>
</tr>
</thead>
<tbody>
<tr>
<td>16</td>
<td>789</td>
<td>0001-4966</td>
<td>4049</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/126</td>
<td>no</td>
<td>no</td>
<td>published</td>
<td>NaN</td>
<td>12</td>
<td>NaN</td>
<td>True</td>
<td>National Institutes of Health (NIH)</td>
<td>http://dx.doi.org/10.13039/100000002</td>
<td>https://ror.org/01cwqze88</td>
<td>us</td>
<td>http://www.nih.gov/</td>
<td>9.0</td>
<td>NaN</td>
<td>named_repository</td>
<td>NaN</td>
<td>PubMed Central</td>
<td>PubMed Central</td>
<td>NaN</td>
<td>NaN</td>
<td>disciplinary (PubMed Central) ;</td>
<td>False</td>
<td>NaN</td>
<td>NaN</td>
<td>17</td>
<td>0001-4966</td>
<td>3</td>
<td>NaN</td>
<td>357.0</td>
</tr>
<tr>
<td>28</td>
<td>668</td>
<td>0002-0729</td>
<td>1334</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/1107</td>
<td>no</td>
<td>no</td>
<td>accepted</td>
<td>NaN</td>
<td>12</td>
<td>NaN</td>
<td>True</td>
<td>National Institutes of Health (NIH)</td>
<td>http://dx.doi.org/10.13039/100000002</td>
<td>https://ror.org/01cwqze88</td>
<td>us</td>
<td>http://www.nih.gov/</td>
<td>9.0</td>
<td>NaN</td>
<td>named_repository</td>
<td>NaN</td>
<td>PubMed Central</td>
<td>PubMed Central</td>
<td>NaN</td>
<td>NaN</td>
<td>disciplinary (PubMed Central) ;</td>
<td>False</td>
<td>NaN</td>
<td>NaN</td>
<td>29</td>
<td>0002-0729</td>
<td>2</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>58</td>
<td>985</td>
<td>0002-9343</td>
<td>12950</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/3323</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>True</td>
<td>Wellcome Trust</td>
<td>http://dx.doi.org/10.13039/100004440</td>
<td>https://ror.org/029chgv08</td>
<td>gb</td>
<td>http://www.wellcome.ac.uk/</td>
<td>695.0</td>
<td>NaN</td>
<td>institutional_repository ; named_repository ; ...</td>
<td>Institutional Repository</td>
<td>PubMed Central ; Research for Development Repo...</td>
<td>PubMed Central ; Research for Development Repo...</td>
<td>NaN</td>
<td>NaN</td>
<td>disciplinary (PubMed Central) ;</td>
<td>True</td>
<td>Published source must be acknowledged with cit...</td>
<td>NaN</td>
<td>59</td>
<td>0002-9343</td>
<td>3</td>
<td>1.0</td>
<td>223.0</td>
</tr>
<tr>
<td>59</td>
<td>985</td>
<td>0002-9343</td>
<td>12950</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/3323</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>True</td>
<td>British Heart Foundation (BHF)</td>
<td>http://dx.doi.org/10.13039/501100000274</td>
<td>https://ror.org/02wdwnk04</td>
<td>gb</td>
<td>http://www.bhf.org.uk/</td>
<td>18.0</td>
<td>NaN</td>
<td>institutional_repository ; named_repository ; ...</td>
<td>Institutional Repository</td>
<td>PubMed Central ; Research for Development Repo...</td>
<td>PubMed Central ; Research for Development Repo...</td>
<td>NaN</td>
<td>NaN</td>
<td>disciplinary (PubMed Central) ;</td>
<td>True</td>
<td>Published source must be acknowledged with cit...</td>
<td>NaN</td>
<td>60</td>
<td>0002-9343</td>
<td>3</td>
<td>1.0</td>
<td>223.0</td>
</tr>
<tr>
<td>60</td>
<td>985</td>
<td>0002-9343</td>
<td>12950</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/3323</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>True</td>
<td>Versus Arthritis</td>
<td>http://dx.doi.org/10.13039/501100000341</td>
<td>https://ror.org/02jkpm469</td>
<td>gb</td>
<td>https://www.versusarthritis.org/</td>
<td>14.0</td>
<td>NaN</td>
<td>institutional_repository ; named_repository ; ...</td>
<td>Institutional Repository</td>
<td>PubMed Central ; Research for Development Repo...</td>
<td>PubMed Central ; Research for Development Repo...</td>
<td>NaN</td>
<td>NaN</td>
<td>disciplinary (PubMed Central) ;</td>
<td>True</td>
<td>Published source must be acknowledged with cit...</td>
<td>NaN</td>
<td>61</td>
<td>0002-9343</td>
<td>3</td>
<td>1.0</td>
<td>223.0</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>8510</td>
<td>990</td>
<td>2211-2855</td>
<td>20490</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/3323</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>True</td>
<td>European Research Council (ERC)</td>
<td>http://dx.doi.org/10.13039/501100000781</td>
<td>https://ror.org/0472cxd90</td>
<td>be</td>
<td>http://erc.europa.eu/</td>
<td>31.0</td>
<td>NaN</td>
<td>institutional_repository ; named_repository ; ...</td>
<td>Institutional Repository</td>
<td>PubMed Central ; Research for Development Repo...</td>
<td>PubMed Central ; Research for Development Repo...</td>
<td>NaN</td>
<td>NaN</td>
<td>disciplinary (PubMed Central) ;</td>
<td>True</td>
<td>Published source must be acknowledged with cit...</td>
<td>NaN</td>
<td>8511</td>
<td>2211-2855</td>
<td>3</td>
<td>1.0</td>
<td>352.0</td>
</tr>
<tr>
<td>8511</td>
<td>990</td>
<td>2211-2855</td>
<td>20490</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/3323</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>True</td>
<td>Medical Research Council (MRC)</td>
<td>http://dx.doi.org/10.13039/501100000265</td>
<td>https://ror.org/03x94j517</td>
<td>gb</td>
<td>http://www.mrc.ac.uk/index.htm</td>
<td>705.0</td>
<td>NaN</td>
<td>institutional_repository ; named_repository ; ...</td>
<td>Institutional Repository</td>
<td>PubMed Central ; Research for Development Repo...</td>
<td>PubMed Central ; Research for Development Repo...</td>
<td>NaN</td>
<td>NaN</td>
<td>disciplinary (PubMed Central) ;</td>
<td>True</td>
<td>Published source must be acknowledged with cit...</td>
<td>NaN</td>
<td>8512</td>
<td>2211-2855</td>
<td>3</td>
<td>1.0</td>
<td>352.0</td>
</tr>
<tr>
<td>8512</td>
<td>990</td>
<td>2211-2855</td>
<td>20490</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/3323</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>True</td>
<td>Motor Neuron Disease Association (MND Associat...</td>
<td>http://dx.doi.org/10.13039/501100000406</td>
<td>https://ror.org/02gq0fg61</td>
<td>gb</td>
<td>http://www.mndassociation.org/</td>
<td>562.0</td>
<td>NaN</td>
<td>institutional_repository ; named_repository ; ...</td>
<td>Institutional Repository</td>
<td>PubMed Central ; Research for Development Repo...</td>
<td>PubMed Central ; Research for Development Repo...</td>
<td>NaN</td>
<td>NaN</td>
<td>disciplinary (PubMed Central) ;</td>
<td>True</td>
<td>Published source must be acknowledged with cit...</td>
<td>NaN</td>
<td>8513</td>
<td>2211-2855</td>
<td>3</td>
<td>1.0</td>
<td>352.0</td>
</tr>
<tr>
<td>8513</td>
<td>990</td>
<td>2211-2855</td>
<td>20490</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/3323</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>True</td>
<td>Parkinson's UK</td>
<td>http://dx.doi.org/10.13039/501100000304</td>
<td>https://ror.org/02417p338</td>
<td>gb</td>
<td>http://www.parkinsons.org.uk/</td>
<td>411.0</td>
<td>NaN</td>
<td>institutional_repository ; named_repository ; ...</td>
<td>Institutional Repository</td>
<td>PubMed Central ; Research for Development Repo...</td>
<td>PubMed Central ; Research for Development Repo...</td>
<td>NaN</td>
<td>NaN</td>
<td>disciplinary (PubMed Central) ;</td>
<td>True</td>
<td>Published source must be acknowledged with cit...</td>
<td>NaN</td>
<td>8514</td>
<td>2211-2855</td>
<td>3</td>
<td>1.0</td>
<td>352.0</td>
</tr>
<tr>
<td>8514</td>
<td>990</td>
<td>2211-2855</td>
<td>20490</td>
<td>https://v2.sherpa.ac.uk/id/publisher_policy/3323</td>
<td>no</td>
<td>yes</td>
<td>published</td>
<td>cc_by</td>
<td>0</td>
<td>NaN</td>
<td>True</td>
<td>Telethon Foundation</td>
<td>http://dx.doi.org/10.13039/501100002426</td>
<td>https://ror.org/04xraxn18</td>
<td>it</td>
<td>https://www.telethon.it/en/</td>
<td>325.0</td>
<td>NaN</td>
<td>institutional_repository ; named_repository ; ...</td>
<td>Institutional Repository</td>
<td>PubMed Central ; Research for Development Repo...</td>
<td>PubMed Central ; Research for Development Repo...</td>
<td>NaN</td>
<td>NaN</td>
<td>disciplinary (PubMed Central) ;</td>
<td>True</td>
<td>Published source must be acknowledged with cit...</td>
<td>NaN</td>
<td>8515</td>
<td>2211-2855</td>
<td>3</td>
<td>1.0</td>
<td>352.0</td>
</tr>
</tbody>
</table>
<p>5585 rows × 33 columns</p>
</div>
```python
sherpa['prerequisite_funders'].value_counts()
```
True 5585
Name: prerequisite_funders, dtype: int64
```python
funders = sherpa.loc[sherpa['prerequisite_funders'].notna()][['prerequisite_funders_name', 'prerequisite_funders_fundref', 'prerequisite_funders_ror', 'prerequisite_funders_country', 'prerequisite_funders_url', 'prerequisite_funders_sherpa_id']]
funders
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>prerequisite_funders_name</th>
<th>prerequisite_funders_fundref</th>
<th>prerequisite_funders_ror</th>
<th>prerequisite_funders_country</th>
<th>prerequisite_funders_url</th>
<th>prerequisite_funders_sherpa_id</th>
</tr>
</thead>
<tbody>
<tr>
<td>16</td>
<td>National Institutes of Health (NIH)</td>
<td>http://dx.doi.org/10.13039/100000002</td>
<td>https://ror.org/01cwqze88</td>
<td>us</td>
<td>http://www.nih.gov/</td>
<td>9.0</td>
</tr>
<tr>
<td>28</td>
<td>National Institutes of Health (NIH)</td>
<td>http://dx.doi.org/10.13039/100000002</td>
<td>https://ror.org/01cwqze88</td>
<td>us</td>
<td>http://www.nih.gov/</td>
<td>9.0</td>
</tr>
<tr>
<td>58</td>
<td>Wellcome Trust</td>
<td>http://dx.doi.org/10.13039/100004440</td>
<td>https://ror.org/029chgv08</td>
<td>gb</td>
<td>http://www.wellcome.ac.uk/</td>
<td>695.0</td>
</tr>
<tr>
<td>59</td>
<td>British Heart Foundation (BHF)</td>
<td>http://dx.doi.org/10.13039/501100000274</td>
<td>https://ror.org/02wdwnk04</td>
<td>gb</td>
<td>http://www.bhf.org.uk/</td>
<td>18.0</td>
</tr>
<tr>
<td>60</td>
<td>Versus Arthritis</td>
<td>http://dx.doi.org/10.13039/501100000341</td>
<td>https://ror.org/02jkpm469</td>
<td>gb</td>
<td>https://www.versusarthritis.org/</td>
<td>14.0</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>8510</td>
<td>European Research Council (ERC)</td>
<td>http://dx.doi.org/10.13039/501100000781</td>
<td>https://ror.org/0472cxd90</td>
<td>be</td>
<td>http://erc.europa.eu/</td>
<td>31.0</td>
</tr>
<tr>
<td>8511</td>
<td>Medical Research Council (MRC)</td>
<td>http://dx.doi.org/10.13039/501100000265</td>
<td>https://ror.org/03x94j517</td>
<td>gb</td>
<td>http://www.mrc.ac.uk/index.htm</td>
<td>705.0</td>
</tr>
<tr>
<td>8512</td>
<td>Motor Neuron Disease Association (MND Associat...</td>
<td>http://dx.doi.org/10.13039/501100000406</td>
<td>https://ror.org/02gq0fg61</td>
<td>gb</td>
<td>http://www.mndassociation.org/</td>
<td>562.0</td>
</tr>
<tr>
<td>8513</td>
<td>Parkinson's UK</td>
<td>http://dx.doi.org/10.13039/501100000304</td>
<td>https://ror.org/02417p338</td>
<td>gb</td>
<td>http://www.parkinsons.org.uk/</td>
<td>411.0</td>
</tr>
<tr>
<td>8514</td>
<td>Telethon Foundation</td>
<td>http://dx.doi.org/10.13039/501100002426</td>
<td>https://ror.org/04xraxn18</td>
<td>it</td>
<td>https://www.telethon.it/en/</td>
<td>325.0</td>
</tr>
</tbody>
</table>
<p>5585 rows × 6 columns</p>
</div>
```python
funders_dedup = funders.drop_duplicates(subset='prerequisite_funders_ror')
funders_dedup
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>prerequisite_funders_name</th>
<th>prerequisite_funders_fundref</th>
<th>prerequisite_funders_ror</th>
<th>prerequisite_funders_country</th>
<th>prerequisite_funders_url</th>
<th>prerequisite_funders_sherpa_id</th>
</tr>
</thead>
<tbody>
<tr>
<td>16</td>
<td>National Institutes of Health (NIH)</td>
<td>http://dx.doi.org/10.13039/100000002</td>
<td>https://ror.org/01cwqze88</td>
<td>us</td>
<td>http://www.nih.gov/</td>
<td>9.0</td>
</tr>
<tr>
<td>58</td>
<td>Wellcome Trust</td>
<td>http://dx.doi.org/10.13039/100004440</td>
<td>https://ror.org/029chgv08</td>
<td>gb</td>
<td>http://www.wellcome.ac.uk/</td>
<td>695.0</td>
</tr>
<tr>
<td>59</td>
<td>British Heart Foundation (BHF)</td>
<td>http://dx.doi.org/10.13039/501100000274</td>
<td>https://ror.org/02wdwnk04</td>
<td>gb</td>
<td>http://www.bhf.org.uk/</td>
<td>18.0</td>
</tr>
<tr>
<td>60</td>
<td>Versus Arthritis</td>
<td>http://dx.doi.org/10.13039/501100000341</td>
<td>https://ror.org/02jkpm469</td>
<td>gb</td>
<td>https://www.versusarthritis.org/</td>
<td>14.0</td>
</tr>
<tr>
<td>61</td>
<td>Biotechnology and Biological Sciences Research...</td>
<td>http://dx.doi.org/10.13039/501100000268</td>
<td>https://ror.org/00cwqg982</td>
<td>gb</td>
<td>http://www.bbsrc.ac.uk/home/home.aspx</td>
<td>709.0</td>
</tr>
<tr>
<td>62</td>
<td>Blood Cancer UK</td>
<td>http://dx.doi.org/10.13039/501100007903</td>
<td>https://ror.org/0055acf80</td>
<td>gb</td>
<td>https://bloodcancer.org.uk/</td>
<td>925.0</td>
</tr>
<tr>
<td>63</td>
<td>Bill &amp; Melinda Gates Foundation</td>
<td>http://dx.doi.org/10.13039/100000865</td>
<td>https://ror.org/0456r8d26</td>
<td>us</td>
<td>http://www.gatesfoundation.org/</td>
<td>961.0</td>
</tr>
<tr>
<td>64</td>
<td>Cancer Research UK</td>
<td>http://dx.doi.org/10.13039/501100000289</td>
<td>https://ror.org/054225q67</td>
<td>gb</td>
<td>http://www.cancerresearchuk.org/</td>
<td>19.0</td>
</tr>
<tr>
<td>65</td>
<td>Chief Scientist Office, Scottish Executive (CSO)</td>
<td>http://dx.doi.org/10.13039/501100000589</td>
<td>https://ror.org/01613vh25</td>
<td>gb</td>
<td>http://www.cso.scot.nhs.uk/</td>
<td>16.0</td>
</tr>
<tr>
<td>66</td>
<td>Department of Health (DH)</td>
<td>http://dx.doi.org/10.13039/501100000272</td>
<td>https://ror.org/0187kwz08</td>
<td>gb</td>
<td>http://www.dh.gov.uk/en/index.htm</td>
<td>943.0</td>
</tr>
<tr>
<td>67</td>
<td>Dunhill Medical Trust (DMT)</td>
<td>http://dx.doi.org/10.13039/501100000377</td>
<td>https://ror.org/05ayqqv15</td>
<td>gb</td>
<td>https://dunhillmedical.org.uk/</td>
<td>410.0</td>
</tr>
<tr>
<td>68</td>
<td>European Research Council (ERC)</td>
<td>http://dx.doi.org/10.13039/501100000781</td>
<td>https://ror.org/0472cxd90</td>
<td>be</td>
<td>http://erc.europa.eu/</td>
<td>31.0</td>
</tr>
<tr>
<td>69</td>
<td>Medical Research Council (MRC)</td>
<td>http://dx.doi.org/10.13039/501100000265</td>
<td>https://ror.org/03x94j517</td>
<td>gb</td>
<td>http://www.mrc.ac.uk/index.htm</td>
<td>705.0</td>
</tr>
<tr>
<td>70</td>
<td>Motor Neuron Disease Association (MND Associat...</td>
<td>http://dx.doi.org/10.13039/501100000406</td>
<td>https://ror.org/02gq0fg61</td>
<td>gb</td>
<td>http://www.mndassociation.org/</td>
<td>562.0</td>
</tr>
<tr>
<td>71</td>
<td>Parkinson's UK</td>
<td>http://dx.doi.org/10.13039/501100000304</td>
<td>https://ror.org/02417p338</td>
<td>gb</td>
<td>http://www.parkinsons.org.uk/</td>
<td>411.0</td>
</tr>
<tr>
<td>72</td>
<td>Telethon Foundation</td>
<td>http://dx.doi.org/10.13039/501100002426</td>
<td>https://ror.org/04xraxn18</td>
<td>it</td>
<td>https://www.telethon.it/en/</td>
<td>325.0</td>
</tr>
<tr>
<td>99</td>
<td>Howard Hughes Medical Institute (HHMI)</td>
<td>http://dx.doi.org/10.13039/100000011</td>
<td>https://ror.org/006w34k90</td>
<td>us</td>
<td>http://www.hhmi.org/</td>
<td>24.0</td>
</tr>
<tr>
<td>149</td>
<td>Arts and Humanities Research Council (AHRC)</td>
<td>http://dx.doi.org/10.13039/501100000267</td>
<td>https://ror.org/0505m1554</td>
<td>gb</td>
<td>http://www.ahrc.ac.uk/Pages/Home.aspx</td>
<td>698.0</td>
</tr>
<tr>
<td>150</td>
<td>Austrian Science Fund (FWF)</td>
<td>http://dx.doi.org/10.13039/501100002428</td>
<td>https://ror.org/013tf3c58</td>
<td>at</td>
<td>http://www.fwf.ac.at/en/</td>
<td>13.0</td>
</tr>
<tr>
<td>153</td>
<td>Breast Cancer Now</td>
<td>http://dx.doi.org/10.13039/501100007913</td>
<td>https://ror.org/02qa92s63</td>
<td>gb</td>
<td>http://breastcancernow.org/</td>
<td>1065.0</td>
</tr>
<tr>
<td>156</td>
<td>Engineering and Physical Sciences Research Cou...</td>
<td>http://dx.doi.org/10.13039/501100000266</td>
<td>https://ror.org/0439y7842</td>
<td>gb</td>
<td>http://www.epsrc.ac.uk/Pages/default.aspx</td>
<td>722.0</td>
</tr>
<tr>
<td>159</td>
<td>Natural Environment Research Council (NERC)</td>
<td>http://dx.doi.org/10.13039/501100000270</td>
<td>https://ror.org/02b5d8509</td>
<td>gb</td>
<td>https://nerc.ukri.org/</td>
<td>726.0</td>
</tr>
<tr>
<td>162</td>
<td>Science and Technology Facilities Council (STFC)</td>
<td>http://dx.doi.org/10.13039/501100000271</td>
<td>https://ror.org/057g20z61</td>
<td>gb</td>
<td>http://www.stfc.ac.uk/</td>
<td>716.0</td>
</tr>
<tr>
<td>164</td>
<td>Vetenskapsrådet</td>
<td>http://dx.doi.org/10.13039/501100004359</td>
<td>https://ror.org/03zttf063</td>
<td>se</td>
<td>http://www.vr.se/</td>
<td>302.0</td>
</tr>
<tr>
<td>165</td>
<td>World Health Organization (WHO)</td>
<td>http://dx.doi.org/10.13039/100004423</td>
<td>https://ror.org/01f80g185</td>
<td>ch</td>
<td>http://www.who.int/</td>
<td>903.0</td>
</tr>
<tr>
<td>166</td>
<td>World Bank</td>
<td>http://dx.doi.org/10.13039/100004421</td>
<td>https://ror.org/00ae7jd04</td>
<td>us</td>
<td>http://www.worldbank.org/</td>
<td>525.0</td>
</tr>
<tr>
<td>167</td>
<td>Yorkshire Cancer Research</td>
<td>http://dx.doi.org/10.13039/501100002653</td>
<td>https://ror.org/02cddnn97</td>
<td>gb</td>
<td>http://www.yorkshirecancerresearch.org.uk/</td>
<td>428.0</td>
</tr>
<tr>
<td>169</td>
<td>Economic and Social Research Council (ESRC)</td>
<td>http://dx.doi.org/10.13039/501100000269</td>
<td>https://ror.org/03n0ht308</td>
<td>gb</td>
<td>http://www.esrc.ac.uk/</td>
<td>717.0</td>
</tr>
<tr>
<td>418</td>
<td>Higher Education Funding Council for England (...</td>
<td>http://dx.doi.org/10.13039/501100000384</td>
<td>https://ror.org/02wxr8x18</td>
<td>gb</td>
<td>http://www.hefce.ac.uk/</td>
<td>877.0</td>
</tr>
<tr>
<td>419</td>
<td>Higher Education Funding Council for Wales (HE...</td>
<td>http://dx.doi.org/10.13039/501100000383</td>
<td>https://ror.org/056y81r79</td>
<td>gb</td>
<td>http://www.hefcw.ac.uk/home/home.aspx</td>
<td>881.0</td>
</tr>
<tr>
<td>420</td>
<td>Scottish Funding Council (SFC)</td>
<td>http://dx.doi.org/10.13039/501100000360</td>
<td>https://ror.org/056bwcz71</td>
<td>gb</td>
<td>http://www.sfc.ac.uk/</td>
<td>887.0</td>
</tr>
<tr>
<td>421</td>
<td>Department for the Economy, Northern Ireland</td>
<td>http://dx.doi.org/10.13039/100008303</td>
<td>https://ror.org/05w9mt194</td>
<td>gb</td>
<td>https://www.economy-ni.gov.uk/</td>
<td>884.0</td>
</tr>
<tr>
<td>960</td>
<td>Academy of Finland</td>
<td>http://dx.doi.org/10.13039/501100002341</td>
<td>https://ror.org/05k73zm37</td>
<td>fi</td>
<td>https://www.aka.fi/en/</td>
<td>1248.0</td>
</tr>
<tr>
<td>961</td>
<td>Agence Nationale de la Recherche (ANR)</td>
<td>http://dx.doi.org/10.13039/501100001665</td>
<td>https://ror.org/00rbzpz17</td>
<td>fr</td>
<td>http://www.agence-nationale-recherche.fr/</td>
<td>30.0</td>
</tr>
<tr>
<td>963</td>
<td>Fundação para a Ciência e a Tecnologia</td>
<td>http://dx.doi.org/10.13039/501100001871</td>
<td>https://ror.org/00snfqn58</td>
<td>pt</td>
<td>http://www.fct.pt/</td>
<td>1109.0</td>
</tr>
<tr>
<td>964</td>
<td>Formas</td>
<td>http://dx.doi.org/10.13039/501100001862</td>
<td>https://ror.org/03pjs1y45</td>
<td>se</td>
<td>http://www.formas.se/</td>
<td>452.0</td>
</tr>
<tr>
<td>967</td>
<td>Nederlandse Organisatie voor Wetenschappelijk ...</td>
<td>http://dx.doi.org/10.13039/501100003246</td>
<td>https://ror.org/04jsz6e67</td>
<td>nl</td>
<td>http://www.nwo.nl/</td>
<td>459.0</td>
</tr>
<tr>
<td>968</td>
<td>Science Foundation Ireland (SFI)</td>
<td>http://dx.doi.org/10.13039/501100001602</td>
<td>https://ror.org/0271asj38</td>
<td>ie</td>
<td>http://www.sfi.ie/</td>
<td>210.0</td>
</tr>
<tr>
<td>970</td>
<td>Research Council of Norway</td>
<td>http://dx.doi.org/10.13039/501100005416</td>
<td>https://ror.org/00epmv149</td>
<td>no</td>
<td>https://www.forskningsradet.no/en/</td>
<td>266.0</td>
</tr>
<tr>
<td>971</td>
<td>Forskningsrådet för hälsa, arbetsliv och välfä...</td>
<td>http://dx.doi.org/10.13039/501100006636</td>
<td>https://ror.org/02d290r06</td>
<td>se</td>
<td>http://www.forte.se/</td>
<td>455.0</td>
</tr>
<tr>
<td>978</td>
<td>Innovate UK</td>
<td>http://dx.doi.org/10.13039/501100000266</td>
<td>https://ror.org/05ar5fy68</td>
<td>gb</td>
<td>https://www.gov.uk/government/organisations/in...</td>
<td>1267.0</td>
</tr>
<tr>
<td>1048</td>
<td>Diabetes UK</td>
<td>http://dx.doi.org/10.13039/501100000361</td>
<td>https://ror.org/050rgn017</td>
<td>gb</td>
<td>http://www.diabetes.org.uk/</td>
<td>492.0</td>
</tr>
<tr>
<td>1052</td>
<td>Marie Curie</td>
<td>http://dx.doi.org/10.13039/501100000654</td>
<td>https://ror.org/02aqv1x10</td>
<td>gb</td>
<td>http://www.mariecurie.org.uk/</td>
<td>595.0</td>
</tr>
<tr>
<td>1055</td>
<td>Action on Hearing Loss</td>
<td>http://dx.doi.org/10.13039/501100000703</td>
<td>https://ror.org/05w6qh410</td>
<td>gb</td>
<td>http://www.actiononhearingloss.org.uk/</td>
<td>412.0</td>
</tr>
<tr>
<td>1056</td>
<td>Alzheimer's Society</td>
<td>http://dx.doi.org/10.13039/501100000320</td>
<td>https://ror.org/0472gwq90</td>
<td>gb</td>
<td>http://alzheimers.org.uk/</td>
<td>443.0</td>
</tr>
<tr>
<td>1063</td>
<td>Multiple Sclerosis Society</td>
<td>http://dx.doi.org/10.13039/501100000381</td>
<td>https://ror.org/043fwdk81</td>
<td>gb</td>
<td>http://www.mssociety.org.uk/</td>
<td>745.0</td>
</tr>
<tr>
<td>1064</td>
<td>Myrovlytis Trust</td>
<td>http://dx.doi.org/10.13039/501100001291</td>
<td>https://ror.org/05bj02613</td>
<td>gb</td>
<td>http://www.myrovlytistrust.org/</td>
<td>858.0</td>
</tr>
<tr>
<td>1065</td>
<td>National Centre for the Replacement, Refinemen...</td>
<td>http://dx.doi.org/10.13039/501100000849</td>
<td>https://ror.org/02w0kg036</td>
<td>gb</td>
<td>http://www.nc3rs.org.uk/</td>
<td>859.0</td>
</tr>
<tr>
<td>1072</td>
<td>Worldwide Cancer Reseach</td>
<td>http://dx.doi.org/10.13039/100004423</td>
<td>https://ror.org/031tfbz57</td>
<td>gb</td>
<td>http://www.worldwidecancerresearch.org/</td>
<td>425.0</td>
</tr>
<tr>
<td>2219</td>
<td>Canadian Institutes of Health Research (CIHR)</td>
<td>http://dx.doi.org/10.13039/501100000024</td>
<td>https://ror.org/01gavpb45</td>
<td>ca</td>
<td>http://www.cihr-irsc.gc.ca/</td>
<td>28.0</td>
</tr>
<tr>
<td>5490</td>
<td>US Department of Energy (DOE)</td>
<td>http://dx.doi.org/10.13039/100000015</td>
<td>https://ror.org/01bj3aw27</td>
<td>us</td>
<td>http://energy.gov/</td>
<td>962.0</td>
</tr>
<tr>
<td>5491</td>
<td>Agency for Healthcare Research and Quality (AHRQ)</td>
<td>http://dx.doi.org/10.13039/100000133</td>
<td>https://ror.org/03jmfdf59</td>
<td>us</td>
<td>http://www.ahrq.gov/index.html</td>
<td>981.0</td>
</tr>
<tr>
<td>5492</td>
<td>Institute of Education Sciences (IES)</td>
<td>http://dx.doi.org/10.13039/100005246</td>
<td>https://ror.org/04et59085</td>
<td>us</td>
<td>http://ies.ed.gov/</td>
<td>291.0</td>
</tr>
<tr>
<td>5493</td>
<td>National Aeronautics and Space Administration ...</td>
<td>http://dx.doi.org/10.13039/100000104</td>
<td>https://ror.org/027ka1x80</td>
<td>us</td>
<td>http://science.nasa.gov/</td>
<td>986.0</td>
</tr>
<tr>
<td>5494</td>
<td>National Science Foundation (NSF)</td>
<td>http://dx.doi.org/10.13039/100000001</td>
<td>https://ror.org/021nxhr62</td>
<td>us</td>
<td>http://www.nsf.gov/</td>
<td>354.0</td>
</tr>
<tr>
<td>7232</td>
<td>Academy of Medical Science</td>
<td>http://dx.doi.org/10.13039/501100000691</td>
<td>https://ror.org/00c489v88</td>
<td>gb</td>
<td>https://acmedsci.ac.uk/</td>
<td>1125.0</td>
</tr>
<tr>
<td>7239</td>
<td>Prostate Cancer UK</td>
<td>http://dx.doi.org/10.13039/501100000771</td>
<td>https://ror.org/04dkv6329</td>
<td>gb</td>
<td>http://prostatecanceruk.org/</td>
<td>742.0</td>
</tr>
<tr>
<td>7240</td>
<td>Schweizerischer Nationalfonds zur Förderung de...</td>
<td>http://dx.doi.org/10.13039/501100001711</td>
<td>https://ror.org/00yjd3n13</td>
<td>ch</td>
<td>http://www.snf.ch/de/Seiten/default.aspx</td>
<td>25.0</td>
</tr>
</tbody>
</table>
</div>
```python
funders_dedup.shape[0]
```
58
```python
# export excel
funders_dedup.to_excel('sample/funders.xlsx', index=False)
```
```python
# export csv
funders_dedup.to_csv('sample/funders.tsv', index=False)
```
```python
# creation du DF
organization_funders = funders_dedup
organization_funders = organization_funders.rename(columns = {'prerequisite_funders_name' : 'name',
'prerequisite_funders_fundref' : 'fundref',
'prerequisite_funders_ror' : 'ror',
'prerequisite_funders_country' : 'iso_code',
'prerequisite_funders_url' : 'website',
'prerequisite_funders_sherpa_id' : 'sherpa_id'
})
organization_funders
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>name</th>
<th>fundref</th>
<th>ror</th>
<th>iso_code</th>
<th>website</th>
<th>sherpa_id</th>
</tr>
</thead>
<tbody>
<tr>
<td>16</td>
<td>National Institutes of Health (NIH)</td>
<td>http://dx.doi.org/10.13039/100000002</td>
<td>https://ror.org/01cwqze88</td>
<td>us</td>
<td>http://www.nih.gov/</td>
<td>9.0</td>
</tr>
<tr>
<td>58</td>
<td>Wellcome Trust</td>
<td>http://dx.doi.org/10.13039/100004440</td>
<td>https://ror.org/029chgv08</td>
<td>gb</td>
<td>http://www.wellcome.ac.uk/</td>
<td>695.0</td>
</tr>
<tr>
<td>59</td>
<td>British Heart Foundation (BHF)</td>
<td>http://dx.doi.org/10.13039/501100000274</td>
<td>https://ror.org/02wdwnk04</td>
<td>gb</td>
<td>http://www.bhf.org.uk/</td>
<td>18.0</td>
</tr>
<tr>
<td>60</td>
<td>Versus Arthritis</td>
<td>http://dx.doi.org/10.13039/501100000341</td>
<td>https://ror.org/02jkpm469</td>
<td>gb</td>
<td>https://www.versusarthritis.org/</td>
<td>14.0</td>
</tr>
<tr>
<td>61</td>
<td>Biotechnology and Biological Sciences Research...</td>
<td>http://dx.doi.org/10.13039/501100000268</td>
<td>https://ror.org/00cwqg982</td>
<td>gb</td>
<td>http://www.bbsrc.ac.uk/home/home.aspx</td>
<td>709.0</td>
</tr>
<tr>
<td>62</td>
<td>Blood Cancer UK</td>
<td>http://dx.doi.org/10.13039/501100007903</td>
<td>https://ror.org/0055acf80</td>
<td>gb</td>
<td>https://bloodcancer.org.uk/</td>
<td>925.0</td>
</tr>
<tr>
<td>63</td>
<td>Bill &amp; Melinda Gates Foundation</td>
<td>http://dx.doi.org/10.13039/100000865</td>
<td>https://ror.org/0456r8d26</td>
<td>us</td>
<td>http://www.gatesfoundation.org/</td>
<td>961.0</td>
</tr>
<tr>
<td>64</td>
<td>Cancer Research UK</td>
<td>http://dx.doi.org/10.13039/501100000289</td>
<td>https://ror.org/054225q67</td>
<td>gb</td>
<td>http://www.cancerresearchuk.org/</td>
<td>19.0</td>
</tr>
<tr>
<td>65</td>
<td>Chief Scientist Office, Scottish Executive (CSO)</td>
<td>http://dx.doi.org/10.13039/501100000589</td>
<td>https://ror.org/01613vh25</td>
<td>gb</td>
<td>http://www.cso.scot.nhs.uk/</td>
<td>16.0</td>
</tr>
<tr>
<td>66</td>
<td>Department of Health (DH)</td>
<td>http://dx.doi.org/10.13039/501100000272</td>
<td>https://ror.org/0187kwz08</td>
<td>gb</td>
<td>http://www.dh.gov.uk/en/index.htm</td>
<td>943.0</td>
</tr>
<tr>
<td>67</td>
<td>Dunhill Medical Trust (DMT)</td>
<td>http://dx.doi.org/10.13039/501100000377</td>
<td>https://ror.org/05ayqqv15</td>
<td>gb</td>
<td>https://dunhillmedical.org.uk/</td>
<td>410.0</td>
</tr>
<tr>
<td>68</td>
<td>European Research Council (ERC)</td>
<td>http://dx.doi.org/10.13039/501100000781</td>
<td>https://ror.org/0472cxd90</td>
<td>be</td>
<td>http://erc.europa.eu/</td>
<td>31.0</td>
</tr>
<tr>
<td>69</td>
<td>Medical Research Council (MRC)</td>
<td>http://dx.doi.org/10.13039/501100000265</td>
<td>https://ror.org/03x94j517</td>
<td>gb</td>
<td>http://www.mrc.ac.uk/index.htm</td>
<td>705.0</td>
</tr>
<tr>
<td>70</td>
<td>Motor Neuron Disease Association (MND Associat...</td>
<td>http://dx.doi.org/10.13039/501100000406</td>
<td>https://ror.org/02gq0fg61</td>
<td>gb</td>
<td>http://www.mndassociation.org/</td>
<td>562.0</td>
</tr>
<tr>
<td>71</td>
<td>Parkinson's UK</td>
<td>http://dx.doi.org/10.13039/501100000304</td>
<td>https://ror.org/02417p338</td>
<td>gb</td>
<td>http://www.parkinsons.org.uk/</td>
<td>411.0</td>
</tr>
<tr>
<td>72</td>
<td>Telethon Foundation</td>
<td>http://dx.doi.org/10.13039/501100002426</td>
<td>https://ror.org/04xraxn18</td>
<td>it</td>
<td>https://www.telethon.it/en/</td>
<td>325.0</td>
</tr>
<tr>
<td>99</td>
<td>Howard Hughes Medical Institute (HHMI)</td>
<td>http://dx.doi.org/10.13039/100000011</td>
<td>https://ror.org/006w34k90</td>
<td>us</td>
<td>http://www.hhmi.org/</td>
<td>24.0</td>
</tr>
<tr>
<td>149</td>
<td>Arts and Humanities Research Council (AHRC)</td>
<td>http://dx.doi.org/10.13039/501100000267</td>
<td>https://ror.org/0505m1554</td>
<td>gb</td>
<td>http://www.ahrc.ac.uk/Pages/Home.aspx</td>
<td>698.0</td>
</tr>
<tr>
<td>150</td>
<td>Austrian Science Fund (FWF)</td>
<td>http://dx.doi.org/10.13039/501100002428</td>
<td>https://ror.org/013tf3c58</td>
<td>at</td>
<td>http://www.fwf.ac.at/en/</td>
<td>13.0</td>
</tr>
<tr>
<td>153</td>
<td>Breast Cancer Now</td>
<td>http://dx.doi.org/10.13039/501100007913</td>
<td>https://ror.org/02qa92s63</td>
<td>gb</td>
<td>http://breastcancernow.org/</td>
<td>1065.0</td>
</tr>
<tr>
<td>156</td>
<td>Engineering and Physical Sciences Research Cou...</td>
<td>http://dx.doi.org/10.13039/501100000266</td>
<td>https://ror.org/0439y7842</td>
<td>gb</td>
<td>http://www.epsrc.ac.uk/Pages/default.aspx</td>
<td>722.0</td>
</tr>
<tr>
<td>159</td>
<td>Natural Environment Research Council (NERC)</td>
<td>http://dx.doi.org/10.13039/501100000270</td>
<td>https://ror.org/02b5d8509</td>
<td>gb</td>
<td>https://nerc.ukri.org/</td>
<td>726.0</td>
</tr>
<tr>
<td>162</td>
<td>Science and Technology Facilities Council (STFC)</td>
<td>http://dx.doi.org/10.13039/501100000271</td>
<td>https://ror.org/057g20z61</td>
<td>gb</td>
<td>http://www.stfc.ac.uk/</td>
<td>716.0</td>
</tr>
<tr>
<td>164</td>
<td>Vetenskapsrådet</td>
<td>http://dx.doi.org/10.13039/501100004359</td>
<td>https://ror.org/03zttf063</td>
<td>se</td>
<td>http://www.vr.se/</td>
<td>302.0</td>
</tr>
<tr>
<td>165</td>
<td>World Health Organization (WHO)</td>
<td>http://dx.doi.org/10.13039/100004423</td>
<td>https://ror.org/01f80g185</td>
<td>ch</td>
<td>http://www.who.int/</td>
<td>903.0</td>
</tr>
<tr>
<td>166</td>
<td>World Bank</td>
<td>http://dx.doi.org/10.13039/100004421</td>
<td>https://ror.org/00ae7jd04</td>
<td>us</td>
<td>http://www.worldbank.org/</td>
<td>525.0</td>
</tr>
<tr>
<td>167</td>
<td>Yorkshire Cancer Research</td>
<td>http://dx.doi.org/10.13039/501100002653</td>
<td>https://ror.org/02cddnn97</td>
<td>gb</td>
<td>http://www.yorkshirecancerresearch.org.uk/</td>
<td>428.0</td>
</tr>
<tr>
<td>169</td>
<td>Economic and Social Research Council (ESRC)</td>
<td>http://dx.doi.org/10.13039/501100000269</td>
<td>https://ror.org/03n0ht308</td>
<td>gb</td>
<td>http://www.esrc.ac.uk/</td>
<td>717.0</td>
</tr>
<tr>
<td>418</td>
<td>Higher Education Funding Council for England (...</td>
<td>http://dx.doi.org/10.13039/501100000384</td>
<td>https://ror.org/02wxr8x18</td>
<td>gb</td>
<td>http://www.hefce.ac.uk/</td>
<td>877.0</td>
</tr>
<tr>
<td>419</td>
<td>Higher Education Funding Council for Wales (HE...</td>
<td>http://dx.doi.org/10.13039/501100000383</td>
<td>https://ror.org/056y81r79</td>
<td>gb</td>
<td>http://www.hefcw.ac.uk/home/home.aspx</td>
<td>881.0</td>
</tr>
<tr>
<td>420</td>
<td>Scottish Funding Council (SFC)</td>
<td>http://dx.doi.org/10.13039/501100000360</td>
<td>https://ror.org/056bwcz71</td>
<td>gb</td>
<td>http://www.sfc.ac.uk/</td>
<td>887.0</td>
</tr>
<tr>
<td>421</td>
<td>Department for the Economy, Northern Ireland</td>
<td>http://dx.doi.org/10.13039/100008303</td>
<td>https://ror.org/05w9mt194</td>
<td>gb</td>
<td>https://www.economy-ni.gov.uk/</td>
<td>884.0</td>
</tr>
<tr>
<td>960</td>
<td>Academy of Finland</td>
<td>http://dx.doi.org/10.13039/501100002341</td>
<td>https://ror.org/05k73zm37</td>
<td>fi</td>
<td>https://www.aka.fi/en/</td>
<td>1248.0</td>
</tr>
<tr>
<td>961</td>
<td>Agence Nationale de la Recherche (ANR)</td>
<td>http://dx.doi.org/10.13039/501100001665</td>
<td>https://ror.org/00rbzpz17</td>
<td>fr</td>
<td>http://www.agence-nationale-recherche.fr/</td>
<td>30.0</td>
</tr>
<tr>
<td>963</td>
<td>Fundação para a Ciência e a Tecnologia</td>
<td>http://dx.doi.org/10.13039/501100001871</td>
<td>https://ror.org/00snfqn58</td>
<td>pt</td>
<td>http://www.fct.pt/</td>
<td>1109.0</td>
</tr>
<tr>
<td>964</td>
<td>Formas</td>
<td>http://dx.doi.org/10.13039/501100001862</td>
<td>https://ror.org/03pjs1y45</td>
<td>se</td>
<td>http://www.formas.se/</td>
<td>452.0</td>
</tr>
<tr>
<td>967</td>
<td>Nederlandse Organisatie voor Wetenschappelijk ...</td>
<td>http://dx.doi.org/10.13039/501100003246</td>
<td>https://ror.org/04jsz6e67</td>
<td>nl</td>
<td>http://www.nwo.nl/</td>
<td>459.0</td>
</tr>
<tr>
<td>968</td>
<td>Science Foundation Ireland (SFI)</td>
<td>http://dx.doi.org/10.13039/501100001602</td>
<td>https://ror.org/0271asj38</td>
<td>ie</td>
<td>http://www.sfi.ie/</td>
<td>210.0</td>
</tr>
<tr>
<td>970</td>
<td>Research Council of Norway</td>
<td>http://dx.doi.org/10.13039/501100005416</td>
<td>https://ror.org/00epmv149</td>
<td>no</td>
<td>https://www.forskningsradet.no/en/</td>
<td>266.0</td>
</tr>
<tr>
<td>971</td>
<td>Forskningsrådet för hälsa, arbetsliv och välfä...</td>
<td>http://dx.doi.org/10.13039/501100006636</td>
<td>https://ror.org/02d290r06</td>
<td>se</td>
<td>http://www.forte.se/</td>
<td>455.0</td>
</tr>
<tr>
<td>978</td>
<td>Innovate UK</td>
<td>http://dx.doi.org/10.13039/501100000266</td>
<td>https://ror.org/05ar5fy68</td>
<td>gb</td>
<td>https://www.gov.uk/government/organisations/in...</td>
<td>1267.0</td>
</tr>
<tr>
<td>1048</td>
<td>Diabetes UK</td>
<td>http://dx.doi.org/10.13039/501100000361</td>
<td>https://ror.org/050rgn017</td>
<td>gb</td>
<td>http://www.diabetes.org.uk/</td>
<td>492.0</td>
</tr>
<tr>
<td>1052</td>
<td>Marie Curie</td>
<td>http://dx.doi.org/10.13039/501100000654</td>
<td>https://ror.org/02aqv1x10</td>
<td>gb</td>
<td>http://www.mariecurie.org.uk/</td>
<td>595.0</td>
</tr>
<tr>
<td>1055</td>
<td>Action on Hearing Loss</td>
<td>http://dx.doi.org/10.13039/501100000703</td>
<td>https://ror.org/05w6qh410</td>
<td>gb</td>
<td>http://www.actiononhearingloss.org.uk/</td>
<td>412.0</td>
</tr>
<tr>
<td>1056</td>
<td>Alzheimer's Society</td>
<td>http://dx.doi.org/10.13039/501100000320</td>
<td>https://ror.org/0472gwq90</td>
<td>gb</td>
<td>http://alzheimers.org.uk/</td>
<td>443.0</td>
</tr>
<tr>
<td>1063</td>
<td>Multiple Sclerosis Society</td>
<td>http://dx.doi.org/10.13039/501100000381</td>
<td>https://ror.org/043fwdk81</td>
<td>gb</td>
<td>http://www.mssociety.org.uk/</td>
<td>745.0</td>
</tr>
<tr>
<td>1064</td>
<td>Myrovlytis Trust</td>
<td>http://dx.doi.org/10.13039/501100001291</td>
<td>https://ror.org/05bj02613</td>
<td>gb</td>
<td>http://www.myrovlytistrust.org/</td>
<td>858.0</td>
</tr>
<tr>
<td>1065</td>
<td>National Centre for the Replacement, Refinemen...</td>
<td>http://dx.doi.org/10.13039/501100000849</td>
<td>https://ror.org/02w0kg036</td>
<td>gb</td>
<td>http://www.nc3rs.org.uk/</td>
<td>859.0</td>
</tr>
<tr>
<td>1072</td>
<td>Worldwide Cancer Reseach</td>
<td>http://dx.doi.org/10.13039/100004423</td>
<td>https://ror.org/031tfbz57</td>
<td>gb</td>
<td>http://www.worldwidecancerresearch.org/</td>
<td>425.0</td>
</tr>
<tr>
<td>2219</td>
<td>Canadian Institutes of Health Research (CIHR)</td>
<td>http://dx.doi.org/10.13039/501100000024</td>
<td>https://ror.org/01gavpb45</td>
<td>ca</td>
<td>http://www.cihr-irsc.gc.ca/</td>
<td>28.0</td>
</tr>
<tr>
<td>5490</td>
<td>US Department of Energy (DOE)</td>
<td>http://dx.doi.org/10.13039/100000015</td>
<td>https://ror.org/01bj3aw27</td>
<td>us</td>
<td>http://energy.gov/</td>
<td>962.0</td>
</tr>
<tr>
<td>5491</td>
<td>Agency for Healthcare Research and Quality (AHRQ)</td>
<td>http://dx.doi.org/10.13039/100000133</td>
<td>https://ror.org/03jmfdf59</td>
<td>us</td>
<td>http://www.ahrq.gov/index.html</td>
<td>981.0</td>
</tr>
<tr>
<td>5492</td>
<td>Institute of Education Sciences (IES)</td>
<td>http://dx.doi.org/10.13039/100005246</td>
<td>https://ror.org/04et59085</td>
<td>us</td>
<td>http://ies.ed.gov/</td>
<td>291.0</td>
</tr>
<tr>
<td>5493</td>
<td>National Aeronautics and Space Administration ...</td>
<td>http://dx.doi.org/10.13039/100000104</td>
<td>https://ror.org/027ka1x80</td>
<td>us</td>
<td>http://science.nasa.gov/</td>
<td>986.0</td>
</tr>
<tr>
<td>5494</td>
<td>National Science Foundation (NSF)</td>
<td>http://dx.doi.org/10.13039/100000001</td>
<td>https://ror.org/021nxhr62</td>
<td>us</td>
<td>http://www.nsf.gov/</td>
<td>354.0</td>
</tr>
<tr>
<td>7232</td>
<td>Academy of Medical Science</td>
<td>http://dx.doi.org/10.13039/501100000691</td>
<td>https://ror.org/00c489v88</td>
<td>gb</td>
<td>https://acmedsci.ac.uk/</td>
<td>1125.0</td>
</tr>
<tr>
<td>7239</td>
<td>Prostate Cancer UK</td>
<td>http://dx.doi.org/10.13039/501100000771</td>
<td>https://ror.org/04dkv6329</td>
<td>gb</td>
<td>http://prostatecanceruk.org/</td>
<td>742.0</td>
</tr>
<tr>
<td>7240</td>
<td>Schweizerischer Nationalfonds zur Förderung de...</td>
<td>http://dx.doi.org/10.13039/501100001711</td>
<td>https://ror.org/00yjd3n13</td>
<td>ch</td>
<td>http://www.snf.ch/de/Seiten/default.aspx</td>
<td>25.0</td>
</tr>
</tbody>
</table>
</div>
```python
# lien avec les pays
country = pd.read_csv('sample/country.tsv', encoding='utf-8', header=0, sep='\t')
country
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>name</th>
<th>iso_code</th>
<th>id</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>Afghanistan</td>
<td>AF</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>Albania</td>
<td>AL</td>
<td>2</td>
</tr>
<tr>
<td>2</td>
<td>Algeria</td>
<td>DZ</td>
<td>3</td>
</tr>
<tr>
<td>3</td>
<td>American Samoa</td>
<td>AS</td>
<td>4</td>
</tr>
<tr>
<td>4</td>
<td>Andorra</td>
<td>AD</td>
<td>5</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>246</td>
<td>Zambia</td>
<td>ZM</td>
<td>247</td>
</tr>
<tr>
<td>247</td>
<td>Zimbabwe</td>
<td>ZW</td>
<td>248</td>
</tr>
<tr>
<td>248</td>
<td>Åland Islands</td>
<td>AX</td>
<td>249</td>
</tr>
<tr>
<td>249</td>
<td>International Agency</td>
<td>OI</td>
<td>250</td>
</tr>
<tr>
<td>250</td>
<td>UNKNOWN</td>
<td>__</td>
<td>999999</td>
</tr>
</tbody>
</table>
<p>251 rows × 3 columns</p>
</div>
```python
# merge avec les pays
organization_funders['iso_code'] = organization_funders['iso_code'].str.upper()
organization_funders['is_funder'] = 1
organization_funders = pd.merge(organization_funders, country[['iso_code', 'id']], how='left', on='iso_code')
organization_funders
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>name</th>
<th>fundref</th>
<th>ror</th>
<th>iso_code</th>
<th>website</th>
<th>sherpa_id</th>
<th>is_funder</th>
<th>id</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>National Institutes of Health (NIH)</td>
<td>http://dx.doi.org/10.13039/100000002</td>
<td>https://ror.org/01cwqze88</td>
<td>US</td>
<td>http://www.nih.gov/</td>
<td>9.0</td>
<td>1</td>
<td>236</td>
</tr>
<tr>
<td>1</td>
<td>Wellcome Trust</td>
<td>http://dx.doi.org/10.13039/100004440</td>
<td>https://ror.org/029chgv08</td>
<td>GB</td>
<td>http://www.wellcome.ac.uk/</td>
<td>695.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>2</td>
<td>British Heart Foundation (BHF)</td>
<td>http://dx.doi.org/10.13039/501100000274</td>
<td>https://ror.org/02wdwnk04</td>
<td>GB</td>
<td>http://www.bhf.org.uk/</td>
<td>18.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>3</td>
<td>Versus Arthritis</td>
<td>http://dx.doi.org/10.13039/501100000341</td>
<td>https://ror.org/02jkpm469</td>
<td>GB</td>
<td>https://www.versusarthritis.org/</td>
<td>14.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>4</td>
<td>Biotechnology and Biological Sciences Research...</td>
<td>http://dx.doi.org/10.13039/501100000268</td>
<td>https://ror.org/00cwqg982</td>
<td>GB</td>
<td>http://www.bbsrc.ac.uk/home/home.aspx</td>
<td>709.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>5</td>
<td>Blood Cancer UK</td>
<td>http://dx.doi.org/10.13039/501100007903</td>
<td>https://ror.org/0055acf80</td>
<td>GB</td>
<td>https://bloodcancer.org.uk/</td>
<td>925.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>6</td>
<td>Bill &amp; Melinda Gates Foundation</td>
<td>http://dx.doi.org/10.13039/100000865</td>
<td>https://ror.org/0456r8d26</td>
<td>US</td>
<td>http://www.gatesfoundation.org/</td>
<td>961.0</td>
<td>1</td>
<td>236</td>
</tr>
<tr>
<td>7</td>
<td>Cancer Research UK</td>
<td>http://dx.doi.org/10.13039/501100000289</td>
<td>https://ror.org/054225q67</td>
<td>GB</td>
<td>http://www.cancerresearchuk.org/</td>
<td>19.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>8</td>
<td>Chief Scientist Office, Scottish Executive (CSO)</td>
<td>http://dx.doi.org/10.13039/501100000589</td>
<td>https://ror.org/01613vh25</td>
<td>GB</td>
<td>http://www.cso.scot.nhs.uk/</td>
<td>16.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>9</td>
<td>Department of Health (DH)</td>
<td>http://dx.doi.org/10.13039/501100000272</td>
<td>https://ror.org/0187kwz08</td>
<td>GB</td>
<td>http://www.dh.gov.uk/en/index.htm</td>
<td>943.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>10</td>
<td>Dunhill Medical Trust (DMT)</td>
<td>http://dx.doi.org/10.13039/501100000377</td>
<td>https://ror.org/05ayqqv15</td>
<td>GB</td>
<td>https://dunhillmedical.org.uk/</td>
<td>410.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>11</td>
<td>European Research Council (ERC)</td>
<td>http://dx.doi.org/10.13039/501100000781</td>
<td>https://ror.org/0472cxd90</td>
<td>BE</td>
<td>http://erc.europa.eu/</td>
<td>31.0</td>
<td>1</td>
<td>21</td>
</tr>
<tr>
<td>12</td>
<td>Medical Research Council (MRC)</td>
<td>http://dx.doi.org/10.13039/501100000265</td>
<td>https://ror.org/03x94j517</td>
<td>GB</td>
<td>http://www.mrc.ac.uk/index.htm</td>
<td>705.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>13</td>
<td>Motor Neuron Disease Association (MND Associat...</td>
<td>http://dx.doi.org/10.13039/501100000406</td>
<td>https://ror.org/02gq0fg61</td>
<td>GB</td>
<td>http://www.mndassociation.org/</td>
<td>562.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>14</td>
<td>Parkinson's UK</td>
<td>http://dx.doi.org/10.13039/501100000304</td>
<td>https://ror.org/02417p338</td>
<td>GB</td>
<td>http://www.parkinsons.org.uk/</td>
<td>411.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>15</td>
<td>Telethon Foundation</td>
<td>http://dx.doi.org/10.13039/501100002426</td>
<td>https://ror.org/04xraxn18</td>
<td>IT</td>
<td>https://www.telethon.it/en/</td>
<td>325.0</td>
<td>1</td>
<td>110</td>
</tr>
<tr>
<td>16</td>
<td>Howard Hughes Medical Institute (HHMI)</td>
<td>http://dx.doi.org/10.13039/100000011</td>
<td>https://ror.org/006w34k90</td>
<td>US</td>
<td>http://www.hhmi.org/</td>
<td>24.0</td>
<td>1</td>
<td>236</td>
</tr>
<tr>
<td>17</td>
<td>Arts and Humanities Research Council (AHRC)</td>
<td>http://dx.doi.org/10.13039/501100000267</td>
<td>https://ror.org/0505m1554</td>
<td>GB</td>
<td>http://www.ahrc.ac.uk/Pages/Home.aspx</td>
<td>698.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>18</td>
<td>Austrian Science Fund (FWF)</td>
<td>http://dx.doi.org/10.13039/501100002428</td>
<td>https://ror.org/013tf3c58</td>
<td>AT</td>
<td>http://www.fwf.ac.at/en/</td>
<td>13.0</td>
<td>1</td>
<td>14</td>
</tr>
<tr>
<td>19</td>
<td>Breast Cancer Now</td>
<td>http://dx.doi.org/10.13039/501100007913</td>
<td>https://ror.org/02qa92s63</td>
<td>GB</td>
<td>http://breastcancernow.org/</td>
<td>1065.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>20</td>
<td>Engineering and Physical Sciences Research Cou...</td>
<td>http://dx.doi.org/10.13039/501100000266</td>
<td>https://ror.org/0439y7842</td>
<td>GB</td>
<td>http://www.epsrc.ac.uk/Pages/default.aspx</td>
<td>722.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>21</td>
<td>Natural Environment Research Council (NERC)</td>
<td>http://dx.doi.org/10.13039/501100000270</td>
<td>https://ror.org/02b5d8509</td>
<td>GB</td>
<td>https://nerc.ukri.org/</td>
<td>726.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>22</td>
<td>Science and Technology Facilities Council (STFC)</td>
<td>http://dx.doi.org/10.13039/501100000271</td>
<td>https://ror.org/057g20z61</td>
<td>GB</td>
<td>http://www.stfc.ac.uk/</td>
<td>716.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>23</td>
<td>Vetenskapsrådet</td>
<td>http://dx.doi.org/10.13039/501100004359</td>
<td>https://ror.org/03zttf063</td>
<td>SE</td>
<td>http://www.vr.se/</td>
<td>302.0</td>
<td>1</td>
<td>214</td>
</tr>
<tr>
<td>24</td>
<td>World Health Organization (WHO)</td>
<td>http://dx.doi.org/10.13039/100004423</td>
<td>https://ror.org/01f80g185</td>
<td>CH</td>
<td>http://www.who.int/</td>
<td>903.0</td>
<td>1</td>
<td>215</td>
</tr>
<tr>
<td>25</td>
<td>World Bank</td>
<td>http://dx.doi.org/10.13039/100004421</td>
<td>https://ror.org/00ae7jd04</td>
<td>US</td>
<td>http://www.worldbank.org/</td>
<td>525.0</td>
<td>1</td>
<td>236</td>
</tr>
<tr>
<td>26</td>
<td>Yorkshire Cancer Research</td>
<td>http://dx.doi.org/10.13039/501100002653</td>
<td>https://ror.org/02cddnn97</td>
<td>GB</td>
<td>http://www.yorkshirecancerresearch.org.uk/</td>
<td>428.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>27</td>
<td>Economic and Social Research Council (ESRC)</td>
<td>http://dx.doi.org/10.13039/501100000269</td>
<td>https://ror.org/03n0ht308</td>
<td>GB</td>
<td>http://www.esrc.ac.uk/</td>
<td>717.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>28</td>
<td>Higher Education Funding Council for England (...</td>
<td>http://dx.doi.org/10.13039/501100000384</td>
<td>https://ror.org/02wxr8x18</td>
<td>GB</td>
<td>http://www.hefce.ac.uk/</td>
<td>877.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>29</td>
<td>Higher Education Funding Council for Wales (HE...</td>
<td>http://dx.doi.org/10.13039/501100000383</td>
<td>https://ror.org/056y81r79</td>
<td>GB</td>
<td>http://www.hefcw.ac.uk/home/home.aspx</td>
<td>881.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>30</td>
<td>Scottish Funding Council (SFC)</td>
<td>http://dx.doi.org/10.13039/501100000360</td>
<td>https://ror.org/056bwcz71</td>
<td>GB</td>
<td>http://www.sfc.ac.uk/</td>
<td>887.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>31</td>
<td>Department for the Economy, Northern Ireland</td>
<td>http://dx.doi.org/10.13039/100008303</td>
<td>https://ror.org/05w9mt194</td>
<td>GB</td>
<td>https://www.economy-ni.gov.uk/</td>
<td>884.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>32</td>
<td>Academy of Finland</td>
<td>http://dx.doi.org/10.13039/501100002341</td>
<td>https://ror.org/05k73zm37</td>
<td>FI</td>
<td>https://www.aka.fi/en/</td>
<td>1248.0</td>
<td>1</td>
<td>75</td>
</tr>
<tr>
<td>33</td>
<td>Agence Nationale de la Recherche (ANR)</td>
<td>http://dx.doi.org/10.13039/501100001665</td>
<td>https://ror.org/00rbzpz17</td>
<td>FR</td>
<td>http://www.agence-nationale-recherche.fr/</td>
<td>30.0</td>
<td>1</td>
<td>76</td>
</tr>
<tr>
<td>34</td>
<td>Fundação para a Ciência e a Tecnologia</td>
<td>http://dx.doi.org/10.13039/501100001871</td>
<td>https://ror.org/00snfqn58</td>
<td>PT</td>
<td>http://www.fct.pt/</td>
<td>1109.0</td>
<td>1</td>
<td>178</td>
</tr>
<tr>
<td>35</td>
<td>Formas</td>
<td>http://dx.doi.org/10.13039/501100001862</td>
<td>https://ror.org/03pjs1y45</td>
<td>SE</td>
<td>http://www.formas.se/</td>
<td>452.0</td>
<td>1</td>
<td>214</td>
</tr>
<tr>
<td>36</td>
<td>Nederlandse Organisatie voor Wetenschappelijk ...</td>
<td>http://dx.doi.org/10.13039/501100003246</td>
<td>https://ror.org/04jsz6e67</td>
<td>NL</td>
<td>http://www.nwo.nl/</td>
<td>459.0</td>
<td>1</td>
<td>156</td>
</tr>
<tr>
<td>37</td>
<td>Science Foundation Ireland (SFI)</td>
<td>http://dx.doi.org/10.13039/501100001602</td>
<td>https://ror.org/0271asj38</td>
<td>IE</td>
<td>http://www.sfi.ie/</td>
<td>210.0</td>
<td>1</td>
<td>107</td>
</tr>
<tr>
<td>38</td>
<td>Research Council of Norway</td>
<td>http://dx.doi.org/10.13039/501100005416</td>
<td>https://ror.org/00epmv149</td>
<td>NO</td>
<td>https://www.forskningsradet.no/en/</td>
<td>266.0</td>
<td>1</td>
<td>166</td>
</tr>
<tr>
<td>39</td>
<td>Forskningsrådet för hälsa, arbetsliv och välfä...</td>
<td>http://dx.doi.org/10.13039/501100006636</td>
<td>https://ror.org/02d290r06</td>
<td>SE</td>
<td>http://www.forte.se/</td>
<td>455.0</td>
<td>1</td>
<td>214</td>
</tr>
<tr>
<td>40</td>
<td>Innovate UK</td>
<td>http://dx.doi.org/10.13039/501100000266</td>
<td>https://ror.org/05ar5fy68</td>
<td>GB</td>
<td>https://www.gov.uk/government/organisations/in...</td>
<td>1267.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>41</td>
<td>Diabetes UK</td>
<td>http://dx.doi.org/10.13039/501100000361</td>
<td>https://ror.org/050rgn017</td>
<td>GB</td>
<td>http://www.diabetes.org.uk/</td>
<td>492.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>42</td>
<td>Marie Curie</td>
<td>http://dx.doi.org/10.13039/501100000654</td>
<td>https://ror.org/02aqv1x10</td>
<td>GB</td>
<td>http://www.mariecurie.org.uk/</td>
<td>595.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>43</td>
<td>Action on Hearing Loss</td>
<td>http://dx.doi.org/10.13039/501100000703</td>
<td>https://ror.org/05w6qh410</td>
<td>GB</td>
<td>http://www.actiononhearingloss.org.uk/</td>
<td>412.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>44</td>
<td>Alzheimer's Society</td>
<td>http://dx.doi.org/10.13039/501100000320</td>
<td>https://ror.org/0472gwq90</td>
<td>GB</td>
<td>http://alzheimers.org.uk/</td>
<td>443.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>45</td>
<td>Multiple Sclerosis Society</td>
<td>http://dx.doi.org/10.13039/501100000381</td>
<td>https://ror.org/043fwdk81</td>
<td>GB</td>
<td>http://www.mssociety.org.uk/</td>
<td>745.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>46</td>
<td>Myrovlytis Trust</td>
<td>http://dx.doi.org/10.13039/501100001291</td>
<td>https://ror.org/05bj02613</td>
<td>GB</td>
<td>http://www.myrovlytistrust.org/</td>
<td>858.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>47</td>
<td>National Centre for the Replacement, Refinemen...</td>
<td>http://dx.doi.org/10.13039/501100000849</td>
<td>https://ror.org/02w0kg036</td>
<td>GB</td>
<td>http://www.nc3rs.org.uk/</td>
<td>859.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>48</td>
<td>Worldwide Cancer Reseach</td>
<td>http://dx.doi.org/10.13039/100004423</td>
<td>https://ror.org/031tfbz57</td>
<td>GB</td>
<td>http://www.worldwidecancerresearch.org/</td>
<td>425.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>49</td>
<td>Canadian Institutes of Health Research (CIHR)</td>
<td>http://dx.doi.org/10.13039/501100000024</td>
<td>https://ror.org/01gavpb45</td>
<td>CA</td>
<td>http://www.cihr-irsc.gc.ca/</td>
<td>28.0</td>
<td>1</td>
<td>40</td>
</tr>
<tr>
<td>50</td>
<td>US Department of Energy (DOE)</td>
<td>http://dx.doi.org/10.13039/100000015</td>
<td>https://ror.org/01bj3aw27</td>
<td>US</td>
<td>http://energy.gov/</td>
<td>962.0</td>
<td>1</td>
<td>236</td>
</tr>
<tr>
<td>51</td>
<td>Agency for Healthcare Research and Quality (AHRQ)</td>
<td>http://dx.doi.org/10.13039/100000133</td>
<td>https://ror.org/03jmfdf59</td>
<td>US</td>
<td>http://www.ahrq.gov/index.html</td>
<td>981.0</td>
<td>1</td>
<td>236</td>
</tr>
<tr>
<td>52</td>
<td>Institute of Education Sciences (IES)</td>
<td>http://dx.doi.org/10.13039/100005246</td>
<td>https://ror.org/04et59085</td>
<td>US</td>
<td>http://ies.ed.gov/</td>
<td>291.0</td>
<td>1</td>
<td>236</td>
</tr>
<tr>
<td>53</td>
<td>National Aeronautics and Space Administration ...</td>
<td>http://dx.doi.org/10.13039/100000104</td>
<td>https://ror.org/027ka1x80</td>
<td>US</td>
<td>http://science.nasa.gov/</td>
<td>986.0</td>
<td>1</td>
<td>236</td>
</tr>
<tr>
<td>54</td>
<td>National Science Foundation (NSF)</td>
<td>http://dx.doi.org/10.13039/100000001</td>
<td>https://ror.org/021nxhr62</td>
<td>US</td>
<td>http://www.nsf.gov/</td>
<td>354.0</td>
<td>1</td>
<td>236</td>
</tr>
<tr>
<td>55</td>
<td>Academy of Medical Science</td>
<td>http://dx.doi.org/10.13039/501100000691</td>
<td>https://ror.org/00c489v88</td>
<td>GB</td>
<td>https://acmedsci.ac.uk/</td>
<td>1125.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>56</td>
<td>Prostate Cancer UK</td>
<td>http://dx.doi.org/10.13039/501100000771</td>
<td>https://ror.org/04dkv6329</td>
<td>GB</td>
<td>http://prostatecanceruk.org/</td>
<td>742.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>57</td>
<td>Schweizerischer Nationalfonds zur Förderung de...</td>
<td>http://dx.doi.org/10.13039/501100001711</td>
<td>https://ror.org/00yjd3n13</td>
<td>CH</td>
<td>http://www.snf.ch/de/Seiten/default.aspx</td>
<td>25.0</td>
<td>1</td>
<td>215</td>
</tr>
</tbody>
</table>
</div>
```python
organization_funders = organization_funders.rename(columns = {'id' : 'country'})
organization_funders
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>name</th>
<th>fundref</th>
<th>ror</th>
<th>iso_code</th>
<th>website</th>
<th>sherpa_id</th>
<th>is_funder</th>
<th>country</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>National Institutes of Health (NIH)</td>
<td>http://dx.doi.org/10.13039/100000002</td>
<td>https://ror.org/01cwqze88</td>
<td>US</td>
<td>http://www.nih.gov/</td>
<td>9.0</td>
<td>1</td>
<td>236</td>
</tr>
<tr>
<td>1</td>
<td>Wellcome Trust</td>
<td>http://dx.doi.org/10.13039/100004440</td>
<td>https://ror.org/029chgv08</td>
<td>GB</td>
<td>http://www.wellcome.ac.uk/</td>
<td>695.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>2</td>
<td>British Heart Foundation (BHF)</td>
<td>http://dx.doi.org/10.13039/501100000274</td>
<td>https://ror.org/02wdwnk04</td>
<td>GB</td>
<td>http://www.bhf.org.uk/</td>
<td>18.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>3</td>
<td>Versus Arthritis</td>
<td>http://dx.doi.org/10.13039/501100000341</td>
<td>https://ror.org/02jkpm469</td>
<td>GB</td>
<td>https://www.versusarthritis.org/</td>
<td>14.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>4</td>
<td>Biotechnology and Biological Sciences Research...</td>
<td>http://dx.doi.org/10.13039/501100000268</td>
<td>https://ror.org/00cwqg982</td>
<td>GB</td>
<td>http://www.bbsrc.ac.uk/home/home.aspx</td>
<td>709.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>5</td>
<td>Blood Cancer UK</td>
<td>http://dx.doi.org/10.13039/501100007903</td>
<td>https://ror.org/0055acf80</td>
<td>GB</td>
<td>https://bloodcancer.org.uk/</td>
<td>925.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>6</td>
<td>Bill &amp; Melinda Gates Foundation</td>
<td>http://dx.doi.org/10.13039/100000865</td>
<td>https://ror.org/0456r8d26</td>
<td>US</td>
<td>http://www.gatesfoundation.org/</td>
<td>961.0</td>
<td>1</td>
<td>236</td>
</tr>
<tr>
<td>7</td>
<td>Cancer Research UK</td>
<td>http://dx.doi.org/10.13039/501100000289</td>
<td>https://ror.org/054225q67</td>
<td>GB</td>
<td>http://www.cancerresearchuk.org/</td>
<td>19.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>8</td>
<td>Chief Scientist Office, Scottish Executive (CSO)</td>
<td>http://dx.doi.org/10.13039/501100000589</td>
<td>https://ror.org/01613vh25</td>
<td>GB</td>
<td>http://www.cso.scot.nhs.uk/</td>
<td>16.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>9</td>
<td>Department of Health (DH)</td>
<td>http://dx.doi.org/10.13039/501100000272</td>
<td>https://ror.org/0187kwz08</td>
<td>GB</td>
<td>http://www.dh.gov.uk/en/index.htm</td>
<td>943.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>10</td>
<td>Dunhill Medical Trust (DMT)</td>
<td>http://dx.doi.org/10.13039/501100000377</td>
<td>https://ror.org/05ayqqv15</td>
<td>GB</td>
<td>https://dunhillmedical.org.uk/</td>
<td>410.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>11</td>
<td>European Research Council (ERC)</td>
<td>http://dx.doi.org/10.13039/501100000781</td>
<td>https://ror.org/0472cxd90</td>
<td>BE</td>
<td>http://erc.europa.eu/</td>
<td>31.0</td>
<td>1</td>
<td>21</td>
</tr>
<tr>
<td>12</td>
<td>Medical Research Council (MRC)</td>
<td>http://dx.doi.org/10.13039/501100000265</td>
<td>https://ror.org/03x94j517</td>
<td>GB</td>
<td>http://www.mrc.ac.uk/index.htm</td>
<td>705.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>13</td>
<td>Motor Neuron Disease Association (MND Associat...</td>
<td>http://dx.doi.org/10.13039/501100000406</td>
<td>https://ror.org/02gq0fg61</td>
<td>GB</td>
<td>http://www.mndassociation.org/</td>
<td>562.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>14</td>
<td>Parkinson's UK</td>
<td>http://dx.doi.org/10.13039/501100000304</td>
<td>https://ror.org/02417p338</td>
<td>GB</td>
<td>http://www.parkinsons.org.uk/</td>
<td>411.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>15</td>
<td>Telethon Foundation</td>
<td>http://dx.doi.org/10.13039/501100002426</td>
<td>https://ror.org/04xraxn18</td>
<td>IT</td>
<td>https://www.telethon.it/en/</td>
<td>325.0</td>
<td>1</td>
<td>110</td>
</tr>
<tr>
<td>16</td>
<td>Howard Hughes Medical Institute (HHMI)</td>
<td>http://dx.doi.org/10.13039/100000011</td>
<td>https://ror.org/006w34k90</td>
<td>US</td>
<td>http://www.hhmi.org/</td>
<td>24.0</td>
<td>1</td>
<td>236</td>
</tr>
<tr>
<td>17</td>
<td>Arts and Humanities Research Council (AHRC)</td>
<td>http://dx.doi.org/10.13039/501100000267</td>
<td>https://ror.org/0505m1554</td>
<td>GB</td>
<td>http://www.ahrc.ac.uk/Pages/Home.aspx</td>
<td>698.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>18</td>
<td>Austrian Science Fund (FWF)</td>
<td>http://dx.doi.org/10.13039/501100002428</td>
<td>https://ror.org/013tf3c58</td>
<td>AT</td>
<td>http://www.fwf.ac.at/en/</td>
<td>13.0</td>
<td>1</td>
<td>14</td>
</tr>
<tr>
<td>19</td>
<td>Breast Cancer Now</td>
<td>http://dx.doi.org/10.13039/501100007913</td>
<td>https://ror.org/02qa92s63</td>
<td>GB</td>
<td>http://breastcancernow.org/</td>
<td>1065.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>20</td>
<td>Engineering and Physical Sciences Research Cou...</td>
<td>http://dx.doi.org/10.13039/501100000266</td>
<td>https://ror.org/0439y7842</td>
<td>GB</td>
<td>http://www.epsrc.ac.uk/Pages/default.aspx</td>
<td>722.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>21</td>
<td>Natural Environment Research Council (NERC)</td>
<td>http://dx.doi.org/10.13039/501100000270</td>
<td>https://ror.org/02b5d8509</td>
<td>GB</td>
<td>https://nerc.ukri.org/</td>
<td>726.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>22</td>
<td>Science and Technology Facilities Council (STFC)</td>
<td>http://dx.doi.org/10.13039/501100000271</td>
<td>https://ror.org/057g20z61</td>
<td>GB</td>
<td>http://www.stfc.ac.uk/</td>
<td>716.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>23</td>
<td>Vetenskapsrådet</td>
<td>http://dx.doi.org/10.13039/501100004359</td>
<td>https://ror.org/03zttf063</td>
<td>SE</td>
<td>http://www.vr.se/</td>
<td>302.0</td>
<td>1</td>
<td>214</td>
</tr>
<tr>
<td>24</td>
<td>World Health Organization (WHO)</td>
<td>http://dx.doi.org/10.13039/100004423</td>
<td>https://ror.org/01f80g185</td>
<td>CH</td>
<td>http://www.who.int/</td>
<td>903.0</td>
<td>1</td>
<td>215</td>
</tr>
<tr>
<td>25</td>
<td>World Bank</td>
<td>http://dx.doi.org/10.13039/100004421</td>
<td>https://ror.org/00ae7jd04</td>
<td>US</td>
<td>http://www.worldbank.org/</td>
<td>525.0</td>
<td>1</td>
<td>236</td>
</tr>
<tr>
<td>26</td>
<td>Yorkshire Cancer Research</td>
<td>http://dx.doi.org/10.13039/501100002653</td>
<td>https://ror.org/02cddnn97</td>
<td>GB</td>
<td>http://www.yorkshirecancerresearch.org.uk/</td>
<td>428.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>27</td>
<td>Economic and Social Research Council (ESRC)</td>
<td>http://dx.doi.org/10.13039/501100000269</td>
<td>https://ror.org/03n0ht308</td>
<td>GB</td>
<td>http://www.esrc.ac.uk/</td>
<td>717.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>28</td>
<td>Higher Education Funding Council for England (...</td>
<td>http://dx.doi.org/10.13039/501100000384</td>
<td>https://ror.org/02wxr8x18</td>
<td>GB</td>
<td>http://www.hefce.ac.uk/</td>
<td>877.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>29</td>
<td>Higher Education Funding Council for Wales (HE...</td>
<td>http://dx.doi.org/10.13039/501100000383</td>
<td>https://ror.org/056y81r79</td>
<td>GB</td>
<td>http://www.hefcw.ac.uk/home/home.aspx</td>
<td>881.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>30</td>
<td>Scottish Funding Council (SFC)</td>
<td>http://dx.doi.org/10.13039/501100000360</td>
<td>https://ror.org/056bwcz71</td>
<td>GB</td>
<td>http://www.sfc.ac.uk/</td>
<td>887.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>31</td>
<td>Department for the Economy, Northern Ireland</td>
<td>http://dx.doi.org/10.13039/100008303</td>
<td>https://ror.org/05w9mt194</td>
<td>GB</td>
<td>https://www.economy-ni.gov.uk/</td>
<td>884.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>32</td>
<td>Academy of Finland</td>
<td>http://dx.doi.org/10.13039/501100002341</td>
<td>https://ror.org/05k73zm37</td>
<td>FI</td>
<td>https://www.aka.fi/en/</td>
<td>1248.0</td>
<td>1</td>
<td>75</td>
</tr>
<tr>
<td>33</td>
<td>Agence Nationale de la Recherche (ANR)</td>
<td>http://dx.doi.org/10.13039/501100001665</td>
<td>https://ror.org/00rbzpz17</td>
<td>FR</td>
<td>http://www.agence-nationale-recherche.fr/</td>
<td>30.0</td>
<td>1</td>
<td>76</td>
</tr>
<tr>
<td>34</td>
<td>Fundação para a Ciência e a Tecnologia</td>
<td>http://dx.doi.org/10.13039/501100001871</td>
<td>https://ror.org/00snfqn58</td>
<td>PT</td>
<td>http://www.fct.pt/</td>
<td>1109.0</td>
<td>1</td>
<td>178</td>
</tr>
<tr>
<td>35</td>
<td>Formas</td>
<td>http://dx.doi.org/10.13039/501100001862</td>
<td>https://ror.org/03pjs1y45</td>
<td>SE</td>
<td>http://www.formas.se/</td>
<td>452.0</td>
<td>1</td>
<td>214</td>
</tr>
<tr>
<td>36</td>
<td>Nederlandse Organisatie voor Wetenschappelijk ...</td>
<td>http://dx.doi.org/10.13039/501100003246</td>
<td>https://ror.org/04jsz6e67</td>
<td>NL</td>
<td>http://www.nwo.nl/</td>
<td>459.0</td>
<td>1</td>
<td>156</td>
</tr>
<tr>
<td>37</td>
<td>Science Foundation Ireland (SFI)</td>
<td>http://dx.doi.org/10.13039/501100001602</td>
<td>https://ror.org/0271asj38</td>
<td>IE</td>
<td>http://www.sfi.ie/</td>
<td>210.0</td>
<td>1</td>
<td>107</td>
</tr>
<tr>
<td>38</td>
<td>Research Council of Norway</td>
<td>http://dx.doi.org/10.13039/501100005416</td>
<td>https://ror.org/00epmv149</td>
<td>NO</td>
<td>https://www.forskningsradet.no/en/</td>
<td>266.0</td>
<td>1</td>
<td>166</td>
</tr>
<tr>
<td>39</td>
<td>Forskningsrådet för hälsa, arbetsliv och välfä...</td>
<td>http://dx.doi.org/10.13039/501100006636</td>
<td>https://ror.org/02d290r06</td>
<td>SE</td>
<td>http://www.forte.se/</td>
<td>455.0</td>
<td>1</td>
<td>214</td>
</tr>
<tr>
<td>40</td>
<td>Innovate UK</td>
<td>http://dx.doi.org/10.13039/501100000266</td>
<td>https://ror.org/05ar5fy68</td>
<td>GB</td>
<td>https://www.gov.uk/government/organisations/in...</td>
<td>1267.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>41</td>
<td>Diabetes UK</td>
<td>http://dx.doi.org/10.13039/501100000361</td>
<td>https://ror.org/050rgn017</td>
<td>GB</td>
<td>http://www.diabetes.org.uk/</td>
<td>492.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>42</td>
<td>Marie Curie</td>
<td>http://dx.doi.org/10.13039/501100000654</td>
<td>https://ror.org/02aqv1x10</td>
<td>GB</td>
<td>http://www.mariecurie.org.uk/</td>
<td>595.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>43</td>
<td>Action on Hearing Loss</td>
<td>http://dx.doi.org/10.13039/501100000703</td>
<td>https://ror.org/05w6qh410</td>
<td>GB</td>
<td>http://www.actiononhearingloss.org.uk/</td>
<td>412.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>44</td>
<td>Alzheimer's Society</td>
<td>http://dx.doi.org/10.13039/501100000320</td>
<td>https://ror.org/0472gwq90</td>
<td>GB</td>
<td>http://alzheimers.org.uk/</td>
<td>443.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>45</td>
<td>Multiple Sclerosis Society</td>
<td>http://dx.doi.org/10.13039/501100000381</td>
<td>https://ror.org/043fwdk81</td>
<td>GB</td>
<td>http://www.mssociety.org.uk/</td>
<td>745.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>46</td>
<td>Myrovlytis Trust</td>
<td>http://dx.doi.org/10.13039/501100001291</td>
<td>https://ror.org/05bj02613</td>
<td>GB</td>
<td>http://www.myrovlytistrust.org/</td>
<td>858.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>47</td>
<td>National Centre for the Replacement, Refinemen...</td>
<td>http://dx.doi.org/10.13039/501100000849</td>
<td>https://ror.org/02w0kg036</td>
<td>GB</td>
<td>http://www.nc3rs.org.uk/</td>
<td>859.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>48</td>
<td>Worldwide Cancer Reseach</td>
<td>http://dx.doi.org/10.13039/100004423</td>
<td>https://ror.org/031tfbz57</td>
<td>GB</td>
<td>http://www.worldwidecancerresearch.org/</td>
<td>425.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>49</td>
<td>Canadian Institutes of Health Research (CIHR)</td>
<td>http://dx.doi.org/10.13039/501100000024</td>
<td>https://ror.org/01gavpb45</td>
<td>CA</td>
<td>http://www.cihr-irsc.gc.ca/</td>
<td>28.0</td>
<td>1</td>
<td>40</td>
</tr>
<tr>
<td>50</td>
<td>US Department of Energy (DOE)</td>
<td>http://dx.doi.org/10.13039/100000015</td>
<td>https://ror.org/01bj3aw27</td>
<td>US</td>
<td>http://energy.gov/</td>
<td>962.0</td>
<td>1</td>
<td>236</td>
</tr>
<tr>
<td>51</td>
<td>Agency for Healthcare Research and Quality (AHRQ)</td>
<td>http://dx.doi.org/10.13039/100000133</td>
<td>https://ror.org/03jmfdf59</td>
<td>US</td>
<td>http://www.ahrq.gov/index.html</td>
<td>981.0</td>
<td>1</td>
<td>236</td>
</tr>
<tr>
<td>52</td>
<td>Institute of Education Sciences (IES)</td>
<td>http://dx.doi.org/10.13039/100005246</td>
<td>https://ror.org/04et59085</td>
<td>US</td>
<td>http://ies.ed.gov/</td>
<td>291.0</td>
<td>1</td>
<td>236</td>
</tr>
<tr>
<td>53</td>
<td>National Aeronautics and Space Administration ...</td>
<td>http://dx.doi.org/10.13039/100000104</td>
<td>https://ror.org/027ka1x80</td>
<td>US</td>
<td>http://science.nasa.gov/</td>
<td>986.0</td>
<td>1</td>
<td>236</td>
</tr>
<tr>
<td>54</td>
<td>National Science Foundation (NSF)</td>
<td>http://dx.doi.org/10.13039/100000001</td>
<td>https://ror.org/021nxhr62</td>
<td>US</td>
<td>http://www.nsf.gov/</td>
<td>354.0</td>
<td>1</td>
<td>236</td>
</tr>
<tr>
<td>55</td>
<td>Academy of Medical Science</td>
<td>http://dx.doi.org/10.13039/501100000691</td>
<td>https://ror.org/00c489v88</td>
<td>GB</td>
<td>https://acmedsci.ac.uk/</td>
<td>1125.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>56</td>
<td>Prostate Cancer UK</td>
<td>http://dx.doi.org/10.13039/501100000771</td>
<td>https://ror.org/04dkv6329</td>
<td>GB</td>
<td>http://prostatecanceruk.org/</td>
<td>742.0</td>
<td>1</td>
<td>234</td>
</tr>
<tr>
<td>57</td>
<td>Schweizerischer Nationalfonds zur Förderung de...</td>
<td>http://dx.doi.org/10.13039/501100001711</td>
<td>https://ror.org/00yjd3n13</td>
<td>CH</td>
<td>http://www.snf.ch/de/Seiten/default.aspx</td>
<td>25.0</td>
<td>1</td>
<td>215</td>
</tr>
</tbody>
</table>
</div>
```python
# ajout des organizations suisses
organization = pd.read_csv('ror/ror_ch_hei_export.tsv', encoding='utf-8', header=0, sep='\t', dtype={'fundref': str, 'orgref': str}, na_filter=False)
organization
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>ror</th>
<th>name</th>
<th>label_en</th>
<th>label_fr</th>
<th>label_de</th>
<th>label_it</th>
<th>website</th>
<th>country</th>
<th>starting_year</th>
<th>is_funder</th>
<th>acronym</th>
<th>aliases</th>
<th>isni</th>
<th>fundref</th>
<th>orgref</th>
<th>wikidata</th>
<th>grid</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>https://ror.org/032ymzc07</td>
<td>University of Applied Sciences of the Grisons</td>
<td></td>
<td></td>
<td>Fachhochschule Graubünden</td>
<td></td>
<td>https://www.fhgr.ch/en/</td>
<td>215</td>
<td>1963</td>
<td>0</td>
<td></td>
<td>Hochschule für Technik und Wirtschaft Chur</td>
<td>0000 0000 8718 2812</td>
<td></td>
<td></td>
<td>Q1622220</td>
<td>grid.460104.7</td>
</tr>
<tr>
<td>1</td>
<td>https://ror.org/04mq2g308</td>
<td>University of Applied Sciences and Arts Northw...</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.fhnw.ch/homepage</td>
<td>215</td>
<td>2006</td>
<td>0</td>
<td>FHNW</td>
<td>Fachhochschule Nordwestschweiz</td>
<td>0000 0001 1497 8091</td>
<td></td>
<td></td>
<td></td>
<td>grid.410380.e</td>
</tr>
<tr>
<td>2</td>
<td>https://ror.org/0210tb741</td>
<td>Forschungsinstitut für biologischen Landbau (F...</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>https://www.fibl.org/en/germany/location-de.html</td>
<td>215</td>
<td></td>
<td>0</td>
<td>FiBL</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>grid.506220.3</td>
</tr>
<tr>
<td>3</td>
<td>https://ror.org/007ygn379</td>
<td>Graduate Institute of International and Develo...</td>
<td></td>
<td>Institut de Hautes études Internationales et d...</td>
<td>Hochschulinstitut für internationale Studien u...</td>
<td></td>
<td>http://graduateinstitute.ch/home.html</td>
<td>215</td>
<td>1927</td>
<td>0</td>
<td>IHEID</td>
<td>Graduate Institute Geneva</td>
<td>0000 0001 2296 9873</td>
<td></td>
<td>14744053</td>
<td>Q691686</td>
<td>grid.424404.2</td>
</tr>
<tr>
<td>4</td>
<td>https://ror.org/01xkakk17</td>
<td>University of Applied Sciences and Arts Wester...</td>
<td></td>
<td>Haute École Spécialisée de Suisse Occidentale</td>
<td>Fachhochschule Westschweiz</td>
<td></td>
<td>http://www.hes-so.ch/en/homepage-hes-so-1679.html</td>
<td>215</td>
<td>1998</td>
<td>0</td>
<td>HES-SO</td>
<td></td>
<td>0000 0001 0943 1999</td>
<td></td>
<td>10128956</td>
<td>Q168003</td>
<td>grid.5681.a</td>
</tr>
<tr>
<td>5</td>
<td>https://ror.org/015pmkr43</td>
<td>Haute École Pédagogique BEJUNE (HEP BEJUNE)</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.hep-bejune.ch/</td>
<td>215</td>
<td>2001</td>
<td>0</td>
<td>HEP BEJUNE</td>
<td></td>
<td>0000 0001 0658 3479</td>
<td></td>
<td></td>
<td></td>
<td>grid.469449.2</td>
</tr>
<tr>
<td>6</td>
<td>https://ror.org/048gre751</td>
<td>Haute École Pédagogique Fribourg (HEP-PH FR)</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>https://www.hepfr.ch/</td>
<td>215</td>
<td>1990</td>
<td>0</td>
<td>HEP-PH FR</td>
<td></td>
<td>0000 0001 0266 4909</td>
<td></td>
<td></td>
<td></td>
<td>grid.469451.b</td>
</tr>
<tr>
<td>7</td>
<td>https://ror.org/01bvm0h13</td>
<td>Haute École Pédagogique du Canton de Vaud (HEP...</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.hepl.ch/cms/accueil.html</td>
<td>215</td>
<td>2001</td>
<td>0</td>
<td>HEP Vaud</td>
<td></td>
<td>0000 0004 0613 4050</td>
<td></td>
<td></td>
<td></td>
<td>grid.466224.0</td>
</tr>
<tr>
<td>8</td>
<td>https://ror.org/02ejkey04</td>
<td>Zurich University of Applied Sciences in Busin...</td>
<td></td>
<td></td>
<td>Hochschule für Wirtschaft Zürich</td>
<td></td>
<td>http://www.fh-hwz.ch/en</td>
<td>215</td>
<td>1986</td>
<td>0</td>
<td>HWZ</td>
<td></td>
<td>0000 0001 0008 3713</td>
<td></td>
<td>30805829</td>
<td>Q1488771</td>
<td>grid.449909.9</td>
</tr>
<tr>
<td>9</td>
<td>https://ror.org/04nd0xd48</td>
<td>Lucerne University of Applied Sciences and Arts</td>
<td></td>
<td>Haute École de lucerne</td>
<td>Hochschule Luzern</td>
<td></td>
<td>https://www.hslu.ch/en/</td>
<td>215</td>
<td>1997</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0001 2191 8943</td>
<td></td>
<td>19480920</td>
<td>Q664028</td>
<td>grid.425064.1</td>
</tr>
<tr>
<td>10</td>
<td>https://ror.org/00w9q2c06</td>
<td>University of Applied Sciences of Special Need...</td>
<td></td>
<td></td>
<td>Interkantonale Hochschule für Heilpädagogik</td>
<td></td>
<td>http://www.hfh.ch/en/</td>
<td>215</td>
<td>1924</td>
<td>0</td>
<td>HfH</td>
<td>Zurich Training College for Teachers of Specia...</td>
<td>0000 0001 0710 6332</td>
<td></td>
<td></td>
<td></td>
<td>grid.466279.8</td>
</tr>
<tr>
<td>11</td>
<td>https://ror.org/049c2kr37</td>
<td>Kalaidos University of Applied Sciences (Kalai...</td>
<td></td>
<td></td>
<td>Kalaidos Fachhochschule</td>
<td></td>
<td>https://www.kalaidos-fh.ch/de-CH</td>
<td>215</td>
<td>1995</td>
<td>0</td>
<td>Kalaidos UAS</td>
<td></td>
<td>0000 0004 0453 9054</td>
<td></td>
<td>6746630</td>
<td>Q681372</td>
<td>grid.449532.d</td>
</tr>
<tr>
<td>12</td>
<td>https://ror.org/021f7p178</td>
<td>Lib4RI - Library for the Research Institutes w...</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.lib4ri.ch/</td>
<td>215</td>
<td>2011</td>
<td>0</td>
<td></td>
<td>Lib4RI</td>
<td>0000 0004 0624 8541</td>
<td></td>
<td></td>
<td>Q1278450</td>
<td>grid.458352.d</td>
</tr>
<tr>
<td>13</td>
<td>https://ror.org/00p9jf779</td>
<td>Medicines for Malaria Venture (MMV)</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.mmv.org/</td>
<td>215</td>
<td>1999</td>
<td>0</td>
<td>MMV</td>
<td></td>
<td>0000 0004 0432 5267</td>
<td>501100004167</td>
<td></td>
<td>Q6806774</td>
<td>grid.452605.0</td>
</tr>
<tr>
<td>14</td>
<td>https://ror.org/038mj2660</td>
<td>Ostschweizer Fachhochschule OST</td>
<td>Eastern Switzerland University of Applied Scie...</td>
<td></td>
<td></td>
<td></td>
<td>https://www.ost.ch/</td>
<td>215</td>
<td>1999</td>
<td>0</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>grid.510272.3</td>
</tr>
<tr>
<td>15</td>
<td>https://ror.org/01awgk221</td>
<td>Zurich University of Teacher Education (PHZH)</td>
<td></td>
<td></td>
<td>Pädagogische Hochschule Zürich</td>
<td></td>
<td>https://phzh.ch/en/</td>
<td>215</td>
<td>2002</td>
<td>0</td>
<td>PHZH</td>
<td>PH Zürich</td>
<td>0000 0000 9666 1858</td>
<td></td>
<td></td>
<td></td>
<td>grid.483054.e</td>
</tr>
<tr>
<td>16</td>
<td>https://ror.org/05jf1ma54</td>
<td>Pädagogische Hochschule Bern</td>
<td>Bern University of Teacher Education</td>
<td></td>
<td></td>
<td></td>
<td>https://www.phbern.ch</td>
<td>215</td>
<td>2005</td>
<td>0</td>
<td></td>
<td>PHBern</td>
<td>0000 0000 8585 5665</td>
<td></td>
<td></td>
<td></td>
<td>grid.454333.6</td>
</tr>
<tr>
<td>17</td>
<td>https://ror.org/02fjgft97</td>
<td>Pädagogische Hochschule Graubünden (PHGR)</td>
<td></td>
<td></td>
<td></td>
<td>Alta scuola pedagogica dei Grigioni</td>
<td>http://www.phgr.ch/</td>
<td>215</td>
<td></td>
<td>0</td>
<td>PHGR</td>
<td></td>
<td>0000 0000 9317 283X</td>
<td></td>
<td></td>
<td></td>
<td>grid.469478.0</td>
</tr>
<tr>
<td>18</td>
<td>https://ror.org/0235ynq74</td>
<td>University of Teacher Education Lucerne</td>
<td></td>
<td></td>
<td>Pädagogische Hochschule Luzern</td>
<td></td>
<td>http://www.phlu.ch/ute-lucerne/</td>
<td>215</td>
<td>2003</td>
<td>0</td>
<td></td>
<td>PH Luzern</td>
<td>0000 0001 0348 1637</td>
<td></td>
<td></td>
<td></td>
<td>grid.465965.d</td>
</tr>
<tr>
<td>19</td>
<td>https://ror.org/03fs41j10</td>
<td>Pädagogische Hochschule Schaffhausen (PHSH)</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.phsh.ch/</td>
<td>215</td>
<td>2003</td>
<td>0</td>
<td>PHSH</td>
<td></td>
<td>0000 0004 0450 7546</td>
<td></td>
<td></td>
<td></td>
<td>grid.466133.5</td>
</tr>
<tr>
<td>20</td>
<td>https://ror.org/00rqdn375</td>
<td>Schwyz University of Teacher Education (PHSZ)</td>
<td></td>
<td></td>
<td>Pädagogische Hochschule Schwyz</td>
<td></td>
<td>https://www.phsz.ch/en/</td>
<td>215</td>
<td></td>
<td>0</td>
<td>PHSZ</td>
<td>PHZ Schwyz</td>
<td>0000 0004 0613 7454</td>
<td></td>
<td></td>
<td></td>
<td>grid.466169.a</td>
</tr>
<tr>
<td>21</td>
<td>https://ror.org/05m37v666</td>
<td>St.Gallen University of Teacher Education (PHSG)</td>
<td></td>
<td></td>
<td>Pädagogische Hochschule St. Gallen</td>
<td></td>
<td>https://www.phsg.ch/en</td>
<td>215</td>
<td>2007</td>
<td>0</td>
<td>PHSG</td>
<td></td>
<td>0000 0001 0271 5139</td>
<td></td>
<td></td>
<td>Q1768652</td>
<td>grid.466208.e</td>
</tr>
<tr>
<td>22</td>
<td>https://ror.org/04bf6dq94</td>
<td>Pädagogische Hochschule Thurgau (PHTG)</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.phtg.ch/home/</td>
<td>215</td>
<td>2003</td>
<td>0</td>
<td>PHTG</td>
<td></td>
<td>0000 0004 0613 3824</td>
<td></td>
<td></td>
<td></td>
<td>grid.466322.7</td>
</tr>
<tr>
<td>23</td>
<td>https://ror.org/05a28rw58</td>
<td>ETH Zurich (ETH Zurich)</td>
<td></td>
<td>École Polytechnique Fédérale de Zurich</td>
<td>Eidgenössische Technische Hochschule Zürich</td>
<td>Politecnico federale di Zurigo</td>
<td>https://www.ethz.ch/en.html</td>
<td>215</td>
<td>1855</td>
<td>0</td>
<td>ETH Zurich</td>
<td>Swiss Federal Institute of Technology in Zuric...</td>
<td>0000 0001 2156 2780</td>
<td>501100003006</td>
<td>210910</td>
<td>Q11942</td>
<td>grid.5801.c</td>
</tr>
<tr>
<td>24</td>
<td>https://ror.org/02s376052</td>
<td>École Polytechnique Fédérale de Lausanne (EPFL)</td>
<td>Swiss Federal Institute of Technology in Lausanne</td>
<td></td>
<td></td>
<td></td>
<td>http://www.epfl.ch/index.en.html</td>
<td>215</td>
<td>1853</td>
<td>0</td>
<td>EPFL</td>
<td></td>
<td>0000000121839049</td>
<td>501100001703</td>
<td>71968</td>
<td>Q262760</td>
<td>grid.5333.6</td>
</tr>
<tr>
<td>25</td>
<td>https://ror.org/00zg4za48</td>
<td>Swiss Federal Institute for Vocational Educati...</td>
<td></td>
<td>Institut Fédéral des Hautes Études en Formatio...</td>
<td>Eidgenössisches Hochschulinstitut für Berufsbi...</td>
<td></td>
<td>http://www.ehb-schweiz.ch/en/</td>
<td>215</td>
<td>2007</td>
<td>0</td>
<td>SFIVET</td>
<td></td>
<td>0000 0001 2285 5681</td>
<td></td>
<td></td>
<td>Q1302632</td>
<td>grid.466173.1</td>
</tr>
<tr>
<td>26</td>
<td>https://ror.org/01ggx4157</td>
<td>European Organization for Nuclear Research (CERN)</td>
<td></td>
<td>Organisation européenne pour la recherche nucl...</td>
<td>Europäische Organisation für Kernforschung</td>
<td></td>
<td>http://home.web.cern.ch/</td>
<td>215</td>
<td>1954</td>
<td>0</td>
<td>CERN</td>
<td></td>
<td>0000 0001 2156 142X</td>
<td></td>
<td>37351</td>
<td>Q42944</td>
<td>grid.9132.9</td>
</tr>
<tr>
<td>27</td>
<td>https://ror.org/02bnkt322</td>
<td>Bern University of Applied Sciences (BFH)</td>
<td></td>
<td>Haute école spécialisée bernoise</td>
<td>Berner Fachhochschule</td>
<td></td>
<td>http://www.bfh.ch/en/home.html</td>
<td>215</td>
<td>1997</td>
<td>0</td>
<td>BFH</td>
<td></td>
<td>0000 0001 0688 6779</td>
<td>501100006259</td>
<td>4365265</td>
<td>Q466455</td>
<td>grid.424060.4</td>
</tr>
<tr>
<td>28</td>
<td>https://ror.org/04d8ztx87</td>
<td>Agroscope</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>https://www.agroscope.admin.ch/agroscope/en/ho...</td>
<td>215</td>
<td>1850</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0004 4681 910X</td>
<td></td>
<td></td>
<td>Q397466</td>
<td>grid.417771.3</td>
</tr>
<tr>
<td>29</td>
<td>https://ror.org/02crff812</td>
<td>University of Zurich (UZH)</td>
<td></td>
<td>Université de zurich</td>
<td>Universität Zürich</td>
<td>Università di Zurigo</td>
<td>http://www.uzh.ch/index_en.html</td>
<td>215</td>
<td>1833</td>
<td>0</td>
<td>UZH</td>
<td></td>
<td>0000 0004 1937 0650</td>
<td>501100006447</td>
<td>314803</td>
<td>Q206702</td>
<td>grid.7400.3</td>
</tr>
<tr>
<td>30</td>
<td>https://ror.org/022fs9h90</td>
<td>University of Fribourg</td>
<td></td>
<td>Université de Fribourg</td>
<td>Universität Freiburg</td>
<td>Università di Friburgo</td>
<td>http://www.unifr.ch/home/welcomeE.php</td>
<td>215</td>
<td>1889</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0004 0478 1713</td>
<td>501100005869</td>
<td>535267</td>
<td>Q36188</td>
<td>grid.8534.a</td>
</tr>
<tr>
<td>31</td>
<td>https://ror.org/01swzsf04</td>
<td>University of Geneva (UNIGE)</td>
<td></td>
<td>Université de Genève</td>
<td></td>
<td>Università di Ginevra</td>
<td>https://www.unige.ch/</td>
<td>215</td>
<td>1559</td>
<td>0</td>
<td>UNIGE</td>
<td>Schola Genevensis</td>
<td>0000 0001 2322 4988</td>
<td>501100006389</td>
<td>342348</td>
<td>Q503473</td>
<td>grid.8591.5</td>
</tr>
<tr>
<td>32</td>
<td>https://ror.org/019whta54</td>
<td>University of Lausanne (UNIL)</td>
<td></td>
<td>Université de Lausanne</td>
<td>Universität Lausanne</td>
<td>Università di Losanna</td>
<td>http://www.unil.ch/central/en/home.html</td>
<td>215</td>
<td>1537</td>
<td>0</td>
<td>UNIL</td>
<td>Schola Lausannensis</td>
<td>0000 0001 2165 4204</td>
<td>501100006390</td>
<td>79810</td>
<td>Q658975</td>
<td>grid.9851.5</td>
</tr>
<tr>
<td>33</td>
<td>https://ror.org/00vasag41</td>
<td>University of Neuchâtel</td>
<td></td>
<td>Université de neuchâtel</td>
<td>Universität Neuenburg</td>
<td></td>
<td>http://www2.unine.ch/</td>
<td>215</td>
<td>1838</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0001 2297 7718</td>
<td>501100005353</td>
<td>3662101</td>
<td>Q541548</td>
<td>grid.10711.36</td>
</tr>
<tr>
<td>34</td>
<td>https://ror.org/05r0ap620</td>
<td>Zurich University of the Arts</td>
<td></td>
<td>Haute École d'Art de Zurich</td>
<td>Zürcher Hochschule der Künste</td>
<td></td>
<td>https://www.zhdk.ch/</td>
<td>215</td>
<td>2007</td>
<td>0</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>39250592</td>
<td>Q222450</td>
<td>grid.449912.3</td>
</tr>
<tr>
<td>35</td>
<td>https://ror.org/05pmsvm27</td>
<td>Zurich University of Applied Sciences (ZHAW)</td>
<td></td>
<td></td>
<td>Zürcher Hochschule für Angewandte Wissenschaften</td>
<td></td>
<td>https://www.zhaw.ch/en/university/</td>
<td>215</td>
<td>2007</td>
<td>0</td>
<td>ZHAW</td>
<td></td>
<td>0000000122291644</td>
<td></td>
<td>30930550</td>
<td>Q2605554</td>
<td>grid.19739.35</td>
</tr>
<tr>
<td>36</td>
<td>https://ror.org/05ghhx264</td>
<td>University of Teacher Education Zug (PH Zug)</td>
<td></td>
<td></td>
<td>Pädagogische Hochschule Zug</td>
<td></td>
<td>https://www.zg.ch/behoerden/direktion-fur-bild...</td>
<td>215</td>
<td>2013</td>
<td>0</td>
<td>PH Zug</td>
<td></td>
<td>0000 0004 0449 2225</td>
<td></td>
<td></td>
<td></td>
<td>grid.466274.5</td>
</tr>
<tr>
<td>37</td>
<td>https://ror.org/03mcsbr76</td>
<td>Swiss Ornithological Institute</td>
<td></td>
<td></td>
<td>Schweizerische Vogelwarte</td>
<td></td>
<td>http://www.vogelwarte.ch/de/home/</td>
<td>215</td>
<td>1924</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0001 1512 3677</td>
<td></td>
<td></td>
<td>Q663638</td>
<td>grid.419767.a</td>
</tr>
<tr>
<td>38</td>
<td>https://ror.org/05ep8g269</td>
<td>University of Applied Sciences and Arts of Sou...</td>
<td></td>
<td></td>
<td></td>
<td>Scuola Universitaria Professionale della Svizz...</td>
<td>http://www.supsi.ch/home_en.html</td>
<td>215</td>
<td>1997</td>
<td>0</td>
<td>SUPSI</td>
<td></td>
<td>0000000123252233</td>
<td></td>
<td>34066841</td>
<td>Q663984</td>
<td>grid.16058.3a</td>
</tr>
<tr>
<td>39</td>
<td>https://ror.org/03c4atk17</td>
<td>Universita della Svizzera Italiana (USI)</td>
<td>University of Italian Switzerland</td>
<td>Université de la suisse italienne</td>
<td></td>
<td>Università della Svizzera italiana</td>
<td>http://www.usi.ch/en/index.htm</td>
<td>215</td>
<td>1996</td>
<td>0</td>
<td>USI</td>
<td></td>
<td>0000 0001 2203 2861</td>
<td></td>
<td>2290642</td>
<td>Q689617</td>
<td>grid.29078.34</td>
</tr>
<tr>
<td>40</td>
<td>https://ror.org/02s6k3f65</td>
<td>University of Basel</td>
<td></td>
<td>Université de bâle</td>
<td>Universität Basel</td>
<td>Università di Basilea</td>
<td>https://www.unibas.ch/de</td>
<td>215</td>
<td>1460</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0004 1937 0642</td>
<td>100008375</td>
<td>427614</td>
<td>Q372608</td>
<td>grid.6612.3</td>
</tr>
<tr>
<td>41</td>
<td>https://ror.org/02k7v4d05</td>
<td>University of Bern (UB)</td>
<td></td>
<td>Université de Berne</td>
<td>Universität Bern</td>
<td>Università di Berna</td>
<td>http://www.unibe.ch/eng/</td>
<td>215</td>
<td>1834</td>
<td>0</td>
<td>UB</td>
<td></td>
<td>0000 0001 0726 5157</td>
<td>100009068</td>
<td>1157515</td>
<td>Q659080</td>
<td>grid.5734.5</td>
</tr>
<tr>
<td>42</td>
<td>https://ror.org/01qjrx392</td>
<td>University of Liechtenstein</td>
<td></td>
<td></td>
<td>Universität Liechtenstein</td>
<td></td>
<td>https://www.uni.li/study/de/</td>
<td>128</td>
<td>1961</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0001 2227 4668</td>
<td></td>
<td>10554064</td>
<td>Q974328</td>
<td>grid.445905.9</td>
</tr>
<tr>
<td>43</td>
<td>https://ror.org/00kgrkn83</td>
<td>University of Lucerne (UNILU)</td>
<td></td>
<td>Université de lucerne</td>
<td>Universität Luzern</td>
<td>Università di Lucerna</td>
<td>http://www.unilu.ch/</td>
<td>215</td>
<td>2000</td>
<td>0</td>
<td>UNILU</td>
<td></td>
<td>0000 0001 1456 7938</td>
<td></td>
<td>21004764</td>
<td>Q673308</td>
<td>grid.449852.6</td>
</tr>
<tr>
<td>44</td>
<td>https://ror.org/0561a3s31</td>
<td>University of St. Gallen (HSG)</td>
<td></td>
<td>Université de saint-gall</td>
<td>Universität St. Gallen</td>
<td>Università di San Gallo</td>
<td>http://www.es.unisg.ch/en/</td>
<td>215</td>
<td>1898</td>
<td>0</td>
<td>HSG</td>
<td></td>
<td>0000 0001 2156 6618</td>
<td>100009572</td>
<td>751473</td>
<td>Q673354</td>
<td>grid.15775.31</td>
</tr>
<tr>
<td>45</td>
<td>https://ror.org/040gs8e06</td>
<td>Pädagogische Hochschule Wallis (PH-VS)</td>
<td></td>
<td>Haute École Pédagogique du Valais</td>
<td></td>
<td></td>
<td>http://www.hepvs.ch/de</td>
<td>215</td>
<td>2000</td>
<td>0</td>
<td>PH-VS</td>
<td></td>
<td>0000 0001 2178 3217</td>
<td></td>
<td></td>
<td></td>
<td>grid.466216.1</td>
</tr>
</tbody>
</table>
</div>
```python
# tri par nom
organization = organization.sort_values(by='name')
organization
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>ror</th>
<th>name</th>
<th>label_en</th>
<th>label_fr</th>
<th>label_de</th>
<th>label_it</th>
<th>website</th>
<th>country</th>
<th>starting_year</th>
<th>is_funder</th>
<th>acronym</th>
<th>aliases</th>
<th>isni</th>
<th>fundref</th>
<th>orgref</th>
<th>wikidata</th>
<th>grid</th>
</tr>
</thead>
<tbody>
<tr>
<td>28</td>
<td>https://ror.org/04d8ztx87</td>
<td>Agroscope</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>https://www.agroscope.admin.ch/agroscope/en/ho...</td>
<td>215</td>
<td>1850</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0004 4681 910X</td>
<td></td>
<td></td>
<td>Q397466</td>
<td>grid.417771.3</td>
</tr>
<tr>
<td>27</td>
<td>https://ror.org/02bnkt322</td>
<td>Bern University of Applied Sciences (BFH)</td>
<td></td>
<td>Haute école spécialisée bernoise</td>
<td>Berner Fachhochschule</td>
<td></td>
<td>http://www.bfh.ch/en/home.html</td>
<td>215</td>
<td>1997</td>
<td>0</td>
<td>BFH</td>
<td></td>
<td>0000 0001 0688 6779</td>
<td>501100006259</td>
<td>4365265</td>
<td>Q466455</td>
<td>grid.424060.4</td>
</tr>
<tr>
<td>23</td>
<td>https://ror.org/05a28rw58</td>
<td>ETH Zurich (ETH Zurich)</td>
<td></td>
<td>École Polytechnique Fédérale de Zurich</td>
<td>Eidgenössische Technische Hochschule Zürich</td>
<td>Politecnico federale di Zurigo</td>
<td>https://www.ethz.ch/en.html</td>
<td>215</td>
<td>1855</td>
<td>0</td>
<td>ETH Zurich</td>
<td>Swiss Federal Institute of Technology in Zuric...</td>
<td>0000 0001 2156 2780</td>
<td>501100003006</td>
<td>210910</td>
<td>Q11942</td>
<td>grid.5801.c</td>
</tr>
<tr>
<td>26</td>
<td>https://ror.org/01ggx4157</td>
<td>European Organization for Nuclear Research (CERN)</td>
<td></td>
<td>Organisation européenne pour la recherche nucl...</td>
<td>Europäische Organisation für Kernforschung</td>
<td></td>
<td>http://home.web.cern.ch/</td>
<td>215</td>
<td>1954</td>
<td>0</td>
<td>CERN</td>
<td></td>
<td>0000 0001 2156 142X</td>
<td></td>
<td>37351</td>
<td>Q42944</td>
<td>grid.9132.9</td>
</tr>
<tr>
<td>2</td>
<td>https://ror.org/0210tb741</td>
<td>Forschungsinstitut für biologischen Landbau (F...</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>https://www.fibl.org/en/germany/location-de.html</td>
<td>215</td>
<td></td>
<td>0</td>
<td>FiBL</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>grid.506220.3</td>
</tr>
<tr>
<td>3</td>
<td>https://ror.org/007ygn379</td>
<td>Graduate Institute of International and Develo...</td>
<td></td>
<td>Institut de Hautes études Internationales et d...</td>
<td>Hochschulinstitut für internationale Studien u...</td>
<td></td>
<td>http://graduateinstitute.ch/home.html</td>
<td>215</td>
<td>1927</td>
<td>0</td>
<td>IHEID</td>
<td>Graduate Institute Geneva</td>
<td>0000 0001 2296 9873</td>
<td></td>
<td>14744053</td>
<td>Q691686</td>
<td>grid.424404.2</td>
</tr>
<tr>
<td>5</td>
<td>https://ror.org/015pmkr43</td>
<td>Haute École Pédagogique BEJUNE (HEP BEJUNE)</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.hep-bejune.ch/</td>
<td>215</td>
<td>2001</td>
<td>0</td>
<td>HEP BEJUNE</td>
<td></td>
<td>0000 0001 0658 3479</td>
<td></td>
<td></td>
<td></td>
<td>grid.469449.2</td>
</tr>
<tr>
<td>6</td>
<td>https://ror.org/048gre751</td>
<td>Haute École Pédagogique Fribourg (HEP-PH FR)</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>https://www.hepfr.ch/</td>
<td>215</td>
<td>1990</td>
<td>0</td>
<td>HEP-PH FR</td>
<td></td>
<td>0000 0001 0266 4909</td>
<td></td>
<td></td>
<td></td>
<td>grid.469451.b</td>
</tr>
<tr>
<td>7</td>
<td>https://ror.org/01bvm0h13</td>
<td>Haute École Pédagogique du Canton de Vaud (HEP...</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.hepl.ch/cms/accueil.html</td>
<td>215</td>
<td>2001</td>
<td>0</td>
<td>HEP Vaud</td>
<td></td>
<td>0000 0004 0613 4050</td>
<td></td>
<td></td>
<td></td>
<td>grid.466224.0</td>
</tr>
<tr>
<td>11</td>
<td>https://ror.org/049c2kr37</td>
<td>Kalaidos University of Applied Sciences (Kalai...</td>
<td></td>
<td></td>
<td>Kalaidos Fachhochschule</td>
<td></td>
<td>https://www.kalaidos-fh.ch/de-CH</td>
<td>215</td>
<td>1995</td>
<td>0</td>
<td>Kalaidos UAS</td>
<td></td>
<td>0000 0004 0453 9054</td>
<td></td>
<td>6746630</td>
<td>Q681372</td>
<td>grid.449532.d</td>
</tr>
<tr>
<td>12</td>
<td>https://ror.org/021f7p178</td>
<td>Lib4RI - Library for the Research Institutes w...</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.lib4ri.ch/</td>
<td>215</td>
<td>2011</td>
<td>0</td>
<td></td>
<td>Lib4RI</td>
<td>0000 0004 0624 8541</td>
<td></td>
<td></td>
<td>Q1278450</td>
<td>grid.458352.d</td>
</tr>
<tr>
<td>9</td>
<td>https://ror.org/04nd0xd48</td>
<td>Lucerne University of Applied Sciences and Arts</td>
<td></td>
<td>Haute École de lucerne</td>
<td>Hochschule Luzern</td>
<td></td>
<td>https://www.hslu.ch/en/</td>
<td>215</td>
<td>1997</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0001 2191 8943</td>
<td></td>
<td>19480920</td>
<td>Q664028</td>
<td>grid.425064.1</td>
</tr>
<tr>
<td>13</td>
<td>https://ror.org/00p9jf779</td>
<td>Medicines for Malaria Venture (MMV)</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.mmv.org/</td>
<td>215</td>
<td>1999</td>
<td>0</td>
<td>MMV</td>
<td></td>
<td>0000 0004 0432 5267</td>
<td>501100004167</td>
<td></td>
<td>Q6806774</td>
<td>grid.452605.0</td>
</tr>
<tr>
<td>14</td>
<td>https://ror.org/038mj2660</td>
<td>Ostschweizer Fachhochschule OST</td>
<td>Eastern Switzerland University of Applied Scie...</td>
<td></td>
<td></td>
<td></td>
<td>https://www.ost.ch/</td>
<td>215</td>
<td>1999</td>
<td>0</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>grid.510272.3</td>
</tr>
<tr>
<td>16</td>
<td>https://ror.org/05jf1ma54</td>
<td>Pädagogische Hochschule Bern</td>
<td>Bern University of Teacher Education</td>
<td></td>
<td></td>
<td></td>
<td>https://www.phbern.ch</td>
<td>215</td>
<td>2005</td>
<td>0</td>
<td></td>
<td>PHBern</td>
<td>0000 0000 8585 5665</td>
<td></td>
<td></td>
<td></td>
<td>grid.454333.6</td>
</tr>
<tr>
<td>17</td>
<td>https://ror.org/02fjgft97</td>
<td>Pädagogische Hochschule Graubünden (PHGR)</td>
<td></td>
<td></td>
<td></td>
<td>Alta scuola pedagogica dei Grigioni</td>
<td>http://www.phgr.ch/</td>
<td>215</td>
<td></td>
<td>0</td>
<td>PHGR</td>
<td></td>
<td>0000 0000 9317 283X</td>
<td></td>
<td></td>
<td></td>
<td>grid.469478.0</td>
</tr>
<tr>
<td>19</td>
<td>https://ror.org/03fs41j10</td>
<td>Pädagogische Hochschule Schaffhausen (PHSH)</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.phsh.ch/</td>
<td>215</td>
<td>2003</td>
<td>0</td>
<td>PHSH</td>
<td></td>
<td>0000 0004 0450 7546</td>
<td></td>
<td></td>
<td></td>
<td>grid.466133.5</td>
</tr>
<tr>
<td>22</td>
<td>https://ror.org/04bf6dq94</td>
<td>Pädagogische Hochschule Thurgau (PHTG)</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.phtg.ch/home/</td>
<td>215</td>
<td>2003</td>
<td>0</td>
<td>PHTG</td>
<td></td>
<td>0000 0004 0613 3824</td>
<td></td>
<td></td>
<td></td>
<td>grid.466322.7</td>
</tr>
<tr>
<td>45</td>
<td>https://ror.org/040gs8e06</td>
<td>Pädagogische Hochschule Wallis (PH-VS)</td>
<td></td>
<td>Haute École Pédagogique du Valais</td>
<td></td>
<td></td>
<td>http://www.hepvs.ch/de</td>
<td>215</td>
<td>2000</td>
<td>0</td>
<td>PH-VS</td>
<td></td>
<td>0000 0001 2178 3217</td>
<td></td>
<td></td>
<td></td>
<td>grid.466216.1</td>
</tr>
<tr>
<td>20</td>
<td>https://ror.org/00rqdn375</td>
<td>Schwyz University of Teacher Education (PHSZ)</td>
<td></td>
<td></td>
<td>Pädagogische Hochschule Schwyz</td>
<td></td>
<td>https://www.phsz.ch/en/</td>
<td>215</td>
<td></td>
<td>0</td>
<td>PHSZ</td>
<td>PHZ Schwyz</td>
<td>0000 0004 0613 7454</td>
<td></td>
<td></td>
<td></td>
<td>grid.466169.a</td>
</tr>
<tr>
<td>21</td>
<td>https://ror.org/05m37v666</td>
<td>St.Gallen University of Teacher Education (PHSG)</td>
<td></td>
<td></td>
<td>Pädagogische Hochschule St. Gallen</td>
<td></td>
<td>https://www.phsg.ch/en</td>
<td>215</td>
<td>2007</td>
<td>0</td>
<td>PHSG</td>
<td></td>
<td>0000 0001 0271 5139</td>
<td></td>
<td></td>
<td>Q1768652</td>
<td>grid.466208.e</td>
</tr>
<tr>
<td>25</td>
<td>https://ror.org/00zg4za48</td>
<td>Swiss Federal Institute for Vocational Educati...</td>
<td></td>
<td>Institut Fédéral des Hautes Études en Formatio...</td>
<td>Eidgenössisches Hochschulinstitut für Berufsbi...</td>
<td></td>
<td>http://www.ehb-schweiz.ch/en/</td>
<td>215</td>
<td>2007</td>
<td>0</td>
<td>SFIVET</td>
<td></td>
<td>0000 0001 2285 5681</td>
<td></td>
<td></td>
<td>Q1302632</td>
<td>grid.466173.1</td>
</tr>
<tr>
<td>37</td>
<td>https://ror.org/03mcsbr76</td>
<td>Swiss Ornithological Institute</td>
<td></td>
<td></td>
<td>Schweizerische Vogelwarte</td>
<td></td>
<td>http://www.vogelwarte.ch/de/home/</td>
<td>215</td>
<td>1924</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0001 1512 3677</td>
<td></td>
<td></td>
<td>Q663638</td>
<td>grid.419767.a</td>
</tr>
<tr>
<td>39</td>
<td>https://ror.org/03c4atk17</td>
<td>Universita della Svizzera Italiana (USI)</td>
<td>University of Italian Switzerland</td>
<td>Université de la suisse italienne</td>
<td></td>
<td>Università della Svizzera italiana</td>
<td>http://www.usi.ch/en/index.htm</td>
<td>215</td>
<td>1996</td>
<td>0</td>
<td>USI</td>
<td></td>
<td>0000 0001 2203 2861</td>
<td></td>
<td>2290642</td>
<td>Q689617</td>
<td>grid.29078.34</td>
</tr>
<tr>
<td>1</td>
<td>https://ror.org/04mq2g308</td>
<td>University of Applied Sciences and Arts Northw...</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.fhnw.ch/homepage</td>
<td>215</td>
<td>2006</td>
<td>0</td>
<td>FHNW</td>
<td>Fachhochschule Nordwestschweiz</td>
<td>0000 0001 1497 8091</td>
<td></td>
<td></td>
<td></td>
<td>grid.410380.e</td>
</tr>
<tr>
<td>4</td>
<td>https://ror.org/01xkakk17</td>
<td>University of Applied Sciences and Arts Wester...</td>
<td></td>
<td>Haute École Spécialisée de Suisse Occidentale</td>
<td>Fachhochschule Westschweiz</td>
<td></td>
<td>http://www.hes-so.ch/en/homepage-hes-so-1679.html</td>
<td>215</td>
<td>1998</td>
<td>0</td>
<td>HES-SO</td>
<td></td>
<td>0000 0001 0943 1999</td>
<td></td>
<td>10128956</td>
<td>Q168003</td>
<td>grid.5681.a</td>
</tr>
<tr>
<td>38</td>
<td>https://ror.org/05ep8g269</td>
<td>University of Applied Sciences and Arts of Sou...</td>
<td></td>
<td></td>
<td></td>
<td>Scuola Universitaria Professionale della Svizz...</td>
<td>http://www.supsi.ch/home_en.html</td>
<td>215</td>
<td>1997</td>
<td>0</td>
<td>SUPSI</td>
<td></td>
<td>0000000123252233</td>
<td></td>
<td>34066841</td>
<td>Q663984</td>
<td>grid.16058.3a</td>
</tr>
<tr>
<td>10</td>
<td>https://ror.org/00w9q2c06</td>
<td>University of Applied Sciences of Special Need...</td>
<td></td>
<td></td>
<td>Interkantonale Hochschule für Heilpädagogik</td>
<td></td>
<td>http://www.hfh.ch/en/</td>
<td>215</td>
<td>1924</td>
<td>0</td>
<td>HfH</td>
<td>Zurich Training College for Teachers of Specia...</td>
<td>0000 0001 0710 6332</td>
<td></td>
<td></td>
<td></td>
<td>grid.466279.8</td>
</tr>
<tr>
<td>0</td>
<td>https://ror.org/032ymzc07</td>
<td>University of Applied Sciences of the Grisons</td>
<td></td>
<td></td>
<td>Fachhochschule Graubünden</td>
<td></td>
<td>https://www.fhgr.ch/en/</td>
<td>215</td>
<td>1963</td>
<td>0</td>
<td></td>
<td>Hochschule für Technik und Wirtschaft Chur</td>
<td>0000 0000 8718 2812</td>
<td></td>
<td></td>
<td>Q1622220</td>
<td>grid.460104.7</td>
</tr>
<tr>
<td>40</td>
<td>https://ror.org/02s6k3f65</td>
<td>University of Basel</td>
<td></td>
<td>Université de bâle</td>
<td>Universität Basel</td>
<td>Università di Basilea</td>
<td>https://www.unibas.ch/de</td>
<td>215</td>
<td>1460</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0004 1937 0642</td>
<td>100008375</td>
<td>427614</td>
<td>Q372608</td>
<td>grid.6612.3</td>
</tr>
<tr>
<td>41</td>
<td>https://ror.org/02k7v4d05</td>
<td>University of Bern (UB)</td>
<td></td>
<td>Université de Berne</td>
<td>Universität Bern</td>
<td>Università di Berna</td>
<td>http://www.unibe.ch/eng/</td>
<td>215</td>
<td>1834</td>
<td>0</td>
<td>UB</td>
<td></td>
<td>0000 0001 0726 5157</td>
<td>100009068</td>
<td>1157515</td>
<td>Q659080</td>
<td>grid.5734.5</td>
</tr>
<tr>
<td>30</td>
<td>https://ror.org/022fs9h90</td>
<td>University of Fribourg</td>
<td></td>
<td>Université de Fribourg</td>
<td>Universität Freiburg</td>
<td>Università di Friburgo</td>
<td>http://www.unifr.ch/home/welcomeE.php</td>
<td>215</td>
<td>1889</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0004 0478 1713</td>
<td>501100005869</td>
<td>535267</td>
<td>Q36188</td>
<td>grid.8534.a</td>
</tr>
<tr>
<td>31</td>
<td>https://ror.org/01swzsf04</td>
<td>University of Geneva (UNIGE)</td>
<td></td>
<td>Université de Genève</td>
<td></td>
<td>Università di Ginevra</td>
<td>https://www.unige.ch/</td>
<td>215</td>
<td>1559</td>
<td>0</td>
<td>UNIGE</td>
<td>Schola Genevensis</td>
<td>0000 0001 2322 4988</td>
<td>501100006389</td>
<td>342348</td>
<td>Q503473</td>
<td>grid.8591.5</td>
</tr>
<tr>
<td>32</td>
<td>https://ror.org/019whta54</td>
<td>University of Lausanne (UNIL)</td>
<td></td>
<td>Université de Lausanne</td>
<td>Universität Lausanne</td>
<td>Università di Losanna</td>
<td>http://www.unil.ch/central/en/home.html</td>
<td>215</td>
<td>1537</td>
<td>0</td>
<td>UNIL</td>
<td>Schola Lausannensis</td>
<td>0000 0001 2165 4204</td>
<td>501100006390</td>
<td>79810</td>
<td>Q658975</td>
<td>grid.9851.5</td>
</tr>
<tr>
<td>42</td>
<td>https://ror.org/01qjrx392</td>
<td>University of Liechtenstein</td>
<td></td>
<td></td>
<td>Universität Liechtenstein</td>
<td></td>
<td>https://www.uni.li/study/de/</td>
<td>128</td>
<td>1961</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0001 2227 4668</td>
<td></td>
<td>10554064</td>
<td>Q974328</td>
<td>grid.445905.9</td>
</tr>
<tr>
<td>43</td>
<td>https://ror.org/00kgrkn83</td>
<td>University of Lucerne (UNILU)</td>
<td></td>
<td>Université de lucerne</td>
<td>Universität Luzern</td>
<td>Università di Lucerna</td>
<td>http://www.unilu.ch/</td>
<td>215</td>
<td>2000</td>
<td>0</td>
<td>UNILU</td>
<td></td>
<td>0000 0001 1456 7938</td>
<td></td>
<td>21004764</td>
<td>Q673308</td>
<td>grid.449852.6</td>
</tr>
<tr>
<td>33</td>
<td>https://ror.org/00vasag41</td>
<td>University of Neuchâtel</td>
<td></td>
<td>Université de neuchâtel</td>
<td>Universität Neuenburg</td>
<td></td>
<td>http://www2.unine.ch/</td>
<td>215</td>
<td>1838</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0001 2297 7718</td>
<td>501100005353</td>
<td>3662101</td>
<td>Q541548</td>
<td>grid.10711.36</td>
</tr>
<tr>
<td>44</td>
<td>https://ror.org/0561a3s31</td>
<td>University of St. Gallen (HSG)</td>
<td></td>
<td>Université de saint-gall</td>
<td>Universität St. Gallen</td>
<td>Università di San Gallo</td>
<td>http://www.es.unisg.ch/en/</td>
<td>215</td>
<td>1898</td>
<td>0</td>
<td>HSG</td>
<td></td>
<td>0000 0001 2156 6618</td>
<td>100009572</td>
<td>751473</td>
<td>Q673354</td>
<td>grid.15775.31</td>
</tr>
<tr>
<td>18</td>
<td>https://ror.org/0235ynq74</td>
<td>University of Teacher Education Lucerne</td>
<td></td>
<td></td>
<td>Pädagogische Hochschule Luzern</td>
<td></td>
<td>http://www.phlu.ch/ute-lucerne/</td>
<td>215</td>
<td>2003</td>
<td>0</td>
<td></td>
<td>PH Luzern</td>
<td>0000 0001 0348 1637</td>
<td></td>
<td></td>
<td></td>
<td>grid.465965.d</td>
</tr>
<tr>
<td>36</td>
<td>https://ror.org/05ghhx264</td>
<td>University of Teacher Education Zug (PH Zug)</td>
<td></td>
<td></td>
<td>Pädagogische Hochschule Zug</td>
<td></td>
<td>https://www.zg.ch/behoerden/direktion-fur-bild...</td>
<td>215</td>
<td>2013</td>
<td>0</td>
<td>PH Zug</td>
<td></td>
<td>0000 0004 0449 2225</td>
<td></td>
<td></td>
<td></td>
<td>grid.466274.5</td>
</tr>
<tr>
<td>29</td>
<td>https://ror.org/02crff812</td>
<td>University of Zurich (UZH)</td>
<td></td>
<td>Université de zurich</td>
<td>Universität Zürich</td>
<td>Università di Zurigo</td>
<td>http://www.uzh.ch/index_en.html</td>
<td>215</td>
<td>1833</td>
<td>0</td>
<td>UZH</td>
<td></td>
<td>0000 0004 1937 0650</td>
<td>501100006447</td>
<td>314803</td>
<td>Q206702</td>
<td>grid.7400.3</td>
</tr>
<tr>
<td>35</td>
<td>https://ror.org/05pmsvm27</td>
<td>Zurich University of Applied Sciences (ZHAW)</td>
<td></td>
<td></td>
<td>Zürcher Hochschule für Angewandte Wissenschaften</td>
<td></td>
<td>https://www.zhaw.ch/en/university/</td>
<td>215</td>
<td>2007</td>
<td>0</td>
<td>ZHAW</td>
<td></td>
<td>0000000122291644</td>
<td></td>
<td>30930550</td>
<td>Q2605554</td>
<td>grid.19739.35</td>
</tr>
<tr>
<td>8</td>
<td>https://ror.org/02ejkey04</td>
<td>Zurich University of Applied Sciences in Busin...</td>
<td></td>
<td></td>
<td>Hochschule für Wirtschaft Zürich</td>
<td></td>
<td>http://www.fh-hwz.ch/en</td>
<td>215</td>
<td>1986</td>
<td>0</td>
<td>HWZ</td>
<td></td>
<td>0000 0001 0008 3713</td>
<td></td>
<td>30805829</td>
<td>Q1488771</td>
<td>grid.449909.9</td>
</tr>
<tr>
<td>15</td>
<td>https://ror.org/01awgk221</td>
<td>Zurich University of Teacher Education (PHZH)</td>
<td></td>
<td></td>
<td>Pädagogische Hochschule Zürich</td>
<td></td>
<td>https://phzh.ch/en/</td>
<td>215</td>
<td>2002</td>
<td>0</td>
<td>PHZH</td>
<td>PH Zürich</td>
<td>0000 0000 9666 1858</td>
<td></td>
<td></td>
<td></td>
<td>grid.483054.e</td>
</tr>
<tr>
<td>34</td>
<td>https://ror.org/05r0ap620</td>
<td>Zurich University of the Arts</td>
<td></td>
<td>Haute École d'Art de Zurich</td>
<td>Zürcher Hochschule der Künste</td>
<td></td>
<td>https://www.zhdk.ch/</td>
<td>215</td>
<td>2007</td>
<td>0</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>39250592</td>
<td>Q222450</td>
<td>grid.449912.3</td>
</tr>
<tr>
<td>24</td>
<td>https://ror.org/02s376052</td>
<td>École Polytechnique Fédérale de Lausanne (EPFL)</td>
<td>Swiss Federal Institute of Technology in Lausanne</td>
<td></td>
<td></td>
<td></td>
<td>http://www.epfl.ch/index.en.html</td>
<td>215</td>
<td>1853</td>
<td>0</td>
<td>EPFL</td>
<td></td>
<td>0000000121839049</td>
<td>501100001703</td>
<td>71968</td>
<td>Q262760</td>
<td>grid.5333.6</td>
</tr>
</tbody>
</table>
</div>
```python
organization = organization.reset_index(drop=True)
organization
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>ror</th>
<th>name</th>
<th>label_en</th>
<th>label_fr</th>
<th>label_de</th>
<th>label_it</th>
<th>website</th>
<th>country</th>
<th>starting_year</th>
<th>is_funder</th>
<th>acronym</th>
<th>aliases</th>
<th>isni</th>
<th>fundref</th>
<th>orgref</th>
<th>wikidata</th>
<th>grid</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>https://ror.org/04d8ztx87</td>
<td>Agroscope</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>https://www.agroscope.admin.ch/agroscope/en/ho...</td>
<td>215</td>
<td>1850</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0004 4681 910X</td>
<td></td>
<td></td>
<td>Q397466</td>
<td>grid.417771.3</td>
</tr>
<tr>
<td>1</td>
<td>https://ror.org/02bnkt322</td>
<td>Bern University of Applied Sciences (BFH)</td>
<td></td>
<td>Haute école spécialisée bernoise</td>
<td>Berner Fachhochschule</td>
<td></td>
<td>http://www.bfh.ch/en/home.html</td>
<td>215</td>
<td>1997</td>
<td>0</td>
<td>BFH</td>
<td></td>
<td>0000 0001 0688 6779</td>
<td>501100006259</td>
<td>4365265</td>
<td>Q466455</td>
<td>grid.424060.4</td>
</tr>
<tr>
<td>2</td>
<td>https://ror.org/05a28rw58</td>
<td>ETH Zurich (ETH Zurich)</td>
<td></td>
<td>École Polytechnique Fédérale de Zurich</td>
<td>Eidgenössische Technische Hochschule Zürich</td>
<td>Politecnico federale di Zurigo</td>
<td>https://www.ethz.ch/en.html</td>
<td>215</td>
<td>1855</td>
<td>0</td>
<td>ETH Zurich</td>
<td>Swiss Federal Institute of Technology in Zuric...</td>
<td>0000 0001 2156 2780</td>
<td>501100003006</td>
<td>210910</td>
<td>Q11942</td>
<td>grid.5801.c</td>
</tr>
<tr>
<td>3</td>
<td>https://ror.org/01ggx4157</td>
<td>European Organization for Nuclear Research (CERN)</td>
<td></td>
<td>Organisation européenne pour la recherche nucl...</td>
<td>Europäische Organisation für Kernforschung</td>
<td></td>
<td>http://home.web.cern.ch/</td>
<td>215</td>
<td>1954</td>
<td>0</td>
<td>CERN</td>
<td></td>
<td>0000 0001 2156 142X</td>
<td></td>
<td>37351</td>
<td>Q42944</td>
<td>grid.9132.9</td>
</tr>
<tr>
<td>4</td>
<td>https://ror.org/0210tb741</td>
<td>Forschungsinstitut für biologischen Landbau (F...</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>https://www.fibl.org/en/germany/location-de.html</td>
<td>215</td>
<td></td>
<td>0</td>
<td>FiBL</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>grid.506220.3</td>
</tr>
<tr>
<td>5</td>
<td>https://ror.org/007ygn379</td>
<td>Graduate Institute of International and Develo...</td>
<td></td>
<td>Institut de Hautes études Internationales et d...</td>
<td>Hochschulinstitut für internationale Studien u...</td>
<td></td>
<td>http://graduateinstitute.ch/home.html</td>
<td>215</td>
<td>1927</td>
<td>0</td>
<td>IHEID</td>
<td>Graduate Institute Geneva</td>
<td>0000 0001 2296 9873</td>
<td></td>
<td>14744053</td>
<td>Q691686</td>
<td>grid.424404.2</td>
</tr>
<tr>
<td>6</td>
<td>https://ror.org/015pmkr43</td>
<td>Haute École Pédagogique BEJUNE (HEP BEJUNE)</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.hep-bejune.ch/</td>
<td>215</td>
<td>2001</td>
<td>0</td>
<td>HEP BEJUNE</td>
<td></td>
<td>0000 0001 0658 3479</td>
<td></td>
<td></td>
<td></td>
<td>grid.469449.2</td>
</tr>
<tr>
<td>7</td>
<td>https://ror.org/048gre751</td>
<td>Haute École Pédagogique Fribourg (HEP-PH FR)</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>https://www.hepfr.ch/</td>
<td>215</td>
<td>1990</td>
<td>0</td>
<td>HEP-PH FR</td>
<td></td>
<td>0000 0001 0266 4909</td>
<td></td>
<td></td>
<td></td>
<td>grid.469451.b</td>
</tr>
<tr>
<td>8</td>
<td>https://ror.org/01bvm0h13</td>
<td>Haute École Pédagogique du Canton de Vaud (HEP...</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.hepl.ch/cms/accueil.html</td>
<td>215</td>
<td>2001</td>
<td>0</td>
<td>HEP Vaud</td>
<td></td>
<td>0000 0004 0613 4050</td>
<td></td>
<td></td>
<td></td>
<td>grid.466224.0</td>
</tr>
<tr>
<td>9</td>
<td>https://ror.org/049c2kr37</td>
<td>Kalaidos University of Applied Sciences (Kalai...</td>
<td></td>
<td></td>
<td>Kalaidos Fachhochschule</td>
<td></td>
<td>https://www.kalaidos-fh.ch/de-CH</td>
<td>215</td>
<td>1995</td>
<td>0</td>
<td>Kalaidos UAS</td>
<td></td>
<td>0000 0004 0453 9054</td>
<td></td>
<td>6746630</td>
<td>Q681372</td>
<td>grid.449532.d</td>
</tr>
<tr>
<td>10</td>
<td>https://ror.org/021f7p178</td>
<td>Lib4RI - Library for the Research Institutes w...</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.lib4ri.ch/</td>
<td>215</td>
<td>2011</td>
<td>0</td>
<td></td>
<td>Lib4RI</td>
<td>0000 0004 0624 8541</td>
<td></td>
<td></td>
<td>Q1278450</td>
<td>grid.458352.d</td>
</tr>
<tr>
<td>11</td>
<td>https://ror.org/04nd0xd48</td>
<td>Lucerne University of Applied Sciences and Arts</td>
<td></td>
<td>Haute École de lucerne</td>
<td>Hochschule Luzern</td>
<td></td>
<td>https://www.hslu.ch/en/</td>
<td>215</td>
<td>1997</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0001 2191 8943</td>
<td></td>
<td>19480920</td>
<td>Q664028</td>
<td>grid.425064.1</td>
</tr>
<tr>
<td>12</td>
<td>https://ror.org/00p9jf779</td>
<td>Medicines for Malaria Venture (MMV)</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.mmv.org/</td>
<td>215</td>
<td>1999</td>
<td>0</td>
<td>MMV</td>
<td></td>
<td>0000 0004 0432 5267</td>
<td>501100004167</td>
<td></td>
<td>Q6806774</td>
<td>grid.452605.0</td>
</tr>
<tr>
<td>13</td>
<td>https://ror.org/038mj2660</td>
<td>Ostschweizer Fachhochschule OST</td>
<td>Eastern Switzerland University of Applied Scie...</td>
<td></td>
<td></td>
<td></td>
<td>https://www.ost.ch/</td>
<td>215</td>
<td>1999</td>
<td>0</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>grid.510272.3</td>
</tr>
<tr>
<td>14</td>
<td>https://ror.org/05jf1ma54</td>
<td>Pädagogische Hochschule Bern</td>
<td>Bern University of Teacher Education</td>
<td></td>
<td></td>
<td></td>
<td>https://www.phbern.ch</td>
<td>215</td>
<td>2005</td>
<td>0</td>
<td></td>
<td>PHBern</td>
<td>0000 0000 8585 5665</td>
<td></td>
<td></td>
<td></td>
<td>grid.454333.6</td>
</tr>
<tr>
<td>15</td>
<td>https://ror.org/02fjgft97</td>
<td>Pädagogische Hochschule Graubünden (PHGR)</td>
<td></td>
<td></td>
<td></td>
<td>Alta scuola pedagogica dei Grigioni</td>
<td>http://www.phgr.ch/</td>
<td>215</td>
<td></td>
<td>0</td>
<td>PHGR</td>
<td></td>
<td>0000 0000 9317 283X</td>
<td></td>
<td></td>
<td></td>
<td>grid.469478.0</td>
</tr>
<tr>
<td>16</td>
<td>https://ror.org/03fs41j10</td>
<td>Pädagogische Hochschule Schaffhausen (PHSH)</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.phsh.ch/</td>
<td>215</td>
<td>2003</td>
<td>0</td>
<td>PHSH</td>
<td></td>
<td>0000 0004 0450 7546</td>
<td></td>
<td></td>
<td></td>
<td>grid.466133.5</td>
</tr>
<tr>
<td>17</td>
<td>https://ror.org/04bf6dq94</td>
<td>Pädagogische Hochschule Thurgau (PHTG)</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.phtg.ch/home/</td>
<td>215</td>
<td>2003</td>
<td>0</td>
<td>PHTG</td>
<td></td>
<td>0000 0004 0613 3824</td>
<td></td>
<td></td>
<td></td>
<td>grid.466322.7</td>
</tr>
<tr>
<td>18</td>
<td>https://ror.org/040gs8e06</td>
<td>Pädagogische Hochschule Wallis (PH-VS)</td>
<td></td>
<td>Haute École Pédagogique du Valais</td>
<td></td>
<td></td>
<td>http://www.hepvs.ch/de</td>
<td>215</td>
<td>2000</td>
<td>0</td>
<td>PH-VS</td>
<td></td>
<td>0000 0001 2178 3217</td>
<td></td>
<td></td>
<td></td>
<td>grid.466216.1</td>
</tr>
<tr>
<td>19</td>
<td>https://ror.org/00rqdn375</td>
<td>Schwyz University of Teacher Education (PHSZ)</td>
<td></td>
<td></td>
<td>Pädagogische Hochschule Schwyz</td>
<td></td>
<td>https://www.phsz.ch/en/</td>
<td>215</td>
<td></td>
<td>0</td>
<td>PHSZ</td>
<td>PHZ Schwyz</td>
<td>0000 0004 0613 7454</td>
<td></td>
<td></td>
<td></td>
<td>grid.466169.a</td>
</tr>
<tr>
<td>20</td>
<td>https://ror.org/05m37v666</td>
<td>St.Gallen University of Teacher Education (PHSG)</td>
<td></td>
<td></td>
<td>Pädagogische Hochschule St. Gallen</td>
<td></td>
<td>https://www.phsg.ch/en</td>
<td>215</td>
<td>2007</td>
<td>0</td>
<td>PHSG</td>
<td></td>
<td>0000 0001 0271 5139</td>
<td></td>
<td></td>
<td>Q1768652</td>
<td>grid.466208.e</td>
</tr>
<tr>
<td>21</td>
<td>https://ror.org/00zg4za48</td>
<td>Swiss Federal Institute for Vocational Educati...</td>
<td></td>
<td>Institut Fédéral des Hautes Études en Formatio...</td>
<td>Eidgenössisches Hochschulinstitut für Berufsbi...</td>
<td></td>
<td>http://www.ehb-schweiz.ch/en/</td>
<td>215</td>
<td>2007</td>
<td>0</td>
<td>SFIVET</td>
<td></td>
<td>0000 0001 2285 5681</td>
<td></td>
<td></td>
<td>Q1302632</td>
<td>grid.466173.1</td>
</tr>
<tr>
<td>22</td>
<td>https://ror.org/03mcsbr76</td>
<td>Swiss Ornithological Institute</td>
<td></td>
<td></td>
<td>Schweizerische Vogelwarte</td>
<td></td>
<td>http://www.vogelwarte.ch/de/home/</td>
<td>215</td>
<td>1924</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0001 1512 3677</td>
<td></td>
<td></td>
<td>Q663638</td>
<td>grid.419767.a</td>
</tr>
<tr>
<td>23</td>
<td>https://ror.org/03c4atk17</td>
<td>Universita della Svizzera Italiana (USI)</td>
<td>University of Italian Switzerland</td>
<td>Université de la suisse italienne</td>
<td></td>
<td>Università della Svizzera italiana</td>
<td>http://www.usi.ch/en/index.htm</td>
<td>215</td>
<td>1996</td>
<td>0</td>
<td>USI</td>
<td></td>
<td>0000 0001 2203 2861</td>
<td></td>
<td>2290642</td>
<td>Q689617</td>
<td>grid.29078.34</td>
</tr>
<tr>
<td>24</td>
<td>https://ror.org/04mq2g308</td>
<td>University of Applied Sciences and Arts Northw...</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.fhnw.ch/homepage</td>
<td>215</td>
<td>2006</td>
<td>0</td>
<td>FHNW</td>
<td>Fachhochschule Nordwestschweiz</td>
<td>0000 0001 1497 8091</td>
<td></td>
<td></td>
<td></td>
<td>grid.410380.e</td>
</tr>
<tr>
<td>25</td>
<td>https://ror.org/01xkakk17</td>
<td>University of Applied Sciences and Arts Wester...</td>
<td></td>
<td>Haute École Spécialisée de Suisse Occidentale</td>
<td>Fachhochschule Westschweiz</td>
<td></td>
<td>http://www.hes-so.ch/en/homepage-hes-so-1679.html</td>
<td>215</td>
<td>1998</td>
<td>0</td>
<td>HES-SO</td>
<td></td>
<td>0000 0001 0943 1999</td>
<td></td>
<td>10128956</td>
<td>Q168003</td>
<td>grid.5681.a</td>
</tr>
<tr>
<td>26</td>
<td>https://ror.org/05ep8g269</td>
<td>University of Applied Sciences and Arts of Sou...</td>
<td></td>
<td></td>
<td></td>
<td>Scuola Universitaria Professionale della Svizz...</td>
<td>http://www.supsi.ch/home_en.html</td>
<td>215</td>
<td>1997</td>
<td>0</td>
<td>SUPSI</td>
<td></td>
<td>0000000123252233</td>
<td></td>
<td>34066841</td>
<td>Q663984</td>
<td>grid.16058.3a</td>
</tr>
<tr>
<td>27</td>
<td>https://ror.org/00w9q2c06</td>
<td>University of Applied Sciences of Special Need...</td>
<td></td>
<td></td>
<td>Interkantonale Hochschule für Heilpädagogik</td>
<td></td>
<td>http://www.hfh.ch/en/</td>
<td>215</td>
<td>1924</td>
<td>0</td>
<td>HfH</td>
<td>Zurich Training College for Teachers of Specia...</td>
<td>0000 0001 0710 6332</td>
<td></td>
<td></td>
<td></td>
<td>grid.466279.8</td>
</tr>
<tr>
<td>28</td>
<td>https://ror.org/032ymzc07</td>
<td>University of Applied Sciences of the Grisons</td>
<td></td>
<td></td>
<td>Fachhochschule Graubünden</td>
<td></td>
<td>https://www.fhgr.ch/en/</td>
<td>215</td>
<td>1963</td>
<td>0</td>
<td></td>
<td>Hochschule für Technik und Wirtschaft Chur</td>
<td>0000 0000 8718 2812</td>
<td></td>
<td></td>
<td>Q1622220</td>
<td>grid.460104.7</td>
</tr>
<tr>
<td>29</td>
<td>https://ror.org/02s6k3f65</td>
<td>University of Basel</td>
<td></td>
<td>Université de bâle</td>
<td>Universität Basel</td>
<td>Università di Basilea</td>
<td>https://www.unibas.ch/de</td>
<td>215</td>
<td>1460</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0004 1937 0642</td>
<td>100008375</td>
<td>427614</td>
<td>Q372608</td>
<td>grid.6612.3</td>
</tr>
<tr>
<td>30</td>
<td>https://ror.org/02k7v4d05</td>
<td>University of Bern (UB)</td>
<td></td>
<td>Université de Berne</td>
<td>Universität Bern</td>
<td>Università di Berna</td>
<td>http://www.unibe.ch/eng/</td>
<td>215</td>
<td>1834</td>
<td>0</td>
<td>UB</td>
<td></td>
<td>0000 0001 0726 5157</td>
<td>100009068</td>
<td>1157515</td>
<td>Q659080</td>
<td>grid.5734.5</td>
</tr>
<tr>
<td>31</td>
<td>https://ror.org/022fs9h90</td>
<td>University of Fribourg</td>
<td></td>
<td>Université de Fribourg</td>
<td>Universität Freiburg</td>
<td>Università di Friburgo</td>
<td>http://www.unifr.ch/home/welcomeE.php</td>
<td>215</td>
<td>1889</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0004 0478 1713</td>
<td>501100005869</td>
<td>535267</td>
<td>Q36188</td>
<td>grid.8534.a</td>
</tr>
<tr>
<td>32</td>
<td>https://ror.org/01swzsf04</td>
<td>University of Geneva (UNIGE)</td>
<td></td>
<td>Université de Genève</td>
<td></td>
<td>Università di Ginevra</td>
<td>https://www.unige.ch/</td>
<td>215</td>
<td>1559</td>
<td>0</td>
<td>UNIGE</td>
<td>Schola Genevensis</td>
<td>0000 0001 2322 4988</td>
<td>501100006389</td>
<td>342348</td>
<td>Q503473</td>
<td>grid.8591.5</td>
</tr>
<tr>
<td>33</td>
<td>https://ror.org/019whta54</td>
<td>University of Lausanne (UNIL)</td>
<td></td>
<td>Université de Lausanne</td>
<td>Universität Lausanne</td>
<td>Università di Losanna</td>
<td>http://www.unil.ch/central/en/home.html</td>
<td>215</td>
<td>1537</td>
<td>0</td>
<td>UNIL</td>
<td>Schola Lausannensis</td>
<td>0000 0001 2165 4204</td>
<td>501100006390</td>
<td>79810</td>
<td>Q658975</td>
<td>grid.9851.5</td>
</tr>
<tr>
<td>34</td>
<td>https://ror.org/01qjrx392</td>
<td>University of Liechtenstein</td>
<td></td>
<td></td>
<td>Universität Liechtenstein</td>
<td></td>
<td>https://www.uni.li/study/de/</td>
<td>128</td>
<td>1961</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0001 2227 4668</td>
<td></td>
<td>10554064</td>
<td>Q974328</td>
<td>grid.445905.9</td>
</tr>
<tr>
<td>35</td>
<td>https://ror.org/00kgrkn83</td>
<td>University of Lucerne (UNILU)</td>
<td></td>
<td>Université de lucerne</td>
<td>Universität Luzern</td>
<td>Università di Lucerna</td>
<td>http://www.unilu.ch/</td>
<td>215</td>
<td>2000</td>
<td>0</td>
<td>UNILU</td>
<td></td>
<td>0000 0001 1456 7938</td>
<td></td>
<td>21004764</td>
<td>Q673308</td>
<td>grid.449852.6</td>
</tr>
<tr>
<td>36</td>
<td>https://ror.org/00vasag41</td>
<td>University of Neuchâtel</td>
<td></td>
<td>Université de neuchâtel</td>
<td>Universität Neuenburg</td>
<td></td>
<td>http://www2.unine.ch/</td>
<td>215</td>
<td>1838</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0001 2297 7718</td>
<td>501100005353</td>
<td>3662101</td>
<td>Q541548</td>
<td>grid.10711.36</td>
</tr>
<tr>
<td>37</td>
<td>https://ror.org/0561a3s31</td>
<td>University of St. Gallen (HSG)</td>
<td></td>
<td>Université de saint-gall</td>
<td>Universität St. Gallen</td>
<td>Università di San Gallo</td>
<td>http://www.es.unisg.ch/en/</td>
<td>215</td>
<td>1898</td>
<td>0</td>
<td>HSG</td>
<td></td>
<td>0000 0001 2156 6618</td>
<td>100009572</td>
<td>751473</td>
<td>Q673354</td>
<td>grid.15775.31</td>
</tr>
<tr>
<td>38</td>
<td>https://ror.org/0235ynq74</td>
<td>University of Teacher Education Lucerne</td>
<td></td>
<td></td>
<td>Pädagogische Hochschule Luzern</td>
<td></td>
<td>http://www.phlu.ch/ute-lucerne/</td>
<td>215</td>
<td>2003</td>
<td>0</td>
<td></td>
<td>PH Luzern</td>
<td>0000 0001 0348 1637</td>
<td></td>
<td></td>
<td></td>
<td>grid.465965.d</td>
</tr>
<tr>
<td>39</td>
<td>https://ror.org/05ghhx264</td>
<td>University of Teacher Education Zug (PH Zug)</td>
<td></td>
<td></td>
<td>Pädagogische Hochschule Zug</td>
<td></td>
<td>https://www.zg.ch/behoerden/direktion-fur-bild...</td>
<td>215</td>
<td>2013</td>
<td>0</td>
<td>PH Zug</td>
<td></td>
<td>0000 0004 0449 2225</td>
<td></td>
<td></td>
<td></td>
<td>grid.466274.5</td>
</tr>
<tr>
<td>40</td>
<td>https://ror.org/02crff812</td>
<td>University of Zurich (UZH)</td>
<td></td>
<td>Université de zurich</td>
<td>Universität Zürich</td>
<td>Università di Zurigo</td>
<td>http://www.uzh.ch/index_en.html</td>
<td>215</td>
<td>1833</td>
<td>0</td>
<td>UZH</td>
<td></td>
<td>0000 0004 1937 0650</td>
<td>501100006447</td>
<td>314803</td>
<td>Q206702</td>
<td>grid.7400.3</td>
</tr>
<tr>
<td>41</td>
<td>https://ror.org/05pmsvm27</td>
<td>Zurich University of Applied Sciences (ZHAW)</td>
<td></td>
<td></td>
<td>Zürcher Hochschule für Angewandte Wissenschaften</td>
<td></td>
<td>https://www.zhaw.ch/en/university/</td>
<td>215</td>
<td>2007</td>
<td>0</td>
<td>ZHAW</td>
<td></td>
<td>0000000122291644</td>
<td></td>
<td>30930550</td>
<td>Q2605554</td>
<td>grid.19739.35</td>
</tr>
<tr>
<td>42</td>
<td>https://ror.org/02ejkey04</td>
<td>Zurich University of Applied Sciences in Busin...</td>
<td></td>
<td></td>
<td>Hochschule für Wirtschaft Zürich</td>
<td></td>
<td>http://www.fh-hwz.ch/en</td>
<td>215</td>
<td>1986</td>
<td>0</td>
<td>HWZ</td>
<td></td>
<td>0000 0001 0008 3713</td>
<td></td>
<td>30805829</td>
<td>Q1488771</td>
<td>grid.449909.9</td>
</tr>
<tr>
<td>43</td>
<td>https://ror.org/01awgk221</td>
<td>Zurich University of Teacher Education (PHZH)</td>
<td></td>
<td></td>
<td>Pädagogische Hochschule Zürich</td>
<td></td>
<td>https://phzh.ch/en/</td>
<td>215</td>
<td>2002</td>
<td>0</td>
<td>PHZH</td>
<td>PH Zürich</td>
<td>0000 0000 9666 1858</td>
<td></td>
<td></td>
<td></td>
<td>grid.483054.e</td>
</tr>
<tr>
<td>44</td>
<td>https://ror.org/05r0ap620</td>
<td>Zurich University of the Arts</td>
<td></td>
<td>Haute École d'Art de Zurich</td>
<td>Zürcher Hochschule der Künste</td>
<td></td>
<td>https://www.zhdk.ch/</td>
<td>215</td>
<td>2007</td>
<td>0</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>39250592</td>
<td>Q222450</td>
<td>grid.449912.3</td>
</tr>
<tr>
<td>45</td>
<td>https://ror.org/02s376052</td>
<td>École Polytechnique Fédérale de Lausanne (EPFL)</td>
<td>Swiss Federal Institute of Technology in Lausanne</td>
<td></td>
<td></td>
<td></td>
<td>http://www.epfl.ch/index.en.html</td>
<td>215</td>
<td>1853</td>
<td>0</td>
<td>EPFL</td>
<td></td>
<td>0000000121839049</td>
<td>501100001703</td>
<td>71968</td>
<td>Q262760</td>
<td>grid.5333.6</td>
</tr>
</tbody>
</table>
</div>
```python
# mettre l'EPFL en position 1 et UNIGE en 2
target_row = 32
# Move target row to first element of list.
idx = [target_row] + [i for i in range(len(organization)) if i != target_row]
organization = organization.iloc[idx]
organization
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>ror</th>
<th>name</th>
<th>label_en</th>
<th>label_fr</th>
<th>label_de</th>
<th>label_it</th>
<th>website</th>
<th>country</th>
<th>starting_year</th>
<th>is_funder</th>
<th>acronym</th>
<th>aliases</th>
<th>isni</th>
<th>fundref</th>
<th>orgref</th>
<th>wikidata</th>
<th>grid</th>
</tr>
</thead>
<tbody>
<tr>
<td>32</td>
<td>https://ror.org/01swzsf04</td>
<td>University of Geneva (UNIGE)</td>
<td></td>
<td>Université de Genève</td>
<td></td>
<td>Università di Ginevra</td>
<td>https://www.unige.ch/</td>
<td>215</td>
<td>1559</td>
<td>0</td>
<td>UNIGE</td>
<td>Schola Genevensis</td>
<td>0000 0001 2322 4988</td>
<td>501100006389</td>
<td>342348</td>
<td>Q503473</td>
<td>grid.8591.5</td>
</tr>
<tr>
<td>0</td>
<td>https://ror.org/04d8ztx87</td>
<td>Agroscope</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>https://www.agroscope.admin.ch/agroscope/en/ho...</td>
<td>215</td>
<td>1850</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0004 4681 910X</td>
<td></td>
<td></td>
<td>Q397466</td>
<td>grid.417771.3</td>
</tr>
<tr>
<td>1</td>
<td>https://ror.org/02bnkt322</td>
<td>Bern University of Applied Sciences (BFH)</td>
<td></td>
<td>Haute école spécialisée bernoise</td>
<td>Berner Fachhochschule</td>
<td></td>
<td>http://www.bfh.ch/en/home.html</td>
<td>215</td>
<td>1997</td>
<td>0</td>
<td>BFH</td>
<td></td>
<td>0000 0001 0688 6779</td>
<td>501100006259</td>
<td>4365265</td>
<td>Q466455</td>
<td>grid.424060.4</td>
</tr>
<tr>
<td>2</td>
<td>https://ror.org/05a28rw58</td>
<td>ETH Zurich (ETH Zurich)</td>
<td></td>
<td>École Polytechnique Fédérale de Zurich</td>
<td>Eidgenössische Technische Hochschule Zürich</td>
<td>Politecnico federale di Zurigo</td>
<td>https://www.ethz.ch/en.html</td>
<td>215</td>
<td>1855</td>
<td>0</td>
<td>ETH Zurich</td>
<td>Swiss Federal Institute of Technology in Zuric...</td>
<td>0000 0001 2156 2780</td>
<td>501100003006</td>
<td>210910</td>
<td>Q11942</td>
<td>grid.5801.c</td>
</tr>
<tr>
<td>3</td>
<td>https://ror.org/01ggx4157</td>
<td>European Organization for Nuclear Research (CERN)</td>
<td></td>
<td>Organisation européenne pour la recherche nucl...</td>
<td>Europäische Organisation für Kernforschung</td>
<td></td>
<td>http://home.web.cern.ch/</td>
<td>215</td>
<td>1954</td>
<td>0</td>
<td>CERN</td>
<td></td>
<td>0000 0001 2156 142X</td>
<td></td>
<td>37351</td>
<td>Q42944</td>
<td>grid.9132.9</td>
</tr>
<tr>
<td>4</td>
<td>https://ror.org/0210tb741</td>
<td>Forschungsinstitut für biologischen Landbau (F...</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>https://www.fibl.org/en/germany/location-de.html</td>
<td>215</td>
<td></td>
<td>0</td>
<td>FiBL</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>grid.506220.3</td>
</tr>
<tr>
<td>5</td>
<td>https://ror.org/007ygn379</td>
<td>Graduate Institute of International and Develo...</td>
<td></td>
<td>Institut de Hautes études Internationales et d...</td>
<td>Hochschulinstitut für internationale Studien u...</td>
<td></td>
<td>http://graduateinstitute.ch/home.html</td>
<td>215</td>
<td>1927</td>
<td>0</td>
<td>IHEID</td>
<td>Graduate Institute Geneva</td>
<td>0000 0001 2296 9873</td>
<td></td>
<td>14744053</td>
<td>Q691686</td>
<td>grid.424404.2</td>
</tr>
<tr>
<td>6</td>
<td>https://ror.org/015pmkr43</td>
<td>Haute École Pédagogique BEJUNE (HEP BEJUNE)</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.hep-bejune.ch/</td>
<td>215</td>
<td>2001</td>
<td>0</td>
<td>HEP BEJUNE</td>
<td></td>
<td>0000 0001 0658 3479</td>
<td></td>
<td></td>
<td></td>
<td>grid.469449.2</td>
</tr>
<tr>
<td>7</td>
<td>https://ror.org/048gre751</td>
<td>Haute École Pédagogique Fribourg (HEP-PH FR)</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>https://www.hepfr.ch/</td>
<td>215</td>
<td>1990</td>
<td>0</td>
<td>HEP-PH FR</td>
<td></td>
<td>0000 0001 0266 4909</td>
<td></td>
<td></td>
<td></td>
<td>grid.469451.b</td>
</tr>
<tr>
<td>8</td>
<td>https://ror.org/01bvm0h13</td>
<td>Haute École Pédagogique du Canton de Vaud (HEP...</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.hepl.ch/cms/accueil.html</td>
<td>215</td>
<td>2001</td>
<td>0</td>
<td>HEP Vaud</td>
<td></td>
<td>0000 0004 0613 4050</td>
<td></td>
<td></td>
<td></td>
<td>grid.466224.0</td>
</tr>
<tr>
<td>9</td>
<td>https://ror.org/049c2kr37</td>
<td>Kalaidos University of Applied Sciences (Kalai...</td>
<td></td>
<td></td>
<td>Kalaidos Fachhochschule</td>
<td></td>
<td>https://www.kalaidos-fh.ch/de-CH</td>
<td>215</td>
<td>1995</td>
<td>0</td>
<td>Kalaidos UAS</td>
<td></td>
<td>0000 0004 0453 9054</td>
<td></td>
<td>6746630</td>
<td>Q681372</td>
<td>grid.449532.d</td>
</tr>
<tr>
<td>10</td>
<td>https://ror.org/021f7p178</td>
<td>Lib4RI - Library for the Research Institutes w...</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.lib4ri.ch/</td>
<td>215</td>
<td>2011</td>
<td>0</td>
<td></td>
<td>Lib4RI</td>
<td>0000 0004 0624 8541</td>
<td></td>
<td></td>
<td>Q1278450</td>
<td>grid.458352.d</td>
</tr>
<tr>
<td>11</td>
<td>https://ror.org/04nd0xd48</td>
<td>Lucerne University of Applied Sciences and Arts</td>
<td></td>
<td>Haute École de lucerne</td>
<td>Hochschule Luzern</td>
<td></td>
<td>https://www.hslu.ch/en/</td>
<td>215</td>
<td>1997</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0001 2191 8943</td>
<td></td>
<td>19480920</td>
<td>Q664028</td>
<td>grid.425064.1</td>
</tr>
<tr>
<td>12</td>
<td>https://ror.org/00p9jf779</td>
<td>Medicines for Malaria Venture (MMV)</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.mmv.org/</td>
<td>215</td>
<td>1999</td>
<td>0</td>
<td>MMV</td>
<td></td>
<td>0000 0004 0432 5267</td>
<td>501100004167</td>
<td></td>
<td>Q6806774</td>
<td>grid.452605.0</td>
</tr>
<tr>
<td>13</td>
<td>https://ror.org/038mj2660</td>
<td>Ostschweizer Fachhochschule OST</td>
<td>Eastern Switzerland University of Applied Scie...</td>
<td></td>
<td></td>
<td></td>
<td>https://www.ost.ch/</td>
<td>215</td>
<td>1999</td>
<td>0</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>grid.510272.3</td>
</tr>
<tr>
<td>14</td>
<td>https://ror.org/05jf1ma54</td>
<td>Pädagogische Hochschule Bern</td>
<td>Bern University of Teacher Education</td>
<td></td>
<td></td>
<td></td>
<td>https://www.phbern.ch</td>
<td>215</td>
<td>2005</td>
<td>0</td>
<td></td>
<td>PHBern</td>
<td>0000 0000 8585 5665</td>
<td></td>
<td></td>
<td></td>
<td>grid.454333.6</td>
</tr>
<tr>
<td>15</td>
<td>https://ror.org/02fjgft97</td>
<td>Pädagogische Hochschule Graubünden (PHGR)</td>
<td></td>
<td></td>
<td></td>
<td>Alta scuola pedagogica dei Grigioni</td>
<td>http://www.phgr.ch/</td>
<td>215</td>
<td></td>
<td>0</td>
<td>PHGR</td>
<td></td>
<td>0000 0000 9317 283X</td>
<td></td>
<td></td>
<td></td>
<td>grid.469478.0</td>
</tr>
<tr>
<td>16</td>
<td>https://ror.org/03fs41j10</td>
<td>Pädagogische Hochschule Schaffhausen (PHSH)</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.phsh.ch/</td>
<td>215</td>
<td>2003</td>
<td>0</td>
<td>PHSH</td>
<td></td>
<td>0000 0004 0450 7546</td>
<td></td>
<td></td>
<td></td>
<td>grid.466133.5</td>
</tr>
<tr>
<td>17</td>
<td>https://ror.org/04bf6dq94</td>
<td>Pädagogische Hochschule Thurgau (PHTG)</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.phtg.ch/home/</td>
<td>215</td>
<td>2003</td>
<td>0</td>
<td>PHTG</td>
<td></td>
<td>0000 0004 0613 3824</td>
<td></td>
<td></td>
<td></td>
<td>grid.466322.7</td>
</tr>
<tr>
<td>18</td>
<td>https://ror.org/040gs8e06</td>
<td>Pädagogische Hochschule Wallis (PH-VS)</td>
<td></td>
<td>Haute École Pédagogique du Valais</td>
<td></td>
<td></td>
<td>http://www.hepvs.ch/de</td>
<td>215</td>
<td>2000</td>
<td>0</td>
<td>PH-VS</td>
<td></td>
<td>0000 0001 2178 3217</td>
<td></td>
<td></td>
<td></td>
<td>grid.466216.1</td>
</tr>
<tr>
<td>19</td>
<td>https://ror.org/00rqdn375</td>
<td>Schwyz University of Teacher Education (PHSZ)</td>
<td></td>
<td></td>
<td>Pädagogische Hochschule Schwyz</td>
<td></td>
<td>https://www.phsz.ch/en/</td>
<td>215</td>
<td></td>
<td>0</td>
<td>PHSZ</td>
<td>PHZ Schwyz</td>
<td>0000 0004 0613 7454</td>
<td></td>
<td></td>
<td></td>
<td>grid.466169.a</td>
</tr>
<tr>
<td>20</td>
<td>https://ror.org/05m37v666</td>
<td>St.Gallen University of Teacher Education (PHSG)</td>
<td></td>
<td></td>
<td>Pädagogische Hochschule St. Gallen</td>
<td></td>
<td>https://www.phsg.ch/en</td>
<td>215</td>
<td>2007</td>
<td>0</td>
<td>PHSG</td>
<td></td>
<td>0000 0001 0271 5139</td>
<td></td>
<td></td>
<td>Q1768652</td>
<td>grid.466208.e</td>
</tr>
<tr>
<td>21</td>
<td>https://ror.org/00zg4za48</td>
<td>Swiss Federal Institute for Vocational Educati...</td>
<td></td>
<td>Institut Fédéral des Hautes Études en Formatio...</td>
<td>Eidgenössisches Hochschulinstitut für Berufsbi...</td>
<td></td>
<td>http://www.ehb-schweiz.ch/en/</td>
<td>215</td>
<td>2007</td>
<td>0</td>
<td>SFIVET</td>
<td></td>
<td>0000 0001 2285 5681</td>
<td></td>
<td></td>
<td>Q1302632</td>
<td>grid.466173.1</td>
</tr>
<tr>
<td>22</td>
<td>https://ror.org/03mcsbr76</td>
<td>Swiss Ornithological Institute</td>
<td></td>
<td></td>
<td>Schweizerische Vogelwarte</td>
<td></td>
<td>http://www.vogelwarte.ch/de/home/</td>
<td>215</td>
<td>1924</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0001 1512 3677</td>
<td></td>
<td></td>
<td>Q663638</td>
<td>grid.419767.a</td>
</tr>
<tr>
<td>23</td>
<td>https://ror.org/03c4atk17</td>
<td>Universita della Svizzera Italiana (USI)</td>
<td>University of Italian Switzerland</td>
<td>Université de la suisse italienne</td>
<td></td>
<td>Università della Svizzera italiana</td>
<td>http://www.usi.ch/en/index.htm</td>
<td>215</td>
<td>1996</td>
<td>0</td>
<td>USI</td>
<td></td>
<td>0000 0001 2203 2861</td>
<td></td>
<td>2290642</td>
<td>Q689617</td>
<td>grid.29078.34</td>
</tr>
<tr>
<td>24</td>
<td>https://ror.org/04mq2g308</td>
<td>University of Applied Sciences and Arts Northw...</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.fhnw.ch/homepage</td>
<td>215</td>
<td>2006</td>
<td>0</td>
<td>FHNW</td>
<td>Fachhochschule Nordwestschweiz</td>
<td>0000 0001 1497 8091</td>
<td></td>
<td></td>
<td></td>
<td>grid.410380.e</td>
</tr>
<tr>
<td>25</td>
<td>https://ror.org/01xkakk17</td>
<td>University of Applied Sciences and Arts Wester...</td>
<td></td>
<td>Haute École Spécialisée de Suisse Occidentale</td>
<td>Fachhochschule Westschweiz</td>
<td></td>
<td>http://www.hes-so.ch/en/homepage-hes-so-1679.html</td>
<td>215</td>
<td>1998</td>
<td>0</td>
<td>HES-SO</td>
<td></td>
<td>0000 0001 0943 1999</td>
<td></td>
<td>10128956</td>
<td>Q168003</td>
<td>grid.5681.a</td>
</tr>
<tr>
<td>26</td>
<td>https://ror.org/05ep8g269</td>
<td>University of Applied Sciences and Arts of Sou...</td>
<td></td>
<td></td>
<td></td>
<td>Scuola Universitaria Professionale della Svizz...</td>
<td>http://www.supsi.ch/home_en.html</td>
<td>215</td>
<td>1997</td>
<td>0</td>
<td>SUPSI</td>
<td></td>
<td>0000000123252233</td>
<td></td>
<td>34066841</td>
<td>Q663984</td>
<td>grid.16058.3a</td>
</tr>
<tr>
<td>27</td>
<td>https://ror.org/00w9q2c06</td>
<td>University of Applied Sciences of Special Need...</td>
<td></td>
<td></td>
<td>Interkantonale Hochschule für Heilpädagogik</td>
<td></td>
<td>http://www.hfh.ch/en/</td>
<td>215</td>
<td>1924</td>
<td>0</td>
<td>HfH</td>
<td>Zurich Training College for Teachers of Specia...</td>
<td>0000 0001 0710 6332</td>
<td></td>
<td></td>
<td></td>
<td>grid.466279.8</td>
</tr>
<tr>
<td>28</td>
<td>https://ror.org/032ymzc07</td>
<td>University of Applied Sciences of the Grisons</td>
<td></td>
<td></td>
<td>Fachhochschule Graubünden</td>
<td></td>
<td>https://www.fhgr.ch/en/</td>
<td>215</td>
<td>1963</td>
<td>0</td>
<td></td>
<td>Hochschule für Technik und Wirtschaft Chur</td>
<td>0000 0000 8718 2812</td>
<td></td>
<td></td>
<td>Q1622220</td>
<td>grid.460104.7</td>
</tr>
<tr>
<td>29</td>
<td>https://ror.org/02s6k3f65</td>
<td>University of Basel</td>
<td></td>
<td>Université de bâle</td>
<td>Universität Basel</td>
<td>Università di Basilea</td>
<td>https://www.unibas.ch/de</td>
<td>215</td>
<td>1460</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0004 1937 0642</td>
<td>100008375</td>
<td>427614</td>
<td>Q372608</td>
<td>grid.6612.3</td>
</tr>
<tr>
<td>30</td>
<td>https://ror.org/02k7v4d05</td>
<td>University of Bern (UB)</td>
<td></td>
<td>Université de Berne</td>
<td>Universität Bern</td>
<td>Università di Berna</td>
<td>http://www.unibe.ch/eng/</td>
<td>215</td>
<td>1834</td>
<td>0</td>
<td>UB</td>
<td></td>
<td>0000 0001 0726 5157</td>
<td>100009068</td>
<td>1157515</td>
<td>Q659080</td>
<td>grid.5734.5</td>
</tr>
<tr>
<td>31</td>
<td>https://ror.org/022fs9h90</td>
<td>University of Fribourg</td>
<td></td>
<td>Université de Fribourg</td>
<td>Universität Freiburg</td>
<td>Università di Friburgo</td>
<td>http://www.unifr.ch/home/welcomeE.php</td>
<td>215</td>
<td>1889</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0004 0478 1713</td>
<td>501100005869</td>
<td>535267</td>
<td>Q36188</td>
<td>grid.8534.a</td>
</tr>
<tr>
<td>33</td>
<td>https://ror.org/019whta54</td>
<td>University of Lausanne (UNIL)</td>
<td></td>
<td>Université de Lausanne</td>
<td>Universität Lausanne</td>
<td>Università di Losanna</td>
<td>http://www.unil.ch/central/en/home.html</td>
<td>215</td>
<td>1537</td>
<td>0</td>
<td>UNIL</td>
<td>Schola Lausannensis</td>
<td>0000 0001 2165 4204</td>
<td>501100006390</td>
<td>79810</td>
<td>Q658975</td>
<td>grid.9851.5</td>
</tr>
<tr>
<td>34</td>
<td>https://ror.org/01qjrx392</td>
<td>University of Liechtenstein</td>
<td></td>
<td></td>
<td>Universität Liechtenstein</td>
<td></td>
<td>https://www.uni.li/study/de/</td>
<td>128</td>
<td>1961</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0001 2227 4668</td>
<td></td>
<td>10554064</td>
<td>Q974328</td>
<td>grid.445905.9</td>
</tr>
<tr>
<td>35</td>
<td>https://ror.org/00kgrkn83</td>
<td>University of Lucerne (UNILU)</td>
<td></td>
<td>Université de lucerne</td>
<td>Universität Luzern</td>
<td>Università di Lucerna</td>
<td>http://www.unilu.ch/</td>
<td>215</td>
<td>2000</td>
<td>0</td>
<td>UNILU</td>
<td></td>
<td>0000 0001 1456 7938</td>
<td></td>
<td>21004764</td>
<td>Q673308</td>
<td>grid.449852.6</td>
</tr>
<tr>
<td>36</td>
<td>https://ror.org/00vasag41</td>
<td>University of Neuchâtel</td>
<td></td>
<td>Université de neuchâtel</td>
<td>Universität Neuenburg</td>
<td></td>
<td>http://www2.unine.ch/</td>
<td>215</td>
<td>1838</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0001 2297 7718</td>
<td>501100005353</td>
<td>3662101</td>
<td>Q541548</td>
<td>grid.10711.36</td>
</tr>
<tr>
<td>37</td>
<td>https://ror.org/0561a3s31</td>
<td>University of St. Gallen (HSG)</td>
<td></td>
<td>Université de saint-gall</td>
<td>Universität St. Gallen</td>
<td>Università di San Gallo</td>
<td>http://www.es.unisg.ch/en/</td>
<td>215</td>
<td>1898</td>
<td>0</td>
<td>HSG</td>
<td></td>
<td>0000 0001 2156 6618</td>
<td>100009572</td>
<td>751473</td>
<td>Q673354</td>
<td>grid.15775.31</td>
</tr>
<tr>
<td>38</td>
<td>https://ror.org/0235ynq74</td>
<td>University of Teacher Education Lucerne</td>
<td></td>
<td></td>
<td>Pädagogische Hochschule Luzern</td>
<td></td>
<td>http://www.phlu.ch/ute-lucerne/</td>
<td>215</td>
<td>2003</td>
<td>0</td>
<td></td>
<td>PH Luzern</td>
<td>0000 0001 0348 1637</td>
<td></td>
<td></td>
<td></td>
<td>grid.465965.d</td>
</tr>
<tr>
<td>39</td>
<td>https://ror.org/05ghhx264</td>
<td>University of Teacher Education Zug (PH Zug)</td>
<td></td>
<td></td>
<td>Pädagogische Hochschule Zug</td>
<td></td>
<td>https://www.zg.ch/behoerden/direktion-fur-bild...</td>
<td>215</td>
<td>2013</td>
<td>0</td>
<td>PH Zug</td>
<td></td>
<td>0000 0004 0449 2225</td>
<td></td>
<td></td>
<td></td>
<td>grid.466274.5</td>
</tr>
<tr>
<td>40</td>
<td>https://ror.org/02crff812</td>
<td>University of Zurich (UZH)</td>
<td></td>
<td>Université de zurich</td>
<td>Universität Zürich</td>
<td>Università di Zurigo</td>
<td>http://www.uzh.ch/index_en.html</td>
<td>215</td>
<td>1833</td>
<td>0</td>
<td>UZH</td>
<td></td>
<td>0000 0004 1937 0650</td>
<td>501100006447</td>
<td>314803</td>
<td>Q206702</td>
<td>grid.7400.3</td>
</tr>
<tr>
<td>41</td>
<td>https://ror.org/05pmsvm27</td>
<td>Zurich University of Applied Sciences (ZHAW)</td>
<td></td>
<td></td>
<td>Zürcher Hochschule für Angewandte Wissenschaften</td>
<td></td>
<td>https://www.zhaw.ch/en/university/</td>
<td>215</td>
<td>2007</td>
<td>0</td>
<td>ZHAW</td>
<td></td>
<td>0000000122291644</td>
<td></td>
<td>30930550</td>
<td>Q2605554</td>
<td>grid.19739.35</td>
</tr>
<tr>
<td>42</td>
<td>https://ror.org/02ejkey04</td>
<td>Zurich University of Applied Sciences in Busin...</td>
<td></td>
<td></td>
<td>Hochschule für Wirtschaft Zürich</td>
<td></td>
<td>http://www.fh-hwz.ch/en</td>
<td>215</td>
<td>1986</td>
<td>0</td>
<td>HWZ</td>
<td></td>
<td>0000 0001 0008 3713</td>
<td></td>
<td>30805829</td>
<td>Q1488771</td>
<td>grid.449909.9</td>
</tr>
<tr>
<td>43</td>
<td>https://ror.org/01awgk221</td>
<td>Zurich University of Teacher Education (PHZH)</td>
<td></td>
<td></td>
<td>Pädagogische Hochschule Zürich</td>
<td></td>
<td>https://phzh.ch/en/</td>
<td>215</td>
<td>2002</td>
<td>0</td>
<td>PHZH</td>
<td>PH Zürich</td>
<td>0000 0000 9666 1858</td>
<td></td>
<td></td>
<td></td>
<td>grid.483054.e</td>
</tr>
<tr>
<td>44</td>
<td>https://ror.org/05r0ap620</td>
<td>Zurich University of the Arts</td>
<td></td>
<td>Haute École d'Art de Zurich</td>
<td>Zürcher Hochschule der Künste</td>
<td></td>
<td>https://www.zhdk.ch/</td>
<td>215</td>
<td>2007</td>
<td>0</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>39250592</td>
<td>Q222450</td>
<td>grid.449912.3</td>
</tr>
<tr>
<td>45</td>
<td>https://ror.org/02s376052</td>
<td>École Polytechnique Fédérale de Lausanne (EPFL)</td>
<td>Swiss Federal Institute of Technology in Lausanne</td>
<td></td>
<td></td>
<td></td>
<td>http://www.epfl.ch/index.en.html</td>
<td>215</td>
<td>1853</td>
<td>0</td>
<td>EPFL</td>
<td></td>
<td>0000000121839049</td>
<td>501100001703</td>
<td>71968</td>
<td>Q262760</td>
<td>grid.5333.6</td>
</tr>
</tbody>
</table>
</div>
```python
organization = organization.reset_index(drop=True)
organization
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>ror</th>
<th>name</th>
<th>label_en</th>
<th>label_fr</th>
<th>label_de</th>
<th>label_it</th>
<th>website</th>
<th>country</th>
<th>starting_year</th>
<th>is_funder</th>
<th>acronym</th>
<th>aliases</th>
<th>isni</th>
<th>fundref</th>
<th>orgref</th>
<th>wikidata</th>
<th>grid</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>https://ror.org/01swzsf04</td>
<td>University of Geneva (UNIGE)</td>
<td></td>
<td>Université de Genève</td>
<td></td>
<td>Università di Ginevra</td>
<td>https://www.unige.ch/</td>
<td>215</td>
<td>1559</td>
<td>0</td>
<td>UNIGE</td>
<td>Schola Genevensis</td>
<td>0000 0001 2322 4988</td>
<td>501100006389</td>
<td>342348</td>
<td>Q503473</td>
<td>grid.8591.5</td>
</tr>
<tr>
<td>1</td>
<td>https://ror.org/04d8ztx87</td>
<td>Agroscope</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>https://www.agroscope.admin.ch/agroscope/en/ho...</td>
<td>215</td>
<td>1850</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0004 4681 910X</td>
<td></td>
<td></td>
<td>Q397466</td>
<td>grid.417771.3</td>
</tr>
<tr>
<td>2</td>
<td>https://ror.org/02bnkt322</td>
<td>Bern University of Applied Sciences (BFH)</td>
<td></td>
<td>Haute école spécialisée bernoise</td>
<td>Berner Fachhochschule</td>
<td></td>
<td>http://www.bfh.ch/en/home.html</td>
<td>215</td>
<td>1997</td>
<td>0</td>
<td>BFH</td>
<td></td>
<td>0000 0001 0688 6779</td>
<td>501100006259</td>
<td>4365265</td>
<td>Q466455</td>
<td>grid.424060.4</td>
</tr>
<tr>
<td>3</td>
<td>https://ror.org/05a28rw58</td>
<td>ETH Zurich (ETH Zurich)</td>
<td></td>
<td>École Polytechnique Fédérale de Zurich</td>
<td>Eidgenössische Technische Hochschule Zürich</td>
<td>Politecnico federale di Zurigo</td>
<td>https://www.ethz.ch/en.html</td>
<td>215</td>
<td>1855</td>
<td>0</td>
<td>ETH Zurich</td>
<td>Swiss Federal Institute of Technology in Zuric...</td>
<td>0000 0001 2156 2780</td>
<td>501100003006</td>
<td>210910</td>
<td>Q11942</td>
<td>grid.5801.c</td>
</tr>
<tr>
<td>4</td>
<td>https://ror.org/01ggx4157</td>
<td>European Organization for Nuclear Research (CERN)</td>
<td></td>
<td>Organisation européenne pour la recherche nucl...</td>
<td>Europäische Organisation für Kernforschung</td>
<td></td>
<td>http://home.web.cern.ch/</td>
<td>215</td>
<td>1954</td>
<td>0</td>
<td>CERN</td>
<td></td>
<td>0000 0001 2156 142X</td>
<td></td>
<td>37351</td>
<td>Q42944</td>
<td>grid.9132.9</td>
</tr>
<tr>
<td>5</td>
<td>https://ror.org/0210tb741</td>
<td>Forschungsinstitut für biologischen Landbau (F...</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>https://www.fibl.org/en/germany/location-de.html</td>
<td>215</td>
<td></td>
<td>0</td>
<td>FiBL</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>grid.506220.3</td>
</tr>
<tr>
<td>6</td>
<td>https://ror.org/007ygn379</td>
<td>Graduate Institute of International and Develo...</td>
<td></td>
<td>Institut de Hautes études Internationales et d...</td>
<td>Hochschulinstitut für internationale Studien u...</td>
<td></td>
<td>http://graduateinstitute.ch/home.html</td>
<td>215</td>
<td>1927</td>
<td>0</td>
<td>IHEID</td>
<td>Graduate Institute Geneva</td>
<td>0000 0001 2296 9873</td>
<td></td>
<td>14744053</td>
<td>Q691686</td>
<td>grid.424404.2</td>
</tr>
<tr>
<td>7</td>
<td>https://ror.org/015pmkr43</td>
<td>Haute École Pédagogique BEJUNE (HEP BEJUNE)</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.hep-bejune.ch/</td>
<td>215</td>
<td>2001</td>
<td>0</td>
<td>HEP BEJUNE</td>
<td></td>
<td>0000 0001 0658 3479</td>
<td></td>
<td></td>
<td></td>
<td>grid.469449.2</td>
</tr>
<tr>
<td>8</td>
<td>https://ror.org/048gre751</td>
<td>Haute École Pédagogique Fribourg (HEP-PH FR)</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>https://www.hepfr.ch/</td>
<td>215</td>
<td>1990</td>
<td>0</td>
<td>HEP-PH FR</td>
<td></td>
<td>0000 0001 0266 4909</td>
<td></td>
<td></td>
<td></td>
<td>grid.469451.b</td>
</tr>
<tr>
<td>9</td>
<td>https://ror.org/01bvm0h13</td>
<td>Haute École Pédagogique du Canton de Vaud (HEP...</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.hepl.ch/cms/accueil.html</td>
<td>215</td>
<td>2001</td>
<td>0</td>
<td>HEP Vaud</td>
<td></td>
<td>0000 0004 0613 4050</td>
<td></td>
<td></td>
<td></td>
<td>grid.466224.0</td>
</tr>
<tr>
<td>10</td>
<td>https://ror.org/049c2kr37</td>
<td>Kalaidos University of Applied Sciences (Kalai...</td>
<td></td>
<td></td>
<td>Kalaidos Fachhochschule</td>
<td></td>
<td>https://www.kalaidos-fh.ch/de-CH</td>
<td>215</td>
<td>1995</td>
<td>0</td>
<td>Kalaidos UAS</td>
<td></td>
<td>0000 0004 0453 9054</td>
<td></td>
<td>6746630</td>
<td>Q681372</td>
<td>grid.449532.d</td>
</tr>
<tr>
<td>11</td>
<td>https://ror.org/021f7p178</td>
<td>Lib4RI - Library for the Research Institutes w...</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.lib4ri.ch/</td>
<td>215</td>
<td>2011</td>
<td>0</td>
<td></td>
<td>Lib4RI</td>
<td>0000 0004 0624 8541</td>
<td></td>
<td></td>
<td>Q1278450</td>
<td>grid.458352.d</td>
</tr>
<tr>
<td>12</td>
<td>https://ror.org/04nd0xd48</td>
<td>Lucerne University of Applied Sciences and Arts</td>
<td></td>
<td>Haute École de lucerne</td>
<td>Hochschule Luzern</td>
<td></td>
<td>https://www.hslu.ch/en/</td>
<td>215</td>
<td>1997</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0001 2191 8943</td>
<td></td>
<td>19480920</td>
<td>Q664028</td>
<td>grid.425064.1</td>
</tr>
<tr>
<td>13</td>
<td>https://ror.org/00p9jf779</td>
<td>Medicines for Malaria Venture (MMV)</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.mmv.org/</td>
<td>215</td>
<td>1999</td>
<td>0</td>
<td>MMV</td>
<td></td>
<td>0000 0004 0432 5267</td>
<td>501100004167</td>
<td></td>
<td>Q6806774</td>
<td>grid.452605.0</td>
</tr>
<tr>
<td>14</td>
<td>https://ror.org/038mj2660</td>
<td>Ostschweizer Fachhochschule OST</td>
<td>Eastern Switzerland University of Applied Scie...</td>
<td></td>
<td></td>
<td></td>
<td>https://www.ost.ch/</td>
<td>215</td>
<td>1999</td>
<td>0</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>grid.510272.3</td>
</tr>
<tr>
<td>15</td>
<td>https://ror.org/05jf1ma54</td>
<td>Pädagogische Hochschule Bern</td>
<td>Bern University of Teacher Education</td>
<td></td>
<td></td>
<td></td>
<td>https://www.phbern.ch</td>
<td>215</td>
<td>2005</td>
<td>0</td>
<td></td>
<td>PHBern</td>
<td>0000 0000 8585 5665</td>
<td></td>
<td></td>
<td></td>
<td>grid.454333.6</td>
</tr>
<tr>
<td>16</td>
<td>https://ror.org/02fjgft97</td>
<td>Pädagogische Hochschule Graubünden (PHGR)</td>
<td></td>
<td></td>
<td></td>
<td>Alta scuola pedagogica dei Grigioni</td>
<td>http://www.phgr.ch/</td>
<td>215</td>
<td></td>
<td>0</td>
<td>PHGR</td>
<td></td>
<td>0000 0000 9317 283X</td>
<td></td>
<td></td>
<td></td>
<td>grid.469478.0</td>
</tr>
<tr>
<td>17</td>
<td>https://ror.org/03fs41j10</td>
<td>Pädagogische Hochschule Schaffhausen (PHSH)</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.phsh.ch/</td>
<td>215</td>
<td>2003</td>
<td>0</td>
<td>PHSH</td>
<td></td>
<td>0000 0004 0450 7546</td>
<td></td>
<td></td>
<td></td>
<td>grid.466133.5</td>
</tr>
<tr>
<td>18</td>
<td>https://ror.org/04bf6dq94</td>
<td>Pädagogische Hochschule Thurgau (PHTG)</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.phtg.ch/home/</td>
<td>215</td>
<td>2003</td>
<td>0</td>
<td>PHTG</td>
<td></td>
<td>0000 0004 0613 3824</td>
<td></td>
<td></td>
<td></td>
<td>grid.466322.7</td>
</tr>
<tr>
<td>19</td>
<td>https://ror.org/040gs8e06</td>
<td>Pädagogische Hochschule Wallis (PH-VS)</td>
<td></td>
<td>Haute École Pédagogique du Valais</td>
<td></td>
<td></td>
<td>http://www.hepvs.ch/de</td>
<td>215</td>
<td>2000</td>
<td>0</td>
<td>PH-VS</td>
<td></td>
<td>0000 0001 2178 3217</td>
<td></td>
<td></td>
<td></td>
<td>grid.466216.1</td>
</tr>
<tr>
<td>20</td>
<td>https://ror.org/00rqdn375</td>
<td>Schwyz University of Teacher Education (PHSZ)</td>
<td></td>
<td></td>
<td>Pädagogische Hochschule Schwyz</td>
<td></td>
<td>https://www.phsz.ch/en/</td>
<td>215</td>
<td></td>
<td>0</td>
<td>PHSZ</td>
<td>PHZ Schwyz</td>
<td>0000 0004 0613 7454</td>
<td></td>
<td></td>
<td></td>
<td>grid.466169.a</td>
</tr>
<tr>
<td>21</td>
<td>https://ror.org/05m37v666</td>
<td>St.Gallen University of Teacher Education (PHSG)</td>
<td></td>
<td></td>
<td>Pädagogische Hochschule St. Gallen</td>
<td></td>
<td>https://www.phsg.ch/en</td>
<td>215</td>
<td>2007</td>
<td>0</td>
<td>PHSG</td>
<td></td>
<td>0000 0001 0271 5139</td>
<td></td>
<td></td>
<td>Q1768652</td>
<td>grid.466208.e</td>
</tr>
<tr>
<td>22</td>
<td>https://ror.org/00zg4za48</td>
<td>Swiss Federal Institute for Vocational Educati...</td>
<td></td>
<td>Institut Fédéral des Hautes Études en Formatio...</td>
<td>Eidgenössisches Hochschulinstitut für Berufsbi...</td>
<td></td>
<td>http://www.ehb-schweiz.ch/en/</td>
<td>215</td>
<td>2007</td>
<td>0</td>
<td>SFIVET</td>
<td></td>
<td>0000 0001 2285 5681</td>
<td></td>
<td></td>
<td>Q1302632</td>
<td>grid.466173.1</td>
</tr>
<tr>
<td>23</td>
<td>https://ror.org/03mcsbr76</td>
<td>Swiss Ornithological Institute</td>
<td></td>
<td></td>
<td>Schweizerische Vogelwarte</td>
<td></td>
<td>http://www.vogelwarte.ch/de/home/</td>
<td>215</td>
<td>1924</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0001 1512 3677</td>
<td></td>
<td></td>
<td>Q663638</td>
<td>grid.419767.a</td>
</tr>
<tr>
<td>24</td>
<td>https://ror.org/03c4atk17</td>
<td>Universita della Svizzera Italiana (USI)</td>
<td>University of Italian Switzerland</td>
<td>Université de la suisse italienne</td>
<td></td>
<td>Università della Svizzera italiana</td>
<td>http://www.usi.ch/en/index.htm</td>
<td>215</td>
<td>1996</td>
<td>0</td>
<td>USI</td>
<td></td>
<td>0000 0001 2203 2861</td>
<td></td>
<td>2290642</td>
<td>Q689617</td>
<td>grid.29078.34</td>
</tr>
<tr>
<td>25</td>
<td>https://ror.org/04mq2g308</td>
<td>University of Applied Sciences and Arts Northw...</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.fhnw.ch/homepage</td>
<td>215</td>
<td>2006</td>
<td>0</td>
<td>FHNW</td>
<td>Fachhochschule Nordwestschweiz</td>
<td>0000 0001 1497 8091</td>
<td></td>
<td></td>
<td></td>
<td>grid.410380.e</td>
</tr>
<tr>
<td>26</td>
<td>https://ror.org/01xkakk17</td>
<td>University of Applied Sciences and Arts Wester...</td>
<td></td>
<td>Haute École Spécialisée de Suisse Occidentale</td>
<td>Fachhochschule Westschweiz</td>
<td></td>
<td>http://www.hes-so.ch/en/homepage-hes-so-1679.html</td>
<td>215</td>
<td>1998</td>
<td>0</td>
<td>HES-SO</td>
<td></td>
<td>0000 0001 0943 1999</td>
<td></td>
<td>10128956</td>
<td>Q168003</td>
<td>grid.5681.a</td>
</tr>
<tr>
<td>27</td>
<td>https://ror.org/05ep8g269</td>
<td>University of Applied Sciences and Arts of Sou...</td>
<td></td>
<td></td>
<td></td>
<td>Scuola Universitaria Professionale della Svizz...</td>
<td>http://www.supsi.ch/home_en.html</td>
<td>215</td>
<td>1997</td>
<td>0</td>
<td>SUPSI</td>
<td></td>
<td>0000000123252233</td>
<td></td>
<td>34066841</td>
<td>Q663984</td>
<td>grid.16058.3a</td>
</tr>
<tr>
<td>28</td>
<td>https://ror.org/00w9q2c06</td>
<td>University of Applied Sciences of Special Need...</td>
<td></td>
<td></td>
<td>Interkantonale Hochschule für Heilpädagogik</td>
<td></td>
<td>http://www.hfh.ch/en/</td>
<td>215</td>
<td>1924</td>
<td>0</td>
<td>HfH</td>
<td>Zurich Training College for Teachers of Specia...</td>
<td>0000 0001 0710 6332</td>
<td></td>
<td></td>
<td></td>
<td>grid.466279.8</td>
</tr>
<tr>
<td>29</td>
<td>https://ror.org/032ymzc07</td>
<td>University of Applied Sciences of the Grisons</td>
<td></td>
<td></td>
<td>Fachhochschule Graubünden</td>
<td></td>
<td>https://www.fhgr.ch/en/</td>
<td>215</td>
<td>1963</td>
<td>0</td>
<td></td>
<td>Hochschule für Technik und Wirtschaft Chur</td>
<td>0000 0000 8718 2812</td>
<td></td>
<td></td>
<td>Q1622220</td>
<td>grid.460104.7</td>
</tr>
<tr>
<td>30</td>
<td>https://ror.org/02s6k3f65</td>
<td>University of Basel</td>
<td></td>
<td>Université de bâle</td>
<td>Universität Basel</td>
<td>Università di Basilea</td>
<td>https://www.unibas.ch/de</td>
<td>215</td>
<td>1460</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0004 1937 0642</td>
<td>100008375</td>
<td>427614</td>
<td>Q372608</td>
<td>grid.6612.3</td>
</tr>
<tr>
<td>31</td>
<td>https://ror.org/02k7v4d05</td>
<td>University of Bern (UB)</td>
<td></td>
<td>Université de Berne</td>
<td>Universität Bern</td>
<td>Università di Berna</td>
<td>http://www.unibe.ch/eng/</td>
<td>215</td>
<td>1834</td>
<td>0</td>
<td>UB</td>
<td></td>
<td>0000 0001 0726 5157</td>
<td>100009068</td>
<td>1157515</td>
<td>Q659080</td>
<td>grid.5734.5</td>
</tr>
<tr>
<td>32</td>
<td>https://ror.org/022fs9h90</td>
<td>University of Fribourg</td>
<td></td>
<td>Université de Fribourg</td>
<td>Universität Freiburg</td>
<td>Università di Friburgo</td>
<td>http://www.unifr.ch/home/welcomeE.php</td>
<td>215</td>
<td>1889</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0004 0478 1713</td>
<td>501100005869</td>
<td>535267</td>
<td>Q36188</td>
<td>grid.8534.a</td>
</tr>
<tr>
<td>33</td>
<td>https://ror.org/019whta54</td>
<td>University of Lausanne (UNIL)</td>
<td></td>
<td>Université de Lausanne</td>
<td>Universität Lausanne</td>
<td>Università di Losanna</td>
<td>http://www.unil.ch/central/en/home.html</td>
<td>215</td>
<td>1537</td>
<td>0</td>
<td>UNIL</td>
<td>Schola Lausannensis</td>
<td>0000 0001 2165 4204</td>
<td>501100006390</td>
<td>79810</td>
<td>Q658975</td>
<td>grid.9851.5</td>
</tr>
<tr>
<td>34</td>
<td>https://ror.org/01qjrx392</td>
<td>University of Liechtenstein</td>
<td></td>
<td></td>
<td>Universität Liechtenstein</td>
<td></td>
<td>https://www.uni.li/study/de/</td>
<td>128</td>
<td>1961</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0001 2227 4668</td>
<td></td>
<td>10554064</td>
<td>Q974328</td>
<td>grid.445905.9</td>
</tr>
<tr>
<td>35</td>
<td>https://ror.org/00kgrkn83</td>
<td>University of Lucerne (UNILU)</td>
<td></td>
<td>Université de lucerne</td>
<td>Universität Luzern</td>
<td>Università di Lucerna</td>
<td>http://www.unilu.ch/</td>
<td>215</td>
<td>2000</td>
<td>0</td>
<td>UNILU</td>
<td></td>
<td>0000 0001 1456 7938</td>
<td></td>
<td>21004764</td>
<td>Q673308</td>
<td>grid.449852.6</td>
</tr>
<tr>
<td>36</td>
<td>https://ror.org/00vasag41</td>
<td>University of Neuchâtel</td>
<td></td>
<td>Université de neuchâtel</td>
<td>Universität Neuenburg</td>
<td></td>
<td>http://www2.unine.ch/</td>
<td>215</td>
<td>1838</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0001 2297 7718</td>
<td>501100005353</td>
<td>3662101</td>
<td>Q541548</td>
<td>grid.10711.36</td>
</tr>
<tr>
<td>37</td>
<td>https://ror.org/0561a3s31</td>
<td>University of St. Gallen (HSG)</td>
<td></td>
<td>Université de saint-gall</td>
<td>Universität St. Gallen</td>
<td>Università di San Gallo</td>
<td>http://www.es.unisg.ch/en/</td>
<td>215</td>
<td>1898</td>
<td>0</td>
<td>HSG</td>
<td></td>
<td>0000 0001 2156 6618</td>
<td>100009572</td>
<td>751473</td>
<td>Q673354</td>
<td>grid.15775.31</td>
</tr>
<tr>
<td>38</td>
<td>https://ror.org/0235ynq74</td>
<td>University of Teacher Education Lucerne</td>
<td></td>
<td></td>
<td>Pädagogische Hochschule Luzern</td>
<td></td>
<td>http://www.phlu.ch/ute-lucerne/</td>
<td>215</td>
<td>2003</td>
<td>0</td>
<td></td>
<td>PH Luzern</td>
<td>0000 0001 0348 1637</td>
<td></td>
<td></td>
<td></td>
<td>grid.465965.d</td>
</tr>
<tr>
<td>39</td>
<td>https://ror.org/05ghhx264</td>
<td>University of Teacher Education Zug (PH Zug)</td>
<td></td>
<td></td>
<td>Pädagogische Hochschule Zug</td>
<td></td>
<td>https://www.zg.ch/behoerden/direktion-fur-bild...</td>
<td>215</td>
<td>2013</td>
<td>0</td>
<td>PH Zug</td>
<td></td>
<td>0000 0004 0449 2225</td>
<td></td>
<td></td>
<td></td>
<td>grid.466274.5</td>
</tr>
<tr>
<td>40</td>
<td>https://ror.org/02crff812</td>
<td>University of Zurich (UZH)</td>
<td></td>
<td>Université de zurich</td>
<td>Universität Zürich</td>
<td>Università di Zurigo</td>
<td>http://www.uzh.ch/index_en.html</td>
<td>215</td>
<td>1833</td>
<td>0</td>
<td>UZH</td>
<td></td>
<td>0000 0004 1937 0650</td>
<td>501100006447</td>
<td>314803</td>
<td>Q206702</td>
<td>grid.7400.3</td>
</tr>
<tr>
<td>41</td>
<td>https://ror.org/05pmsvm27</td>
<td>Zurich University of Applied Sciences (ZHAW)</td>
<td></td>
<td></td>
<td>Zürcher Hochschule für Angewandte Wissenschaften</td>
<td></td>
<td>https://www.zhaw.ch/en/university/</td>
<td>215</td>
<td>2007</td>
<td>0</td>
<td>ZHAW</td>
<td></td>
<td>0000000122291644</td>
<td></td>
<td>30930550</td>
<td>Q2605554</td>
<td>grid.19739.35</td>
</tr>
<tr>
<td>42</td>
<td>https://ror.org/02ejkey04</td>
<td>Zurich University of Applied Sciences in Busin...</td>
<td></td>
<td></td>
<td>Hochschule für Wirtschaft Zürich</td>
<td></td>
<td>http://www.fh-hwz.ch/en</td>
<td>215</td>
<td>1986</td>
<td>0</td>
<td>HWZ</td>
<td></td>
<td>0000 0001 0008 3713</td>
<td></td>
<td>30805829</td>
<td>Q1488771</td>
<td>grid.449909.9</td>
</tr>
<tr>
<td>43</td>
<td>https://ror.org/01awgk221</td>
<td>Zurich University of Teacher Education (PHZH)</td>
<td></td>
<td></td>
<td>Pädagogische Hochschule Zürich</td>
<td></td>
<td>https://phzh.ch/en/</td>
<td>215</td>
<td>2002</td>
<td>0</td>
<td>PHZH</td>
<td>PH Zürich</td>
<td>0000 0000 9666 1858</td>
<td></td>
<td></td>
<td></td>
<td>grid.483054.e</td>
</tr>
<tr>
<td>44</td>
<td>https://ror.org/05r0ap620</td>
<td>Zurich University of the Arts</td>
<td></td>
<td>Haute École d'Art de Zurich</td>
<td>Zürcher Hochschule der Künste</td>
<td></td>
<td>https://www.zhdk.ch/</td>
<td>215</td>
<td>2007</td>
<td>0</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>39250592</td>
<td>Q222450</td>
<td>grid.449912.3</td>
</tr>
<tr>
<td>45</td>
<td>https://ror.org/02s376052</td>
<td>École Polytechnique Fédérale de Lausanne (EPFL)</td>
<td>Swiss Federal Institute of Technology in Lausanne</td>
<td></td>
<td></td>
<td></td>
<td>http://www.epfl.ch/index.en.html</td>
<td>215</td>
<td>1853</td>
<td>0</td>
<td>EPFL</td>
<td></td>
<td>0000000121839049</td>
<td>501100001703</td>
<td>71968</td>
<td>Q262760</td>
<td>grid.5333.6</td>
</tr>
</tbody>
</table>
</div>
```python
# mettre l'EPFL en position 1 et UNIGE en 2
target_row = 45
# Move target row to first element of list.
idx = [target_row] + [i for i in range(len(organization)) if i != target_row]
organization = organization.iloc[idx]
organization
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>ror</th>
<th>name</th>
<th>label_en</th>
<th>label_fr</th>
<th>label_de</th>
<th>label_it</th>
<th>website</th>
<th>country</th>
<th>starting_year</th>
<th>is_funder</th>
<th>acronym</th>
<th>aliases</th>
<th>isni</th>
<th>fundref</th>
<th>orgref</th>
<th>wikidata</th>
<th>grid</th>
</tr>
</thead>
<tbody>
<tr>
<td>45</td>
<td>https://ror.org/02s376052</td>
<td>École Polytechnique Fédérale de Lausanne (EPFL)</td>
<td>Swiss Federal Institute of Technology in Lausanne</td>
<td></td>
<td></td>
<td></td>
<td>http://www.epfl.ch/index.en.html</td>
<td>215</td>
<td>1853</td>
<td>0</td>
<td>EPFL</td>
<td></td>
<td>0000000121839049</td>
<td>501100001703</td>
<td>71968</td>
<td>Q262760</td>
<td>grid.5333.6</td>
</tr>
<tr>
<td>0</td>
<td>https://ror.org/01swzsf04</td>
<td>University of Geneva (UNIGE)</td>
<td></td>
<td>Université de Genève</td>
<td></td>
<td>Università di Ginevra</td>
<td>https://www.unige.ch/</td>
<td>215</td>
<td>1559</td>
<td>0</td>
<td>UNIGE</td>
<td>Schola Genevensis</td>
<td>0000 0001 2322 4988</td>
<td>501100006389</td>
<td>342348</td>
<td>Q503473</td>
<td>grid.8591.5</td>
</tr>
<tr>
<td>1</td>
<td>https://ror.org/04d8ztx87</td>
<td>Agroscope</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>https://www.agroscope.admin.ch/agroscope/en/ho...</td>
<td>215</td>
<td>1850</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0004 4681 910X</td>
<td></td>
<td></td>
<td>Q397466</td>
<td>grid.417771.3</td>
</tr>
<tr>
<td>2</td>
<td>https://ror.org/02bnkt322</td>
<td>Bern University of Applied Sciences (BFH)</td>
<td></td>
<td>Haute école spécialisée bernoise</td>
<td>Berner Fachhochschule</td>
<td></td>
<td>http://www.bfh.ch/en/home.html</td>
<td>215</td>
<td>1997</td>
<td>0</td>
<td>BFH</td>
<td></td>
<td>0000 0001 0688 6779</td>
<td>501100006259</td>
<td>4365265</td>
<td>Q466455</td>
<td>grid.424060.4</td>
</tr>
<tr>
<td>3</td>
<td>https://ror.org/05a28rw58</td>
<td>ETH Zurich (ETH Zurich)</td>
<td></td>
<td>École Polytechnique Fédérale de Zurich</td>
<td>Eidgenössische Technische Hochschule Zürich</td>
<td>Politecnico federale di Zurigo</td>
<td>https://www.ethz.ch/en.html</td>
<td>215</td>
<td>1855</td>
<td>0</td>
<td>ETH Zurich</td>
<td>Swiss Federal Institute of Technology in Zuric...</td>
<td>0000 0001 2156 2780</td>
<td>501100003006</td>
<td>210910</td>
<td>Q11942</td>
<td>grid.5801.c</td>
</tr>
<tr>
<td>4</td>
<td>https://ror.org/01ggx4157</td>
<td>European Organization for Nuclear Research (CERN)</td>
<td></td>
<td>Organisation européenne pour la recherche nucl...</td>
<td>Europäische Organisation für Kernforschung</td>
<td></td>
<td>http://home.web.cern.ch/</td>
<td>215</td>
<td>1954</td>
<td>0</td>
<td>CERN</td>
<td></td>
<td>0000 0001 2156 142X</td>
<td></td>
<td>37351</td>
<td>Q42944</td>
<td>grid.9132.9</td>
</tr>
<tr>
<td>5</td>
<td>https://ror.org/0210tb741</td>
<td>Forschungsinstitut für biologischen Landbau (F...</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>https://www.fibl.org/en/germany/location-de.html</td>
<td>215</td>
<td></td>
<td>0</td>
<td>FiBL</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>grid.506220.3</td>
</tr>
<tr>
<td>6</td>
<td>https://ror.org/007ygn379</td>
<td>Graduate Institute of International and Develo...</td>
<td></td>
<td>Institut de Hautes études Internationales et d...</td>
<td>Hochschulinstitut für internationale Studien u...</td>
<td></td>
<td>http://graduateinstitute.ch/home.html</td>
<td>215</td>
<td>1927</td>
<td>0</td>
<td>IHEID</td>
<td>Graduate Institute Geneva</td>
<td>0000 0001 2296 9873</td>
<td></td>
<td>14744053</td>
<td>Q691686</td>
<td>grid.424404.2</td>
</tr>
<tr>
<td>7</td>
<td>https://ror.org/015pmkr43</td>
<td>Haute École Pédagogique BEJUNE (HEP BEJUNE)</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.hep-bejune.ch/</td>
<td>215</td>
<td>2001</td>
<td>0</td>
<td>HEP BEJUNE</td>
<td></td>
<td>0000 0001 0658 3479</td>
<td></td>
<td></td>
<td></td>
<td>grid.469449.2</td>
</tr>
<tr>
<td>8</td>
<td>https://ror.org/048gre751</td>
<td>Haute École Pédagogique Fribourg (HEP-PH FR)</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>https://www.hepfr.ch/</td>
<td>215</td>
<td>1990</td>
<td>0</td>
<td>HEP-PH FR</td>
<td></td>
<td>0000 0001 0266 4909</td>
<td></td>
<td></td>
<td></td>
<td>grid.469451.b</td>
</tr>
<tr>
<td>9</td>
<td>https://ror.org/01bvm0h13</td>
<td>Haute École Pédagogique du Canton de Vaud (HEP...</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.hepl.ch/cms/accueil.html</td>
<td>215</td>
<td>2001</td>
<td>0</td>
<td>HEP Vaud</td>
<td></td>
<td>0000 0004 0613 4050</td>
<td></td>
<td></td>
<td></td>
<td>grid.466224.0</td>
</tr>
<tr>
<td>10</td>
<td>https://ror.org/049c2kr37</td>
<td>Kalaidos University of Applied Sciences (Kalai...</td>
<td></td>
<td></td>
<td>Kalaidos Fachhochschule</td>
<td></td>
<td>https://www.kalaidos-fh.ch/de-CH</td>
<td>215</td>
<td>1995</td>
<td>0</td>
<td>Kalaidos UAS</td>
<td></td>
<td>0000 0004 0453 9054</td>
<td></td>
<td>6746630</td>
<td>Q681372</td>
<td>grid.449532.d</td>
</tr>
<tr>
<td>11</td>
<td>https://ror.org/021f7p178</td>
<td>Lib4RI - Library for the Research Institutes w...</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.lib4ri.ch/</td>
<td>215</td>
<td>2011</td>
<td>0</td>
<td></td>
<td>Lib4RI</td>
<td>0000 0004 0624 8541</td>
<td></td>
<td></td>
<td>Q1278450</td>
<td>grid.458352.d</td>
</tr>
<tr>
<td>12</td>
<td>https://ror.org/04nd0xd48</td>
<td>Lucerne University of Applied Sciences and Arts</td>
<td></td>
<td>Haute École de lucerne</td>
<td>Hochschule Luzern</td>
<td></td>
<td>https://www.hslu.ch/en/</td>
<td>215</td>
<td>1997</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0001 2191 8943</td>
<td></td>
<td>19480920</td>
<td>Q664028</td>
<td>grid.425064.1</td>
</tr>
<tr>
<td>13</td>
<td>https://ror.org/00p9jf779</td>
<td>Medicines for Malaria Venture (MMV)</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.mmv.org/</td>
<td>215</td>
<td>1999</td>
<td>0</td>
<td>MMV</td>
<td></td>
<td>0000 0004 0432 5267</td>
<td>501100004167</td>
<td></td>
<td>Q6806774</td>
<td>grid.452605.0</td>
</tr>
<tr>
<td>14</td>
<td>https://ror.org/038mj2660</td>
<td>Ostschweizer Fachhochschule OST</td>
<td>Eastern Switzerland University of Applied Scie...</td>
<td></td>
<td></td>
<td></td>
<td>https://www.ost.ch/</td>
<td>215</td>
<td>1999</td>
<td>0</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>grid.510272.3</td>
</tr>
<tr>
<td>15</td>
<td>https://ror.org/05jf1ma54</td>
<td>Pädagogische Hochschule Bern</td>
<td>Bern University of Teacher Education</td>
<td></td>
<td></td>
<td></td>
<td>https://www.phbern.ch</td>
<td>215</td>
<td>2005</td>
<td>0</td>
<td></td>
<td>PHBern</td>
<td>0000 0000 8585 5665</td>
<td></td>
<td></td>
<td></td>
<td>grid.454333.6</td>
</tr>
<tr>
<td>16</td>
<td>https://ror.org/02fjgft97</td>
<td>Pädagogische Hochschule Graubünden (PHGR)</td>
<td></td>
<td></td>
<td></td>
<td>Alta scuola pedagogica dei Grigioni</td>
<td>http://www.phgr.ch/</td>
<td>215</td>
<td></td>
<td>0</td>
<td>PHGR</td>
<td></td>
<td>0000 0000 9317 283X</td>
<td></td>
<td></td>
<td></td>
<td>grid.469478.0</td>
</tr>
<tr>
<td>17</td>
<td>https://ror.org/03fs41j10</td>
<td>Pädagogische Hochschule Schaffhausen (PHSH)</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.phsh.ch/</td>
<td>215</td>
<td>2003</td>
<td>0</td>
<td>PHSH</td>
<td></td>
<td>0000 0004 0450 7546</td>
<td></td>
<td></td>
<td></td>
<td>grid.466133.5</td>
</tr>
<tr>
<td>18</td>
<td>https://ror.org/04bf6dq94</td>
<td>Pädagogische Hochschule Thurgau (PHTG)</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.phtg.ch/home/</td>
<td>215</td>
<td>2003</td>
<td>0</td>
<td>PHTG</td>
<td></td>
<td>0000 0004 0613 3824</td>
<td></td>
<td></td>
<td></td>
<td>grid.466322.7</td>
</tr>
<tr>
<td>19</td>
<td>https://ror.org/040gs8e06</td>
<td>Pädagogische Hochschule Wallis (PH-VS)</td>
<td></td>
<td>Haute École Pédagogique du Valais</td>
<td></td>
<td></td>
<td>http://www.hepvs.ch/de</td>
<td>215</td>
<td>2000</td>
<td>0</td>
<td>PH-VS</td>
<td></td>
<td>0000 0001 2178 3217</td>
<td></td>
<td></td>
<td></td>
<td>grid.466216.1</td>
</tr>
<tr>
<td>20</td>
<td>https://ror.org/00rqdn375</td>
<td>Schwyz University of Teacher Education (PHSZ)</td>
<td></td>
<td></td>
<td>Pädagogische Hochschule Schwyz</td>
<td></td>
<td>https://www.phsz.ch/en/</td>
<td>215</td>
<td></td>
<td>0</td>
<td>PHSZ</td>
<td>PHZ Schwyz</td>
<td>0000 0004 0613 7454</td>
<td></td>
<td></td>
<td></td>
<td>grid.466169.a</td>
</tr>
<tr>
<td>21</td>
<td>https://ror.org/05m37v666</td>
<td>St.Gallen University of Teacher Education (PHSG)</td>
<td></td>
<td></td>
<td>Pädagogische Hochschule St. Gallen</td>
<td></td>
<td>https://www.phsg.ch/en</td>
<td>215</td>
<td>2007</td>
<td>0</td>
<td>PHSG</td>
<td></td>
<td>0000 0001 0271 5139</td>
<td></td>
<td></td>
<td>Q1768652</td>
<td>grid.466208.e</td>
</tr>
<tr>
<td>22</td>
<td>https://ror.org/00zg4za48</td>
<td>Swiss Federal Institute for Vocational Educati...</td>
<td></td>
<td>Institut Fédéral des Hautes Études en Formatio...</td>
<td>Eidgenössisches Hochschulinstitut für Berufsbi...</td>
<td></td>
<td>http://www.ehb-schweiz.ch/en/</td>
<td>215</td>
<td>2007</td>
<td>0</td>
<td>SFIVET</td>
<td></td>
<td>0000 0001 2285 5681</td>
<td></td>
<td></td>
<td>Q1302632</td>
<td>grid.466173.1</td>
</tr>
<tr>
<td>23</td>
<td>https://ror.org/03mcsbr76</td>
<td>Swiss Ornithological Institute</td>
<td></td>
<td></td>
<td>Schweizerische Vogelwarte</td>
<td></td>
<td>http://www.vogelwarte.ch/de/home/</td>
<td>215</td>
<td>1924</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0001 1512 3677</td>
<td></td>
<td></td>
<td>Q663638</td>
<td>grid.419767.a</td>
</tr>
<tr>
<td>24</td>
<td>https://ror.org/03c4atk17</td>
<td>Universita della Svizzera Italiana (USI)</td>
<td>University of Italian Switzerland</td>
<td>Université de la suisse italienne</td>
<td></td>
<td>Università della Svizzera italiana</td>
<td>http://www.usi.ch/en/index.htm</td>
<td>215</td>
<td>1996</td>
<td>0</td>
<td>USI</td>
<td></td>
<td>0000 0001 2203 2861</td>
<td></td>
<td>2290642</td>
<td>Q689617</td>
<td>grid.29078.34</td>
</tr>
<tr>
<td>25</td>
<td>https://ror.org/04mq2g308</td>
<td>University of Applied Sciences and Arts Northw...</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.fhnw.ch/homepage</td>
<td>215</td>
<td>2006</td>
<td>0</td>
<td>FHNW</td>
<td>Fachhochschule Nordwestschweiz</td>
<td>0000 0001 1497 8091</td>
<td></td>
<td></td>
<td></td>
<td>grid.410380.e</td>
</tr>
<tr>
<td>26</td>
<td>https://ror.org/01xkakk17</td>
<td>University of Applied Sciences and Arts Wester...</td>
<td></td>
<td>Haute École Spécialisée de Suisse Occidentale</td>
<td>Fachhochschule Westschweiz</td>
<td></td>
<td>http://www.hes-so.ch/en/homepage-hes-so-1679.html</td>
<td>215</td>
<td>1998</td>
<td>0</td>
<td>HES-SO</td>
<td></td>
<td>0000 0001 0943 1999</td>
<td></td>
<td>10128956</td>
<td>Q168003</td>
<td>grid.5681.a</td>
</tr>
<tr>
<td>27</td>
<td>https://ror.org/05ep8g269</td>
<td>University of Applied Sciences and Arts of Sou...</td>
<td></td>
<td></td>
<td></td>
<td>Scuola Universitaria Professionale della Svizz...</td>
<td>http://www.supsi.ch/home_en.html</td>
<td>215</td>
<td>1997</td>
<td>0</td>
<td>SUPSI</td>
<td></td>
<td>0000000123252233</td>
<td></td>
<td>34066841</td>
<td>Q663984</td>
<td>grid.16058.3a</td>
</tr>
<tr>
<td>28</td>
<td>https://ror.org/00w9q2c06</td>
<td>University of Applied Sciences of Special Need...</td>
<td></td>
<td></td>
<td>Interkantonale Hochschule für Heilpädagogik</td>
<td></td>
<td>http://www.hfh.ch/en/</td>
<td>215</td>
<td>1924</td>
<td>0</td>
<td>HfH</td>
<td>Zurich Training College for Teachers of Specia...</td>
<td>0000 0001 0710 6332</td>
<td></td>
<td></td>
<td></td>
<td>grid.466279.8</td>
</tr>
<tr>
<td>29</td>
<td>https://ror.org/032ymzc07</td>
<td>University of Applied Sciences of the Grisons</td>
<td></td>
<td></td>
<td>Fachhochschule Graubünden</td>
<td></td>
<td>https://www.fhgr.ch/en/</td>
<td>215</td>
<td>1963</td>
<td>0</td>
<td></td>
<td>Hochschule für Technik und Wirtschaft Chur</td>
<td>0000 0000 8718 2812</td>
<td></td>
<td></td>
<td>Q1622220</td>
<td>grid.460104.7</td>
</tr>
<tr>
<td>30</td>
<td>https://ror.org/02s6k3f65</td>
<td>University of Basel</td>
<td></td>
<td>Université de bâle</td>
<td>Universität Basel</td>
<td>Università di Basilea</td>
<td>https://www.unibas.ch/de</td>
<td>215</td>
<td>1460</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0004 1937 0642</td>
<td>100008375</td>
<td>427614</td>
<td>Q372608</td>
<td>grid.6612.3</td>
</tr>
<tr>
<td>31</td>
<td>https://ror.org/02k7v4d05</td>
<td>University of Bern (UB)</td>
<td></td>
<td>Université de Berne</td>
<td>Universität Bern</td>
<td>Università di Berna</td>
<td>http://www.unibe.ch/eng/</td>
<td>215</td>
<td>1834</td>
<td>0</td>
<td>UB</td>
<td></td>
<td>0000 0001 0726 5157</td>
<td>100009068</td>
<td>1157515</td>
<td>Q659080</td>
<td>grid.5734.5</td>
</tr>
<tr>
<td>32</td>
<td>https://ror.org/022fs9h90</td>
<td>University of Fribourg</td>
<td></td>
<td>Université de Fribourg</td>
<td>Universität Freiburg</td>
<td>Università di Friburgo</td>
<td>http://www.unifr.ch/home/welcomeE.php</td>
<td>215</td>
<td>1889</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0004 0478 1713</td>
<td>501100005869</td>
<td>535267</td>
<td>Q36188</td>
<td>grid.8534.a</td>
</tr>
<tr>
<td>33</td>
<td>https://ror.org/019whta54</td>
<td>University of Lausanne (UNIL)</td>
<td></td>
<td>Université de Lausanne</td>
<td>Universität Lausanne</td>
<td>Università di Losanna</td>
<td>http://www.unil.ch/central/en/home.html</td>
<td>215</td>
<td>1537</td>
<td>0</td>
<td>UNIL</td>
<td>Schola Lausannensis</td>
<td>0000 0001 2165 4204</td>
<td>501100006390</td>
<td>79810</td>
<td>Q658975</td>
<td>grid.9851.5</td>
</tr>
<tr>
<td>34</td>
<td>https://ror.org/01qjrx392</td>
<td>University of Liechtenstein</td>
<td></td>
<td></td>
<td>Universität Liechtenstein</td>
<td></td>
<td>https://www.uni.li/study/de/</td>
<td>128</td>
<td>1961</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0001 2227 4668</td>
<td></td>
<td>10554064</td>
<td>Q974328</td>
<td>grid.445905.9</td>
</tr>
<tr>
<td>35</td>
<td>https://ror.org/00kgrkn83</td>
<td>University of Lucerne (UNILU)</td>
<td></td>
<td>Université de lucerne</td>
<td>Universität Luzern</td>
<td>Università di Lucerna</td>
<td>http://www.unilu.ch/</td>
<td>215</td>
<td>2000</td>
<td>0</td>
<td>UNILU</td>
<td></td>
<td>0000 0001 1456 7938</td>
<td></td>
<td>21004764</td>
<td>Q673308</td>
<td>grid.449852.6</td>
</tr>
<tr>
<td>36</td>
<td>https://ror.org/00vasag41</td>
<td>University of Neuchâtel</td>
<td></td>
<td>Université de neuchâtel</td>
<td>Universität Neuenburg</td>
<td></td>
<td>http://www2.unine.ch/</td>
<td>215</td>
<td>1838</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0001 2297 7718</td>
<td>501100005353</td>
<td>3662101</td>
<td>Q541548</td>
<td>grid.10711.36</td>
</tr>
<tr>
<td>37</td>
<td>https://ror.org/0561a3s31</td>
<td>University of St. Gallen (HSG)</td>
<td></td>
<td>Université de saint-gall</td>
<td>Universität St. Gallen</td>
<td>Università di San Gallo</td>
<td>http://www.es.unisg.ch/en/</td>
<td>215</td>
<td>1898</td>
<td>0</td>
<td>HSG</td>
<td></td>
<td>0000 0001 2156 6618</td>
<td>100009572</td>
<td>751473</td>
<td>Q673354</td>
<td>grid.15775.31</td>
</tr>
<tr>
<td>38</td>
<td>https://ror.org/0235ynq74</td>
<td>University of Teacher Education Lucerne</td>
<td></td>
<td></td>
<td>Pädagogische Hochschule Luzern</td>
<td></td>
<td>http://www.phlu.ch/ute-lucerne/</td>
<td>215</td>
<td>2003</td>
<td>0</td>
<td></td>
<td>PH Luzern</td>
<td>0000 0001 0348 1637</td>
<td></td>
<td></td>
<td></td>
<td>grid.465965.d</td>
</tr>
<tr>
<td>39</td>
<td>https://ror.org/05ghhx264</td>
<td>University of Teacher Education Zug (PH Zug)</td>
<td></td>
<td></td>
<td>Pädagogische Hochschule Zug</td>
<td></td>
<td>https://www.zg.ch/behoerden/direktion-fur-bild...</td>
<td>215</td>
<td>2013</td>
<td>0</td>
<td>PH Zug</td>
<td></td>
<td>0000 0004 0449 2225</td>
<td></td>
<td></td>
<td></td>
<td>grid.466274.5</td>
</tr>
<tr>
<td>40</td>
<td>https://ror.org/02crff812</td>
<td>University of Zurich (UZH)</td>
<td></td>
<td>Université de zurich</td>
<td>Universität Zürich</td>
<td>Università di Zurigo</td>
<td>http://www.uzh.ch/index_en.html</td>
<td>215</td>
<td>1833</td>
<td>0</td>
<td>UZH</td>
<td></td>
<td>0000 0004 1937 0650</td>
<td>501100006447</td>
<td>314803</td>
<td>Q206702</td>
<td>grid.7400.3</td>
</tr>
<tr>
<td>41</td>
<td>https://ror.org/05pmsvm27</td>
<td>Zurich University of Applied Sciences (ZHAW)</td>
<td></td>
<td></td>
<td>Zürcher Hochschule für Angewandte Wissenschaften</td>
<td></td>
<td>https://www.zhaw.ch/en/university/</td>
<td>215</td>
<td>2007</td>
<td>0</td>
<td>ZHAW</td>
<td></td>
<td>0000000122291644</td>
<td></td>
<td>30930550</td>
<td>Q2605554</td>
<td>grid.19739.35</td>
</tr>
<tr>
<td>42</td>
<td>https://ror.org/02ejkey04</td>
<td>Zurich University of Applied Sciences in Busin...</td>
<td></td>
<td></td>
<td>Hochschule für Wirtschaft Zürich</td>
<td></td>
<td>http://www.fh-hwz.ch/en</td>
<td>215</td>
<td>1986</td>
<td>0</td>
<td>HWZ</td>
<td></td>
<td>0000 0001 0008 3713</td>
<td></td>
<td>30805829</td>
<td>Q1488771</td>
<td>grid.449909.9</td>
</tr>
<tr>
<td>43</td>
<td>https://ror.org/01awgk221</td>
<td>Zurich University of Teacher Education (PHZH)</td>
<td></td>
<td></td>
<td>Pädagogische Hochschule Zürich</td>
<td></td>
<td>https://phzh.ch/en/</td>
<td>215</td>
<td>2002</td>
<td>0</td>
<td>PHZH</td>
<td>PH Zürich</td>
<td>0000 0000 9666 1858</td>
<td></td>
<td></td>
<td></td>
<td>grid.483054.e</td>
</tr>
<tr>
<td>44</td>
<td>https://ror.org/05r0ap620</td>
<td>Zurich University of the Arts</td>
<td></td>
<td>Haute École d'Art de Zurich</td>
<td>Zürcher Hochschule der Künste</td>
<td></td>
<td>https://www.zhdk.ch/</td>
<td>215</td>
<td>2007</td>
<td>0</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>39250592</td>
<td>Q222450</td>
<td>grid.449912.3</td>
</tr>
</tbody>
</table>
</div>
```python
organization = organization.reset_index(drop=True)
organization
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>ror</th>
<th>name</th>
<th>label_en</th>
<th>label_fr</th>
<th>label_de</th>
<th>label_it</th>
<th>website</th>
<th>country</th>
<th>starting_year</th>
<th>is_funder</th>
<th>acronym</th>
<th>aliases</th>
<th>isni</th>
<th>fundref</th>
<th>orgref</th>
<th>wikidata</th>
<th>grid</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>https://ror.org/02s376052</td>
<td>École Polytechnique Fédérale de Lausanne (EPFL)</td>
<td>Swiss Federal Institute of Technology in Lausanne</td>
<td></td>
<td></td>
<td></td>
<td>http://www.epfl.ch/index.en.html</td>
<td>215</td>
<td>1853</td>
<td>0</td>
<td>EPFL</td>
<td></td>
<td>0000000121839049</td>
<td>501100001703</td>
<td>71968</td>
<td>Q262760</td>
<td>grid.5333.6</td>
</tr>
<tr>
<td>1</td>
<td>https://ror.org/01swzsf04</td>
<td>University of Geneva (UNIGE)</td>
<td></td>
<td>Université de Genève</td>
<td></td>
<td>Università di Ginevra</td>
<td>https://www.unige.ch/</td>
<td>215</td>
<td>1559</td>
<td>0</td>
<td>UNIGE</td>
<td>Schola Genevensis</td>
<td>0000 0001 2322 4988</td>
<td>501100006389</td>
<td>342348</td>
<td>Q503473</td>
<td>grid.8591.5</td>
</tr>
<tr>
<td>2</td>
<td>https://ror.org/04d8ztx87</td>
<td>Agroscope</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>https://www.agroscope.admin.ch/agroscope/en/ho...</td>
<td>215</td>
<td>1850</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0004 4681 910X</td>
<td></td>
<td></td>
<td>Q397466</td>
<td>grid.417771.3</td>
</tr>
<tr>
<td>3</td>
<td>https://ror.org/02bnkt322</td>
<td>Bern University of Applied Sciences (BFH)</td>
<td></td>
<td>Haute école spécialisée bernoise</td>
<td>Berner Fachhochschule</td>
<td></td>
<td>http://www.bfh.ch/en/home.html</td>
<td>215</td>
<td>1997</td>
<td>0</td>
<td>BFH</td>
<td></td>
<td>0000 0001 0688 6779</td>
<td>501100006259</td>
<td>4365265</td>
<td>Q466455</td>
<td>grid.424060.4</td>
</tr>
<tr>
<td>4</td>
<td>https://ror.org/05a28rw58</td>
<td>ETH Zurich (ETH Zurich)</td>
<td></td>
<td>École Polytechnique Fédérale de Zurich</td>
<td>Eidgenössische Technische Hochschule Zürich</td>
<td>Politecnico federale di Zurigo</td>
<td>https://www.ethz.ch/en.html</td>
<td>215</td>
<td>1855</td>
<td>0</td>
<td>ETH Zurich</td>
<td>Swiss Federal Institute of Technology in Zuric...</td>
<td>0000 0001 2156 2780</td>
<td>501100003006</td>
<td>210910</td>
<td>Q11942</td>
<td>grid.5801.c</td>
</tr>
<tr>
<td>5</td>
<td>https://ror.org/01ggx4157</td>
<td>European Organization for Nuclear Research (CERN)</td>
<td></td>
<td>Organisation européenne pour la recherche nucl...</td>
<td>Europäische Organisation für Kernforschung</td>
<td></td>
<td>http://home.web.cern.ch/</td>
<td>215</td>
<td>1954</td>
<td>0</td>
<td>CERN</td>
<td></td>
<td>0000 0001 2156 142X</td>
<td></td>
<td>37351</td>
<td>Q42944</td>
<td>grid.9132.9</td>
</tr>
<tr>
<td>6</td>
<td>https://ror.org/0210tb741</td>
<td>Forschungsinstitut für biologischen Landbau (F...</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>https://www.fibl.org/en/germany/location-de.html</td>
<td>215</td>
<td></td>
<td>0</td>
<td>FiBL</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>grid.506220.3</td>
</tr>
<tr>
<td>7</td>
<td>https://ror.org/007ygn379</td>
<td>Graduate Institute of International and Develo...</td>
<td></td>
<td>Institut de Hautes études Internationales et d...</td>
<td>Hochschulinstitut für internationale Studien u...</td>
<td></td>
<td>http://graduateinstitute.ch/home.html</td>
<td>215</td>
<td>1927</td>
<td>0</td>
<td>IHEID</td>
<td>Graduate Institute Geneva</td>
<td>0000 0001 2296 9873</td>
<td></td>
<td>14744053</td>
<td>Q691686</td>
<td>grid.424404.2</td>
</tr>
<tr>
<td>8</td>
<td>https://ror.org/015pmkr43</td>
<td>Haute École Pédagogique BEJUNE (HEP BEJUNE)</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.hep-bejune.ch/</td>
<td>215</td>
<td>2001</td>
<td>0</td>
<td>HEP BEJUNE</td>
<td></td>
<td>0000 0001 0658 3479</td>
<td></td>
<td></td>
<td></td>
<td>grid.469449.2</td>
</tr>
<tr>
<td>9</td>
<td>https://ror.org/048gre751</td>
<td>Haute École Pédagogique Fribourg (HEP-PH FR)</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>https://www.hepfr.ch/</td>
<td>215</td>
<td>1990</td>
<td>0</td>
<td>HEP-PH FR</td>
<td></td>
<td>0000 0001 0266 4909</td>
<td></td>
<td></td>
<td></td>
<td>grid.469451.b</td>
</tr>
<tr>
<td>10</td>
<td>https://ror.org/01bvm0h13</td>
<td>Haute École Pédagogique du Canton de Vaud (HEP...</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.hepl.ch/cms/accueil.html</td>
<td>215</td>
<td>2001</td>
<td>0</td>
<td>HEP Vaud</td>
<td></td>
<td>0000 0004 0613 4050</td>
<td></td>
<td></td>
<td></td>
<td>grid.466224.0</td>
</tr>
<tr>
<td>11</td>
<td>https://ror.org/049c2kr37</td>
<td>Kalaidos University of Applied Sciences (Kalai...</td>
<td></td>
<td></td>
<td>Kalaidos Fachhochschule</td>
<td></td>
<td>https://www.kalaidos-fh.ch/de-CH</td>
<td>215</td>
<td>1995</td>
<td>0</td>
<td>Kalaidos UAS</td>
<td></td>
<td>0000 0004 0453 9054</td>
<td></td>
<td>6746630</td>
<td>Q681372</td>
<td>grid.449532.d</td>
</tr>
<tr>
<td>12</td>
<td>https://ror.org/021f7p178</td>
<td>Lib4RI - Library for the Research Institutes w...</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.lib4ri.ch/</td>
<td>215</td>
<td>2011</td>
<td>0</td>
<td></td>
<td>Lib4RI</td>
<td>0000 0004 0624 8541</td>
<td></td>
<td></td>
<td>Q1278450</td>
<td>grid.458352.d</td>
</tr>
<tr>
<td>13</td>
<td>https://ror.org/04nd0xd48</td>
<td>Lucerne University of Applied Sciences and Arts</td>
<td></td>
<td>Haute École de lucerne</td>
<td>Hochschule Luzern</td>
<td></td>
<td>https://www.hslu.ch/en/</td>
<td>215</td>
<td>1997</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0001 2191 8943</td>
<td></td>
<td>19480920</td>
<td>Q664028</td>
<td>grid.425064.1</td>
</tr>
<tr>
<td>14</td>
<td>https://ror.org/00p9jf779</td>
<td>Medicines for Malaria Venture (MMV)</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.mmv.org/</td>
<td>215</td>
<td>1999</td>
<td>0</td>
<td>MMV</td>
<td></td>
<td>0000 0004 0432 5267</td>
<td>501100004167</td>
<td></td>
<td>Q6806774</td>
<td>grid.452605.0</td>
</tr>
<tr>
<td>15</td>
<td>https://ror.org/038mj2660</td>
<td>Ostschweizer Fachhochschule OST</td>
<td>Eastern Switzerland University of Applied Scie...</td>
<td></td>
<td></td>
<td></td>
<td>https://www.ost.ch/</td>
<td>215</td>
<td>1999</td>
<td>0</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>grid.510272.3</td>
</tr>
<tr>
<td>16</td>
<td>https://ror.org/05jf1ma54</td>
<td>Pädagogische Hochschule Bern</td>
<td>Bern University of Teacher Education</td>
<td></td>
<td></td>
<td></td>
<td>https://www.phbern.ch</td>
<td>215</td>
<td>2005</td>
<td>0</td>
<td></td>
<td>PHBern</td>
<td>0000 0000 8585 5665</td>
<td></td>
<td></td>
<td></td>
<td>grid.454333.6</td>
</tr>
<tr>
<td>17</td>
<td>https://ror.org/02fjgft97</td>
<td>Pädagogische Hochschule Graubünden (PHGR)</td>
<td></td>
<td></td>
<td></td>
<td>Alta scuola pedagogica dei Grigioni</td>
<td>http://www.phgr.ch/</td>
<td>215</td>
<td></td>
<td>0</td>
<td>PHGR</td>
<td></td>
<td>0000 0000 9317 283X</td>
<td></td>
<td></td>
<td></td>
<td>grid.469478.0</td>
</tr>
<tr>
<td>18</td>
<td>https://ror.org/03fs41j10</td>
<td>Pädagogische Hochschule Schaffhausen (PHSH)</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.phsh.ch/</td>
<td>215</td>
<td>2003</td>
<td>0</td>
<td>PHSH</td>
<td></td>
<td>0000 0004 0450 7546</td>
<td></td>
<td></td>
<td></td>
<td>grid.466133.5</td>
</tr>
<tr>
<td>19</td>
<td>https://ror.org/04bf6dq94</td>
<td>Pädagogische Hochschule Thurgau (PHTG)</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.phtg.ch/home/</td>
<td>215</td>
<td>2003</td>
<td>0</td>
<td>PHTG</td>
<td></td>
<td>0000 0004 0613 3824</td>
<td></td>
<td></td>
<td></td>
<td>grid.466322.7</td>
</tr>
<tr>
<td>20</td>
<td>https://ror.org/040gs8e06</td>
<td>Pädagogische Hochschule Wallis (PH-VS)</td>
<td></td>
<td>Haute École Pédagogique du Valais</td>
<td></td>
<td></td>
<td>http://www.hepvs.ch/de</td>
<td>215</td>
<td>2000</td>
<td>0</td>
<td>PH-VS</td>
<td></td>
<td>0000 0001 2178 3217</td>
<td></td>
<td></td>
<td></td>
<td>grid.466216.1</td>
</tr>
<tr>
<td>21</td>
<td>https://ror.org/00rqdn375</td>
<td>Schwyz University of Teacher Education (PHSZ)</td>
<td></td>
<td></td>
<td>Pädagogische Hochschule Schwyz</td>
<td></td>
<td>https://www.phsz.ch/en/</td>
<td>215</td>
<td></td>
<td>0</td>
<td>PHSZ</td>
<td>PHZ Schwyz</td>
<td>0000 0004 0613 7454</td>
<td></td>
<td></td>
<td></td>
<td>grid.466169.a</td>
</tr>
<tr>
<td>22</td>
<td>https://ror.org/05m37v666</td>
<td>St.Gallen University of Teacher Education (PHSG)</td>
<td></td>
<td></td>
<td>Pädagogische Hochschule St. Gallen</td>
<td></td>
<td>https://www.phsg.ch/en</td>
<td>215</td>
<td>2007</td>
<td>0</td>
<td>PHSG</td>
<td></td>
<td>0000 0001 0271 5139</td>
<td></td>
<td></td>
<td>Q1768652</td>
<td>grid.466208.e</td>
</tr>
<tr>
<td>23</td>
<td>https://ror.org/00zg4za48</td>
<td>Swiss Federal Institute for Vocational Educati...</td>
<td></td>
<td>Institut Fédéral des Hautes Études en Formatio...</td>
<td>Eidgenössisches Hochschulinstitut für Berufsbi...</td>
<td></td>
<td>http://www.ehb-schweiz.ch/en/</td>
<td>215</td>
<td>2007</td>
<td>0</td>
<td>SFIVET</td>
<td></td>
<td>0000 0001 2285 5681</td>
<td></td>
<td></td>
<td>Q1302632</td>
<td>grid.466173.1</td>
</tr>
<tr>
<td>24</td>
<td>https://ror.org/03mcsbr76</td>
<td>Swiss Ornithological Institute</td>
<td></td>
<td></td>
<td>Schweizerische Vogelwarte</td>
<td></td>
<td>http://www.vogelwarte.ch/de/home/</td>
<td>215</td>
<td>1924</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0001 1512 3677</td>
<td></td>
<td></td>
<td>Q663638</td>
<td>grid.419767.a</td>
</tr>
<tr>
<td>25</td>
<td>https://ror.org/03c4atk17</td>
<td>Universita della Svizzera Italiana (USI)</td>
<td>University of Italian Switzerland</td>
<td>Université de la suisse italienne</td>
<td></td>
<td>Università della Svizzera italiana</td>
<td>http://www.usi.ch/en/index.htm</td>
<td>215</td>
<td>1996</td>
<td>0</td>
<td>USI</td>
<td></td>
<td>0000 0001 2203 2861</td>
<td></td>
<td>2290642</td>
<td>Q689617</td>
<td>grid.29078.34</td>
</tr>
<tr>
<td>26</td>
<td>https://ror.org/04mq2g308</td>
<td>University of Applied Sciences and Arts Northw...</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>http://www.fhnw.ch/homepage</td>
<td>215</td>
<td>2006</td>
<td>0</td>
<td>FHNW</td>
<td>Fachhochschule Nordwestschweiz</td>
<td>0000 0001 1497 8091</td>
<td></td>
<td></td>
<td></td>
<td>grid.410380.e</td>
</tr>
<tr>
<td>27</td>
<td>https://ror.org/01xkakk17</td>
<td>University of Applied Sciences and Arts Wester...</td>
<td></td>
<td>Haute École Spécialisée de Suisse Occidentale</td>
<td>Fachhochschule Westschweiz</td>
<td></td>
<td>http://www.hes-so.ch/en/homepage-hes-so-1679.html</td>
<td>215</td>
<td>1998</td>
<td>0</td>
<td>HES-SO</td>
<td></td>
<td>0000 0001 0943 1999</td>
<td></td>
<td>10128956</td>
<td>Q168003</td>
<td>grid.5681.a</td>
</tr>
<tr>
<td>28</td>
<td>https://ror.org/05ep8g269</td>
<td>University of Applied Sciences and Arts of Sou...</td>
<td></td>
<td></td>
<td></td>
<td>Scuola Universitaria Professionale della Svizz...</td>
<td>http://www.supsi.ch/home_en.html</td>
<td>215</td>
<td>1997</td>
<td>0</td>
<td>SUPSI</td>
<td></td>
<td>0000000123252233</td>
<td></td>
<td>34066841</td>
<td>Q663984</td>
<td>grid.16058.3a</td>
</tr>
<tr>
<td>29</td>
<td>https://ror.org/00w9q2c06</td>
<td>University of Applied Sciences of Special Need...</td>
<td></td>
<td></td>
<td>Interkantonale Hochschule für Heilpädagogik</td>
<td></td>
<td>http://www.hfh.ch/en/</td>
<td>215</td>
<td>1924</td>
<td>0</td>
<td>HfH</td>
<td>Zurich Training College for Teachers of Specia...</td>
<td>0000 0001 0710 6332</td>
<td></td>
<td></td>
<td></td>
<td>grid.466279.8</td>
</tr>
<tr>
<td>30</td>
<td>https://ror.org/032ymzc07</td>
<td>University of Applied Sciences of the Grisons</td>
<td></td>
<td></td>
<td>Fachhochschule Graubünden</td>
<td></td>
<td>https://www.fhgr.ch/en/</td>
<td>215</td>
<td>1963</td>
<td>0</td>
<td></td>
<td>Hochschule für Technik und Wirtschaft Chur</td>
<td>0000 0000 8718 2812</td>
<td></td>
<td></td>
<td>Q1622220</td>
<td>grid.460104.7</td>
</tr>
<tr>
<td>31</td>
<td>https://ror.org/02s6k3f65</td>
<td>University of Basel</td>
<td></td>
<td>Université de bâle</td>
<td>Universität Basel</td>
<td>Università di Basilea</td>
<td>https://www.unibas.ch/de</td>
<td>215</td>
<td>1460</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0004 1937 0642</td>
<td>100008375</td>
<td>427614</td>
<td>Q372608</td>
<td>grid.6612.3</td>
</tr>
<tr>
<td>32</td>
<td>https://ror.org/02k7v4d05</td>
<td>University of Bern (UB)</td>
<td></td>
<td>Université de Berne</td>
<td>Universität Bern</td>
<td>Università di Berna</td>
<td>http://www.unibe.ch/eng/</td>
<td>215</td>
<td>1834</td>
<td>0</td>
<td>UB</td>
<td></td>
<td>0000 0001 0726 5157</td>
<td>100009068</td>
<td>1157515</td>
<td>Q659080</td>
<td>grid.5734.5</td>
</tr>
<tr>
<td>33</td>
<td>https://ror.org/022fs9h90</td>
<td>University of Fribourg</td>
<td></td>
<td>Université de Fribourg</td>
<td>Universität Freiburg</td>
<td>Università di Friburgo</td>
<td>http://www.unifr.ch/home/welcomeE.php</td>
<td>215</td>
<td>1889</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0004 0478 1713</td>
<td>501100005869</td>
<td>535267</td>
<td>Q36188</td>
<td>grid.8534.a</td>
</tr>
<tr>
<td>34</td>
<td>https://ror.org/019whta54</td>
<td>University of Lausanne (UNIL)</td>
<td></td>
<td>Université de Lausanne</td>
<td>Universität Lausanne</td>
<td>Università di Losanna</td>
<td>http://www.unil.ch/central/en/home.html</td>
<td>215</td>
<td>1537</td>
<td>0</td>
<td>UNIL</td>
<td>Schola Lausannensis</td>
<td>0000 0001 2165 4204</td>
<td>501100006390</td>
<td>79810</td>
<td>Q658975</td>
<td>grid.9851.5</td>
</tr>
<tr>
<td>35</td>
<td>https://ror.org/01qjrx392</td>
<td>University of Liechtenstein</td>
<td></td>
<td></td>
<td>Universität Liechtenstein</td>
<td></td>
<td>https://www.uni.li/study/de/</td>
<td>128</td>
<td>1961</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0001 2227 4668</td>
<td></td>
<td>10554064</td>
<td>Q974328</td>
<td>grid.445905.9</td>
</tr>
<tr>
<td>36</td>
<td>https://ror.org/00kgrkn83</td>
<td>University of Lucerne (UNILU)</td>
<td></td>
<td>Université de lucerne</td>
<td>Universität Luzern</td>
<td>Università di Lucerna</td>
<td>http://www.unilu.ch/</td>
<td>215</td>
<td>2000</td>
<td>0</td>
<td>UNILU</td>
<td></td>
<td>0000 0001 1456 7938</td>
<td></td>
<td>21004764</td>
<td>Q673308</td>
<td>grid.449852.6</td>
</tr>
<tr>
<td>37</td>
<td>https://ror.org/00vasag41</td>
<td>University of Neuchâtel</td>
<td></td>
<td>Université de neuchâtel</td>
<td>Universität Neuenburg</td>
<td></td>
<td>http://www2.unine.ch/</td>
<td>215</td>
<td>1838</td>
<td>0</td>
<td></td>
<td></td>
<td>0000 0001 2297 7718</td>
<td>501100005353</td>
<td>3662101</td>
<td>Q541548</td>
<td>grid.10711.36</td>
</tr>
<tr>
<td>38</td>
<td>https://ror.org/0561a3s31</td>
<td>University of St. Gallen (HSG)</td>
<td></td>
<td>Université de saint-gall</td>
<td>Universität St. Gallen</td>
<td>Università di San Gallo</td>
<td>http://www.es.unisg.ch/en/</td>
<td>215</td>
<td>1898</td>
<td>0</td>
<td>HSG</td>
<td></td>
<td>0000 0001 2156 6618</td>
<td>100009572</td>
<td>751473</td>
<td>Q673354</td>
<td>grid.15775.31</td>
</tr>
<tr>
<td>39</td>
<td>https://ror.org/0235ynq74</td>
<td>University of Teacher Education Lucerne</td>
<td></td>
<td></td>
<td>Pädagogische Hochschule Luzern</td>
<td></td>
<td>http://www.phlu.ch/ute-lucerne/</td>
<td>215</td>
<td>2003</td>
<td>0</td>
<td></td>
<td>PH Luzern</td>
<td>0000 0001 0348 1637</td>
<td></td>
<td></td>
<td></td>
<td>grid.465965.d</td>
</tr>
<tr>
<td>40</td>
<td>https://ror.org/05ghhx264</td>
<td>University of Teacher Education Zug (PH Zug)</td>
<td></td>
<td></td>
<td>Pädagogische Hochschule Zug</td>
<td></td>
<td>https://www.zg.ch/behoerden/direktion-fur-bild...</td>
<td>215</td>
<td>2013</td>
<td>0</td>
<td>PH Zug</td>
<td></td>
<td>0000 0004 0449 2225</td>
<td></td>
<td></td>
<td></td>
<td>grid.466274.5</td>
</tr>
<tr>
<td>41</td>
<td>https://ror.org/02crff812</td>
<td>University of Zurich (UZH)</td>
<td></td>
<td>Université de zurich</td>
<td>Universität Zürich</td>
<td>Università di Zurigo</td>
<td>http://www.uzh.ch/index_en.html</td>
<td>215</td>
<td>1833</td>
<td>0</td>
<td>UZH</td>
<td></td>
<td>0000 0004 1937 0650</td>
<td>501100006447</td>
<td>314803</td>
<td>Q206702</td>
<td>grid.7400.3</td>
</tr>
<tr>
<td>42</td>
<td>https://ror.org/05pmsvm27</td>
<td>Zurich University of Applied Sciences (ZHAW)</td>
<td></td>
<td></td>
<td>Zürcher Hochschule für Angewandte Wissenschaften</td>
<td></td>
<td>https://www.zhaw.ch/en/university/</td>
<td>215</td>
<td>2007</td>
<td>0</td>
<td>ZHAW</td>
<td></td>
<td>0000000122291644</td>
<td></td>
<td>30930550</td>
<td>Q2605554</td>
<td>grid.19739.35</td>
</tr>
<tr>
<td>43</td>
<td>https://ror.org/02ejkey04</td>
<td>Zurich University of Applied Sciences in Busin...</td>
<td></td>
<td></td>
<td>Hochschule für Wirtschaft Zürich</td>
<td></td>
<td>http://www.fh-hwz.ch/en</td>
<td>215</td>
<td>1986</td>
<td>0</td>
<td>HWZ</td>
<td></td>
<td>0000 0001 0008 3713</td>
<td></td>
<td>30805829</td>
<td>Q1488771</td>
<td>grid.449909.9</td>
</tr>
<tr>
<td>44</td>
<td>https://ror.org/01awgk221</td>
<td>Zurich University of Teacher Education (PHZH)</td>
<td></td>
<td></td>
<td>Pädagogische Hochschule Zürich</td>
<td></td>
<td>https://phzh.ch/en/</td>
<td>215</td>
<td>2002</td>
<td>0</td>
<td>PHZH</td>
<td>PH Zürich</td>
<td>0000 0000 9666 1858</td>
<td></td>
<td></td>
<td></td>
<td>grid.483054.e</td>
</tr>
<tr>
<td>45</td>
<td>https://ror.org/05r0ap620</td>
<td>Zurich University of the Arts</td>
<td></td>
<td>Haute École d'Art de Zurich</td>
<td>Zürcher Hochschule der Künste</td>
<td></td>
<td>https://www.zhdk.ch/</td>
<td>215</td>
<td>2007</td>
<td>0</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>39250592</td>
<td>Q222450</td>
<td>grid.449912.3</td>
</tr>
</tbody>
</table>
</div>
```python
# ajout des funders
organization = organization.append(organization_funders, ignore_index=True)
organization
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>acronym</th>
<th>aliases</th>
<th>country</th>
<th>fundref</th>
<th>grid</th>
<th>is_funder</th>
<th>isni</th>
<th>iso_code</th>
<th>label_de</th>
<th>label_en</th>
<th>label_fr</th>
<th>label_it</th>
<th>name</th>
<th>orgref</th>
<th>ror</th>
<th>sherpa_id</th>
<th>starting_year</th>
<th>website</th>
<th>wikidata</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>EPFL</td>
<td></td>
<td>215</td>
<td>501100001703</td>
<td>grid.5333.6</td>
<td>0</td>
<td>0000000121839049</td>
<td>NaN</td>
<td></td>
<td>Swiss Federal Institute of Technology in Lausanne</td>
<td></td>
<td></td>
<td>École Polytechnique Fédérale de Lausanne (EPFL)</td>
<td>71968</td>
<td>https://ror.org/02s376052</td>
<td>NaN</td>
<td>1853</td>
<td>http://www.epfl.ch/index.en.html</td>
<td>Q262760</td>
</tr>
<tr>
<td>1</td>
<td>UNIGE</td>
<td>Schola Genevensis</td>
<td>215</td>
<td>501100006389</td>
<td>grid.8591.5</td>
<td>0</td>
<td>0000 0001 2322 4988</td>
<td>NaN</td>
<td></td>
<td></td>
<td>Université de Genève</td>
<td>Università di Ginevra</td>
<td>University of Geneva (UNIGE)</td>
<td>342348</td>
<td>https://ror.org/01swzsf04</td>
<td>NaN</td>
<td>1559</td>
<td>https://www.unige.ch/</td>
<td>Q503473</td>
</tr>
<tr>
<td>2</td>
<td></td>
<td></td>
<td>215</td>
<td></td>
<td>grid.417771.3</td>
<td>0</td>
<td>0000 0004 4681 910X</td>
<td>NaN</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>Agroscope</td>
<td></td>
<td>https://ror.org/04d8ztx87</td>
<td>NaN</td>
<td>1850</td>
<td>https://www.agroscope.admin.ch/agroscope/en/ho...</td>
<td>Q397466</td>
</tr>
<tr>
<td>3</td>
<td>BFH</td>
<td></td>
<td>215</td>
<td>501100006259</td>
<td>grid.424060.4</td>
<td>0</td>
<td>0000 0001 0688 6779</td>
<td>NaN</td>
<td>Berner Fachhochschule</td>
<td></td>
<td>Haute école spécialisée bernoise</td>
<td></td>
<td>Bern University of Applied Sciences (BFH)</td>
<td>4365265</td>
<td>https://ror.org/02bnkt322</td>
<td>NaN</td>
<td>1997</td>
<td>http://www.bfh.ch/en/home.html</td>
<td>Q466455</td>
</tr>
<tr>
<td>4</td>
<td>ETH Zurich</td>
<td>Swiss Federal Institute of Technology in Zuric...</td>
<td>215</td>
<td>501100003006</td>
<td>grid.5801.c</td>
<td>0</td>
<td>0000 0001 2156 2780</td>
<td>NaN</td>
<td>Eidgenössische Technische Hochschule Zürich</td>
<td></td>
<td>École Polytechnique Fédérale de Zurich</td>
<td>Politecnico federale di Zurigo</td>
<td>ETH Zurich (ETH Zurich)</td>
<td>210910</td>
<td>https://ror.org/05a28rw58</td>
<td>NaN</td>
<td>1855</td>
<td>https://www.ethz.ch/en.html</td>
<td>Q11942</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>99</td>
<td>NaN</td>
<td>NaN</td>
<td>236</td>
<td>http://dx.doi.org/10.13039/100000104</td>
<td>NaN</td>
<td>1</td>
<td>NaN</td>
<td>US</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>National Aeronautics and Space Administration ...</td>
<td>NaN</td>
<td>https://ror.org/027ka1x80</td>
<td>986.0</td>
<td>NaN</td>
<td>http://science.nasa.gov/</td>
<td>NaN</td>
</tr>
<tr>
<td>100</td>
<td>NaN</td>
<td>NaN</td>
<td>236</td>
<td>http://dx.doi.org/10.13039/100000001</td>
<td>NaN</td>
<td>1</td>
<td>NaN</td>
<td>US</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>National Science Foundation (NSF)</td>
<td>NaN</td>
<td>https://ror.org/021nxhr62</td>
<td>354.0</td>
<td>NaN</td>
<td>http://www.nsf.gov/</td>
<td>NaN</td>
</tr>
<tr>
<td>101</td>
<td>NaN</td>
<td>NaN</td>
<td>234</td>
<td>http://dx.doi.org/10.13039/501100000691</td>
<td>NaN</td>
<td>1</td>
<td>NaN</td>
<td>GB</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>Academy of Medical Science</td>
<td>NaN</td>
<td>https://ror.org/00c489v88</td>
<td>1125.0</td>
<td>NaN</td>
<td>https://acmedsci.ac.uk/</td>
<td>NaN</td>
</tr>
<tr>
<td>102</td>
<td>NaN</td>
<td>NaN</td>
<td>234</td>
<td>http://dx.doi.org/10.13039/501100000771</td>
<td>NaN</td>
<td>1</td>
<td>NaN</td>
<td>GB</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>Prostate Cancer UK</td>
<td>NaN</td>
<td>https://ror.org/04dkv6329</td>
<td>742.0</td>
<td>NaN</td>
<td>http://prostatecanceruk.org/</td>
<td>NaN</td>
</tr>
<tr>
<td>103</td>
<td>NaN</td>
<td>NaN</td>
<td>215</td>
<td>http://dx.doi.org/10.13039/501100001711</td>
<td>NaN</td>
<td>1</td>
<td>NaN</td>
<td>CH</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>Schweizerischer Nationalfonds zur Förderung de...</td>
<td>NaN</td>
<td>https://ror.org/00yjd3n13</td>
<td>25.0</td>
<td>NaN</td>
<td>http://www.snf.ch/de/Seiten/default.aspx</td>
<td>NaN</td>
</tr>
</tbody>
</table>
<p>104 rows × 19 columns</p>
</div>
```python
# remplacement dans le fundref id qui renvoie vers du JSON seulement
# URL actuel : http://data.crossref.org/fundingdata/funder/10.13039/[fundref id]
# ex : http://dx.doi.org/10.13039/501100007903
# redirigé sur : http://data.crossref.org/fundingdata/funder/10.13039/501100007903
# URL des publications financées : https://search.crossref.org/funding?q=[fundref id]&from_ui=yes
# ex : https://search.crossref.org/funding?q=501100003006&from_ui=yes
organization['fundref'] = organization['fundref'].str.replace('http://dx.doi.org/10.13039/', '')
organization
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>acronym</th>
<th>aliases</th>
<th>country</th>
<th>fundref</th>
<th>grid</th>
<th>is_funder</th>
<th>isni</th>
<th>iso_code</th>
<th>label_de</th>
<th>label_en</th>
<th>label_fr</th>
<th>label_it</th>
<th>name</th>
<th>orgref</th>
<th>ror</th>
<th>sherpa_id</th>
<th>starting_year</th>
<th>website</th>
<th>wikidata</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>EPFL</td>
<td></td>
<td>215</td>
<td>501100001703</td>
<td>grid.5333.6</td>
<td>0</td>
<td>0000000121839049</td>
<td>NaN</td>
<td></td>
<td>Swiss Federal Institute of Technology in Lausanne</td>
<td></td>
<td></td>
<td>École Polytechnique Fédérale de Lausanne (EPFL)</td>
<td>71968</td>
<td>https://ror.org/02s376052</td>
<td>NaN</td>
<td>1853</td>
<td>http://www.epfl.ch/index.en.html</td>
<td>Q262760</td>
</tr>
<tr>
<td>1</td>
<td>UNIGE</td>
<td>Schola Genevensis</td>
<td>215</td>
<td>501100006389</td>
<td>grid.8591.5</td>
<td>0</td>
<td>0000 0001 2322 4988</td>
<td>NaN</td>
<td></td>
<td></td>
<td>Université de Genève</td>
<td>Università di Ginevra</td>
<td>University of Geneva (UNIGE)</td>
<td>342348</td>
<td>https://ror.org/01swzsf04</td>
<td>NaN</td>
<td>1559</td>
<td>https://www.unige.ch/</td>
<td>Q503473</td>
</tr>
<tr>
<td>2</td>
<td></td>
<td></td>
<td>215</td>
<td></td>
<td>grid.417771.3</td>
<td>0</td>
<td>0000 0004 4681 910X</td>
<td>NaN</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>Agroscope</td>
<td></td>
<td>https://ror.org/04d8ztx87</td>
<td>NaN</td>
<td>1850</td>
<td>https://www.agroscope.admin.ch/agroscope/en/ho...</td>
<td>Q397466</td>
</tr>
<tr>
<td>3</td>
<td>BFH</td>
<td></td>
<td>215</td>
<td>501100006259</td>
<td>grid.424060.4</td>
<td>0</td>
<td>0000 0001 0688 6779</td>
<td>NaN</td>
<td>Berner Fachhochschule</td>
<td></td>
<td>Haute école spécialisée bernoise</td>
<td></td>
<td>Bern University of Applied Sciences (BFH)</td>
<td>4365265</td>
<td>https://ror.org/02bnkt322</td>
<td>NaN</td>
<td>1997</td>
<td>http://www.bfh.ch/en/home.html</td>
<td>Q466455</td>
</tr>
<tr>
<td>4</td>
<td>ETH Zurich</td>
<td>Swiss Federal Institute of Technology in Zuric...</td>
<td>215</td>
<td>501100003006</td>
<td>grid.5801.c</td>
<td>0</td>
<td>0000 0001 2156 2780</td>
<td>NaN</td>
<td>Eidgenössische Technische Hochschule Zürich</td>
<td></td>
<td>École Polytechnique Fédérale de Zurich</td>
<td>Politecnico federale di Zurigo</td>
<td>ETH Zurich (ETH Zurich)</td>
<td>210910</td>
<td>https://ror.org/05a28rw58</td>
<td>NaN</td>
<td>1855</td>
<td>https://www.ethz.ch/en.html</td>
<td>Q11942</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>99</td>
<td>NaN</td>
<td>NaN</td>
<td>236</td>
<td>100000104</td>
<td>NaN</td>
<td>1</td>
<td>NaN</td>
<td>US</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>National Aeronautics and Space Administration ...</td>
<td>NaN</td>
<td>https://ror.org/027ka1x80</td>
<td>986.0</td>
<td>NaN</td>
<td>http://science.nasa.gov/</td>
<td>NaN</td>
</tr>
<tr>
<td>100</td>
<td>NaN</td>
<td>NaN</td>
<td>236</td>
<td>100000001</td>
<td>NaN</td>
<td>1</td>
<td>NaN</td>
<td>US</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>National Science Foundation (NSF)</td>
<td>NaN</td>
<td>https://ror.org/021nxhr62</td>
<td>354.0</td>
<td>NaN</td>
<td>http://www.nsf.gov/</td>
<td>NaN</td>
</tr>
<tr>
<td>101</td>
<td>NaN</td>
<td>NaN</td>
<td>234</td>
<td>501100000691</td>
<td>NaN</td>
<td>1</td>
<td>NaN</td>
<td>GB</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>Academy of Medical Science</td>
<td>NaN</td>
<td>https://ror.org/00c489v88</td>
<td>1125.0</td>
<td>NaN</td>
<td>https://acmedsci.ac.uk/</td>
<td>NaN</td>
</tr>
<tr>
<td>102</td>
<td>NaN</td>
<td>NaN</td>
<td>234</td>
<td>501100000771</td>
<td>NaN</td>
<td>1</td>
<td>NaN</td>
<td>GB</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>Prostate Cancer UK</td>
<td>NaN</td>
<td>https://ror.org/04dkv6329</td>
<td>742.0</td>
<td>NaN</td>
<td>http://prostatecanceruk.org/</td>
<td>NaN</td>
</tr>
<tr>
<td>103</td>
<td>NaN</td>
<td>NaN</td>
<td>215</td>
<td>501100001711</td>
<td>NaN</td>
<td>1</td>
<td>NaN</td>
<td>CH</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>Schweizerischer Nationalfonds zur Förderung de...</td>
<td>NaN</td>
<td>https://ror.org/00yjd3n13</td>
<td>25.0</td>
<td>NaN</td>
<td>http://www.snf.ch/de/Seiten/default.aspx</td>
<td>NaN</td>
</tr>
</tbody>
</table>
<p>104 rows × 19 columns</p>
</div>
```python
# df pour l'export
organization_export = organization[['name', 'website', 'country', 'starting_year', 'is_funder', 'ror', 'fundref']]
organization_export
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>name</th>
<th>website</th>
<th>country</th>
<th>starting_year</th>
<th>is_funder</th>
<th>ror</th>
<th>fundref</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>École Polytechnique Fédérale de Lausanne (EPFL)</td>
<td>http://www.epfl.ch/index.en.html</td>
<td>215</td>
<td>1853</td>
<td>0</td>
<td>https://ror.org/02s376052</td>
<td>501100001703</td>
</tr>
<tr>
<td>1</td>
<td>University of Geneva (UNIGE)</td>
<td>https://www.unige.ch/</td>
<td>215</td>
<td>1559</td>
<td>0</td>
<td>https://ror.org/01swzsf04</td>
<td>501100006389</td>
</tr>
<tr>
<td>2</td>
<td>Agroscope</td>
<td>https://www.agroscope.admin.ch/agroscope/en/ho...</td>
<td>215</td>
<td>1850</td>
<td>0</td>
<td>https://ror.org/04d8ztx87</td>
<td></td>
</tr>
<tr>
<td>3</td>
<td>Bern University of Applied Sciences (BFH)</td>
<td>http://www.bfh.ch/en/home.html</td>
<td>215</td>
<td>1997</td>
<td>0</td>
<td>https://ror.org/02bnkt322</td>
<td>501100006259</td>
</tr>
<tr>
<td>4</td>
<td>ETH Zurich (ETH Zurich)</td>
<td>https://www.ethz.ch/en.html</td>
<td>215</td>
<td>1855</td>
<td>0</td>
<td>https://ror.org/05a28rw58</td>
<td>501100003006</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>99</td>
<td>National Aeronautics and Space Administration ...</td>
<td>http://science.nasa.gov/</td>
<td>236</td>
<td>NaN</td>
<td>1</td>
<td>https://ror.org/027ka1x80</td>
<td>100000104</td>
</tr>
<tr>
<td>100</td>
<td>National Science Foundation (NSF)</td>
<td>http://www.nsf.gov/</td>
<td>236</td>
<td>NaN</td>
<td>1</td>
<td>https://ror.org/021nxhr62</td>
<td>100000001</td>
</tr>
<tr>
<td>101</td>
<td>Academy of Medical Science</td>
<td>https://acmedsci.ac.uk/</td>
<td>234</td>
<td>NaN</td>
<td>1</td>
<td>https://ror.org/00c489v88</td>
<td>501100000691</td>
</tr>
<tr>
<td>102</td>
<td>Prostate Cancer UK</td>
<td>http://prostatecanceruk.org/</td>
<td>234</td>
<td>NaN</td>
<td>1</td>
<td>https://ror.org/04dkv6329</td>
<td>501100000771</td>
</tr>
<tr>
<td>103</td>
<td>Schweizerischer Nationalfonds zur Förderung de...</td>
<td>http://www.snf.ch/de/Seiten/default.aspx</td>
<td>215</td>
<td>NaN</td>
<td>1</td>
<td>https://ror.org/00yjd3n13</td>
<td>501100001711</td>
</tr>
</tbody>
</table>
<p>104 rows × 7 columns</p>
</div>
```python
# ajout des valeurs vides
organization_export['starting_year'] = organization_export['starting_year'].fillna(0)
organization_export['fundref'] = organization_export['fundref'].fillna('')
organization_export['ror'] = organization_export['ror'].fillna('')
organization_export
```
C:\Users\iriarte\AppData\Local\Continuum\anaconda3\lib\site-packages\ipykernel_launcher.py:2: SettingWithCopyWarning:
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead
See the caveats in the documentation: http://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
C:\Users\iriarte\AppData\Local\Continuum\anaconda3\lib\site-packages\ipykernel_launcher.py:3: SettingWithCopyWarning:
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead
See the caveats in the documentation: http://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
This is separate from the ipykernel package so we can avoid doing imports until
C:\Users\iriarte\AppData\Local\Continuum\anaconda3\lib\site-packages\ipykernel_launcher.py:4: SettingWithCopyWarning:
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead
See the caveats in the documentation: http://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
after removing the cwd from sys.path.
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>name</th>
<th>website</th>
<th>country</th>
<th>starting_year</th>
<th>is_funder</th>
<th>ror</th>
<th>fundref</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>École Polytechnique Fédérale de Lausanne (EPFL)</td>
<td>http://www.epfl.ch/index.en.html</td>
<td>215</td>
<td>1853</td>
<td>0</td>
<td>https://ror.org/02s376052</td>
<td>501100001703</td>
</tr>
<tr>
<td>1</td>
<td>University of Geneva (UNIGE)</td>
<td>https://www.unige.ch/</td>
<td>215</td>
<td>1559</td>
<td>0</td>
<td>https://ror.org/01swzsf04</td>
<td>501100006389</td>
</tr>
<tr>
<td>2</td>
<td>Agroscope</td>
<td>https://www.agroscope.admin.ch/agroscope/en/ho...</td>
<td>215</td>
<td>1850</td>
<td>0</td>
<td>https://ror.org/04d8ztx87</td>
<td></td>
</tr>
<tr>
<td>3</td>
<td>Bern University of Applied Sciences (BFH)</td>
<td>http://www.bfh.ch/en/home.html</td>
<td>215</td>
<td>1997</td>
<td>0</td>
<td>https://ror.org/02bnkt322</td>
<td>501100006259</td>
</tr>
<tr>
<td>4</td>
<td>ETH Zurich (ETH Zurich)</td>
<td>https://www.ethz.ch/en.html</td>
<td>215</td>
<td>1855</td>
<td>0</td>
<td>https://ror.org/05a28rw58</td>
<td>501100003006</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>99</td>
<td>National Aeronautics and Space Administration ...</td>
<td>http://science.nasa.gov/</td>
<td>236</td>
<td>0</td>
<td>1</td>
<td>https://ror.org/027ka1x80</td>
<td>100000104</td>
</tr>
<tr>
<td>100</td>
<td>National Science Foundation (NSF)</td>
<td>http://www.nsf.gov/</td>
<td>236</td>
<td>0</td>
<td>1</td>
<td>https://ror.org/021nxhr62</td>
<td>100000001</td>
</tr>
<tr>
<td>101</td>
<td>Academy of Medical Science</td>
<td>https://acmedsci.ac.uk/</td>
<td>234</td>
<td>0</td>
<td>1</td>
<td>https://ror.org/00c489v88</td>
<td>501100000691</td>
</tr>
<tr>
<td>102</td>
<td>Prostate Cancer UK</td>
<td>http://prostatecanceruk.org/</td>
<td>234</td>
<td>0</td>
<td>1</td>
<td>https://ror.org/04dkv6329</td>
<td>501100000771</td>
</tr>
<tr>
<td>103</td>
<td>Schweizerischer Nationalfonds zur Förderung de...</td>
<td>http://www.snf.ch/de/Seiten/default.aspx</td>
<td>215</td>
<td>0</td>
<td>1</td>
<td>https://ror.org/00yjd3n13</td>
<td>501100001711</td>
</tr>
</tbody>
</table>
<p>104 rows × 7 columns</p>
</div>
```python
# ajout de l'id avec l'index + 1
organization_export['id'] = organization_export.index + 1
# del terms_export_dedup['index']
organization_export
```
C:\Users\iriarte\AppData\Local\Continuum\anaconda3\lib\site-packages\ipykernel_launcher.py:2: SettingWithCopyWarning:
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead
See the caveats in the documentation: http://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>name</th>
<th>website</th>
<th>country</th>
<th>starting_year</th>
<th>is_funder</th>
<th>ror</th>
<th>fundref</th>
<th>id</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>École Polytechnique Fédérale de Lausanne (EPFL)</td>
<td>http://www.epfl.ch/index.en.html</td>
<td>215</td>
<td>1853</td>
<td>0</td>
<td>https://ror.org/02s376052</td>
<td>501100001703</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>University of Geneva (UNIGE)</td>
<td>https://www.unige.ch/</td>
<td>215</td>
<td>1559</td>
<td>0</td>
<td>https://ror.org/01swzsf04</td>
<td>501100006389</td>
<td>2</td>
</tr>
<tr>
<td>2</td>
<td>Agroscope</td>
<td>https://www.agroscope.admin.ch/agroscope/en/ho...</td>
<td>215</td>
<td>1850</td>
<td>0</td>
<td>https://ror.org/04d8ztx87</td>
<td></td>
<td>3</td>
</tr>
<tr>
<td>3</td>
<td>Bern University of Applied Sciences (BFH)</td>
<td>http://www.bfh.ch/en/home.html</td>
<td>215</td>
<td>1997</td>
<td>0</td>
<td>https://ror.org/02bnkt322</td>
<td>501100006259</td>
<td>4</td>
</tr>
<tr>
<td>4</td>
<td>ETH Zurich (ETH Zurich)</td>
<td>https://www.ethz.ch/en.html</td>
<td>215</td>
<td>1855</td>
<td>0</td>
<td>https://ror.org/05a28rw58</td>
<td>501100003006</td>
<td>5</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>99</td>
<td>National Aeronautics and Space Administration ...</td>
<td>http://science.nasa.gov/</td>
<td>236</td>
<td>0</td>
<td>1</td>
<td>https://ror.org/027ka1x80</td>
<td>100000104</td>
<td>100</td>
</tr>
<tr>
<td>100</td>
<td>National Science Foundation (NSF)</td>
<td>http://www.nsf.gov/</td>
<td>236</td>
<td>0</td>
<td>1</td>
<td>https://ror.org/021nxhr62</td>
<td>100000001</td>
<td>101</td>
</tr>
<tr>
<td>101</td>
<td>Academy of Medical Science</td>
<td>https://acmedsci.ac.uk/</td>
<td>234</td>
<td>0</td>
<td>1</td>
<td>https://ror.org/00c489v88</td>
<td>501100000691</td>
<td>102</td>
</tr>
<tr>
<td>102</td>
<td>Prostate Cancer UK</td>
<td>http://prostatecanceruk.org/</td>
<td>234</td>
<td>0</td>
<td>1</td>
<td>https://ror.org/04dkv6329</td>
<td>501100000771</td>
<td>103</td>
</tr>
<tr>
<td>103</td>
<td>Schweizerischer Nationalfonds zur Förderung de...</td>
<td>http://www.snf.ch/de/Seiten/default.aspx</td>
<td>215</td>
<td>0</td>
<td>1</td>
<td>https://ror.org/00yjd3n13</td>
<td>501100001711</td>
<td>104</td>
</tr>
</tbody>
</table>
<p>104 rows × 8 columns</p>
</div>
```python
# export de la table
result = organization_export.to_json(orient='records', force_ascii=False)
parsed = json.loads(result)
with open('sample/organization.json', 'w', encoding='utf-8') as file:
json.dump(parsed, file, indent=2, ensure_ascii=False)
```
```python
# export excel
organization_export.to_excel('sample/organization.xlsx', index=False)
```
```python
# export csv
organization_export.to_csv('sample/organization.tsv', index=False)
```
## Table condition_set_term
```python
term_orig
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id_sherpa</th>
<th>version</th>
<th>cost_factor</th>
<th>embargo_months</th>
<th>archiving</th>
<th>licence</th>
<th>journal</th>
<th>prerequisite_funders</th>
<th>ror</th>
<th>comment</th>
<th>rp_id</th>
<th>valid_from</th>
<th>valid_until</th>
<th>id_content_hash</th>
<th>id_content_hash_licence</th>
<th>ir_archiving</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1.0</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>True</td>
<td>999999</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-5068777248818105392</td>
<td>-8194612545168817012</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>2.0</td>
<td>2</td>
<td>999999</td>
<td>12</td>
<td>True</td>
<td>999999</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-1187146317861229577</td>
<td>1080785657261440835</td>
<td>1</td>
</tr>
<tr>
<td>2</td>
<td>3.0</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>True</td>
<td>1</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Any Website...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-6827815856646016670</td>
<td>-4410614044147247907</td>
<td>1</td>
</tr>
<tr>
<td>3</td>
<td>4.0</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>True</td>
<td>2</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Any Website...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>5388365857945903435</td>
<td>-492868609330074007</td>
<td>1</td>
</tr>
<tr>
<td>4</td>
<td>5.0</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>False</td>
<td>999999</td>
<td>498.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Non institutional archiving locations: ChemRxi...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-2781821769548802966</td>
<td>935766765288137110</td>
<td>0</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>48673</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/01swzsf04</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40079.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1</td>
</tr>
<tr>
<td>48674</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/019whta54</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40080.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1</td>
</tr>
<tr>
<td>48675</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/00vasag41</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40081.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1</td>
</tr>
<tr>
<td>48676</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/05r0ap620</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40082.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1</td>
</tr>
<tr>
<td>48677</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/05pmsvm27</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40083.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1</td>
</tr>
</tbody>
</table>
<p>48678 rows × 16 columns</p>
</div>
```python
terms_export_dedup
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id_sherpa</th>
<th>rp_id</th>
<th>id_content_hash</th>
<th>id_content_hash_licence</th>
<th>version</th>
<th>cost_factor</th>
<th>embargo_months</th>
<th>ir_archiving</th>
<th>licence</th>
<th>comment</th>
<th>id</th>
<th>source</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1.0</td>
<td>NaN</td>
<td>-5068777248818105392</td>
<td>-8194612545168817012</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>1</td>
<td>999999</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>1</td>
<td></td>
</tr>
<tr>
<td>1</td>
<td>2.0</td>
<td>NaN</td>
<td>-1187146317861229577</td>
<td>1080785657261440835</td>
<td>2</td>
<td>999999</td>
<td>12</td>
<td>1</td>
<td>999999</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>2</td>
<td></td>
</tr>
<tr>
<td>2</td>
<td>3.0</td>
<td>NaN</td>
<td>-6827815856646016670</td>
<td>-4410614044147247907</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Institutional archiving locations: Any Website...</td>
<td>3</td>
<td></td>
</tr>
<tr>
<td>3</td>
<td>4.0</td>
<td>NaN</td>
<td>5388365857945903435</td>
<td>-492868609330074007</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>1</td>
<td>2</td>
<td>Institutional archiving locations: Any Website...</td>
<td>4</td>
<td></td>
</tr>
<tr>
<td>4</td>
<td>5.0</td>
<td>NaN</td>
<td>-2781821769548802966</td>
<td>935766765288137110</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>0</td>
<td>999999</td>
<td>Non institutional archiving locations: ChemRxi...</td>
<td>5</td>
<td></td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>1315</td>
<td>NaN</td>
<td>1.0</td>
<td>-6020029623494903364</td>
<td>-5435886237991661497</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Elsevier Read &amp; Publish agreement</td>
<td>1316</td>
<td></td>
</tr>
<tr>
<td>1316</td>
<td>NaN</td>
<td>18129.0</td>
<td>-1955262099488276438</td>
<td>6359482801433181261</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>NaN</td>
<td>1317</td>
<td></td>
</tr>
<tr>
<td>1317</td>
<td>NaN</td>
<td>24845.0</td>
<td>-681455397323083870</td>
<td>5265079689140421989</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Wiley Read &amp; Publish agreement</td>
<td>1318</td>
<td></td>
</tr>
<tr>
<td>1318</td>
<td>NaN</td>
<td>38750.0</td>
<td>6747956201225830719</td>
<td>-4648758608429098534</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Taylor and Francis Read &amp; Publish agreement</td>
<td>1319</td>
<td></td>
</tr>
<tr>
<td>1319</td>
<td>NaN</td>
<td>39164.0</td>
<td>7687377827846095855</td>
<td>2298488065455407402</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>1</td>
<td>1</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>1320</td>
<td></td>
</tr>
</tbody>
</table>
<p>1320 rows × 12 columns</p>
</div>
```python
# merge des terms id
term_orig = pd.merge(term_orig, terms_export_dedup[['id_content_hash', 'id']], on='id_content_hash', how='left')
term_orig
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id_sherpa</th>
<th>version</th>
<th>cost_factor</th>
<th>embargo_months</th>
<th>archiving</th>
<th>licence</th>
<th>journal</th>
<th>prerequisite_funders</th>
<th>ror</th>
<th>comment</th>
<th>rp_id</th>
<th>valid_from</th>
<th>valid_until</th>
<th>id_content_hash</th>
<th>id_content_hash_licence</th>
<th>ir_archiving</th>
<th>id</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1.0</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>True</td>
<td>999999</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-5068777248818105392</td>
<td>-8194612545168817012</td>
<td>1</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>2.0</td>
<td>2</td>
<td>999999</td>
<td>12</td>
<td>True</td>
<td>999999</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-1187146317861229577</td>
<td>1080785657261440835</td>
<td>1</td>
<td>2</td>
</tr>
<tr>
<td>2</td>
<td>3.0</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>True</td>
<td>1</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Any Website...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-6827815856646016670</td>
<td>-4410614044147247907</td>
<td>1</td>
<td>3</td>
</tr>
<tr>
<td>3</td>
<td>4.0</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>True</td>
<td>2</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Any Website...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>5388365857945903435</td>
<td>-492868609330074007</td>
<td>1</td>
<td>4</td>
</tr>
<tr>
<td>4</td>
<td>5.0</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>False</td>
<td>999999</td>
<td>498.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Non institutional archiving locations: ChemRxi...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-2781821769548802966</td>
<td>935766765288137110</td>
<td>0</td>
<td>5</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>48673</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/01swzsf04</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40079.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1</td>
<td>1320</td>
</tr>
<tr>
<td>48674</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/019whta54</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40080.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1</td>
<td>1320</td>
</tr>
<tr>
<td>48675</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/00vasag41</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40081.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1</td>
<td>1320</td>
</tr>
<tr>
<td>48676</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/05r0ap620</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40082.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1</td>
<td>1320</td>
</tr>
<tr>
<td>48677</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/05pmsvm27</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40083.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1</td>
<td>1320</td>
</tr>
</tbody>
</table>
<p>48678 rows × 17 columns</p>
</div>
```python
term_orig = term_orig.rename(columns = {'id' : 'term'})
term_orig
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id_sherpa</th>
<th>version</th>
<th>cost_factor</th>
<th>embargo_months</th>
<th>archiving</th>
<th>licence</th>
<th>journal</th>
<th>prerequisite_funders</th>
<th>ror</th>
<th>comment</th>
<th>rp_id</th>
<th>valid_from</th>
<th>valid_until</th>
<th>id_content_hash</th>
<th>id_content_hash_licence</th>
<th>ir_archiving</th>
<th>term</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1.0</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>True</td>
<td>999999</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-5068777248818105392</td>
<td>-8194612545168817012</td>
<td>1</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>2.0</td>
<td>2</td>
<td>999999</td>
<td>12</td>
<td>True</td>
<td>999999</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-1187146317861229577</td>
<td>1080785657261440835</td>
<td>1</td>
<td>2</td>
</tr>
<tr>
<td>2</td>
<td>3.0</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>True</td>
<td>1</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Any Website...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-6827815856646016670</td>
<td>-4410614044147247907</td>
<td>1</td>
<td>3</td>
</tr>
<tr>
<td>3</td>
<td>4.0</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>True</td>
<td>2</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Any Website...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>5388365857945903435</td>
<td>-492868609330074007</td>
<td>1</td>
<td>4</td>
</tr>
<tr>
<td>4</td>
<td>5.0</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>False</td>
<td>999999</td>
<td>498.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Non institutional archiving locations: ChemRxi...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-2781821769548802966</td>
<td>935766765288137110</td>
<td>0</td>
<td>5</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>48673</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/01swzsf04</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40079.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1</td>
<td>1320</td>
</tr>
<tr>
<td>48674</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/019whta54</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40080.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1</td>
<td>1320</td>
</tr>
<tr>
<td>48675</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/00vasag41</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40081.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1</td>
<td>1320</td>
</tr>
<tr>
<td>48676</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/05r0ap620</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40082.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1</td>
<td>1320</td>
</tr>
<tr>
<td>48677</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/05pmsvm27</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40083.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1</td>
<td>1320</td>
</tr>
</tbody>
</table>
<p>48678 rows × 17 columns</p>
</div>
```python
condition_type
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id</th>
<th>condition_issuer</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>Journal-only</td>
</tr>
<tr>
<td>1</td>
<td>2</td>
<td>Organization-only</td>
</tr>
<tr>
<td>2</td>
<td>3</td>
<td>Journal-organization agreement</td>
</tr>
</tbody>
</table>
</div>
```python
# merge des condition type
term_orig['condition_type'] = 3
term_orig.loc[term_orig['ror'].isna(), 'condition_type'] = 1
term_orig
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id_sherpa</th>
<th>version</th>
<th>cost_factor</th>
<th>embargo_months</th>
<th>archiving</th>
<th>licence</th>
<th>journal</th>
<th>prerequisite_funders</th>
<th>ror</th>
<th>comment</th>
<th>rp_id</th>
<th>valid_from</th>
<th>valid_until</th>
<th>id_content_hash</th>
<th>id_content_hash_licence</th>
<th>ir_archiving</th>
<th>term</th>
<th>condition_type</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1.0</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>True</td>
<td>999999</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-5068777248818105392</td>
<td>-8194612545168817012</td>
<td>1</td>
<td>1</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>2.0</td>
<td>2</td>
<td>999999</td>
<td>12</td>
<td>True</td>
<td>999999</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-1187146317861229577</td>
<td>1080785657261440835</td>
<td>1</td>
<td>2</td>
<td>1</td>
</tr>
<tr>
<td>2</td>
<td>3.0</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>True</td>
<td>1</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Any Website...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-6827815856646016670</td>
<td>-4410614044147247907</td>
<td>1</td>
<td>3</td>
<td>1</td>
</tr>
<tr>
<td>3</td>
<td>4.0</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>True</td>
<td>2</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Any Website...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>5388365857945903435</td>
<td>-492868609330074007</td>
<td>1</td>
<td>4</td>
<td>1</td>
</tr>
<tr>
<td>4</td>
<td>5.0</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>False</td>
<td>999999</td>
<td>498.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Non institutional archiving locations: ChemRxi...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-2781821769548802966</td>
<td>935766765288137110</td>
<td>0</td>
<td>5</td>
<td>1</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>48673</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/01swzsf04</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40079.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1</td>
<td>1320</td>
<td>3</td>
</tr>
<tr>
<td>48674</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/019whta54</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40080.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1</td>
<td>1320</td>
<td>3</td>
</tr>
<tr>
<td>48675</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/00vasag41</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40081.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1</td>
<td>1320</td>
<td>3</td>
</tr>
<tr>
<td>48676</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/05r0ap620</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40082.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1</td>
<td>1320</td>
<td>3</td>
</tr>
<tr>
<td>48677</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/05pmsvm27</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40083.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1</td>
<td>1320</td>
<td>3</td>
</tr>
</tbody>
</table>
<p>48678 rows × 18 columns</p>
</div>
```python
organization_export
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>name</th>
<th>website</th>
<th>country</th>
<th>starting_year</th>
<th>is_funder</th>
<th>ror</th>
<th>fundref</th>
<th>id</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>École Polytechnique Fédérale de Lausanne (EPFL)</td>
<td>http://www.epfl.ch/index.en.html</td>
<td>215</td>
<td>1853</td>
<td>0</td>
<td>https://ror.org/02s376052</td>
<td>501100001703</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>University of Geneva (UNIGE)</td>
<td>https://www.unige.ch/</td>
<td>215</td>
<td>1559</td>
<td>0</td>
<td>https://ror.org/01swzsf04</td>
<td>501100006389</td>
<td>2</td>
</tr>
<tr>
<td>2</td>
<td>Agroscope</td>
<td>https://www.agroscope.admin.ch/agroscope/en/ho...</td>
<td>215</td>
<td>1850</td>
<td>0</td>
<td>https://ror.org/04d8ztx87</td>
<td></td>
<td>3</td>
</tr>
<tr>
<td>3</td>
<td>Bern University of Applied Sciences (BFH)</td>
<td>http://www.bfh.ch/en/home.html</td>
<td>215</td>
<td>1997</td>
<td>0</td>
<td>https://ror.org/02bnkt322</td>
<td>501100006259</td>
<td>4</td>
</tr>
<tr>
<td>4</td>
<td>ETH Zurich (ETH Zurich)</td>
<td>https://www.ethz.ch/en.html</td>
<td>215</td>
<td>1855</td>
<td>0</td>
<td>https://ror.org/05a28rw58</td>
<td>501100003006</td>
<td>5</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>99</td>
<td>National Aeronautics and Space Administration ...</td>
<td>http://science.nasa.gov/</td>
<td>236</td>
<td>0</td>
<td>1</td>
<td>https://ror.org/027ka1x80</td>
<td>100000104</td>
<td>100</td>
</tr>
<tr>
<td>100</td>
<td>National Science Foundation (NSF)</td>
<td>http://www.nsf.gov/</td>
<td>236</td>
<td>0</td>
<td>1</td>
<td>https://ror.org/021nxhr62</td>
<td>100000001</td>
<td>101</td>
</tr>
<tr>
<td>101</td>
<td>Academy of Medical Science</td>
<td>https://acmedsci.ac.uk/</td>
<td>234</td>
<td>0</td>
<td>1</td>
<td>https://ror.org/00c489v88</td>
<td>501100000691</td>
<td>102</td>
</tr>
<tr>
<td>102</td>
<td>Prostate Cancer UK</td>
<td>http://prostatecanceruk.org/</td>
<td>234</td>
<td>0</td>
<td>1</td>
<td>https://ror.org/04dkv6329</td>
<td>501100000771</td>
<td>103</td>
</tr>
<tr>
<td>103</td>
<td>Schweizerischer Nationalfonds zur Förderung de...</td>
<td>http://www.snf.ch/de/Seiten/default.aspx</td>
<td>215</td>
<td>0</td>
<td>1</td>
<td>https://ror.org/00yjd3n13</td>
<td>501100001711</td>
<td>104</td>
</tr>
</tbody>
</table>
<p>104 rows × 8 columns</p>
</div>
```python
# merge des organizations
term_orig = pd.merge(term_orig, organization_export[['ror', 'id']], on='ror', how='left')
term_orig
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id_sherpa</th>
<th>version</th>
<th>cost_factor</th>
<th>embargo_months</th>
<th>archiving</th>
<th>licence</th>
<th>journal</th>
<th>prerequisite_funders</th>
<th>ror</th>
<th>comment</th>
<th>rp_id</th>
<th>valid_from</th>
<th>valid_until</th>
<th>id_content_hash</th>
<th>id_content_hash_licence</th>
<th>ir_archiving</th>
<th>term</th>
<th>condition_type</th>
<th>id</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1.0</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>True</td>
<td>999999</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-5068777248818105392</td>
<td>-8194612545168817012</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>NaN</td>
</tr>
<tr>
<td>1</td>
<td>2.0</td>
<td>2</td>
<td>999999</td>
<td>12</td>
<td>True</td>
<td>999999</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-1187146317861229577</td>
<td>1080785657261440835</td>
<td>1</td>
<td>2</td>
<td>1</td>
<td>NaN</td>
</tr>
<tr>
<td>2</td>
<td>3.0</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>True</td>
<td>1</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Any Website...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-6827815856646016670</td>
<td>-4410614044147247907</td>
<td>1</td>
<td>3</td>
<td>1</td>
<td>NaN</td>
</tr>
<tr>
<td>3</td>
<td>4.0</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>True</td>
<td>2</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Any Website...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>5388365857945903435</td>
<td>-492868609330074007</td>
<td>1</td>
<td>4</td>
<td>1</td>
<td>NaN</td>
</tr>
<tr>
<td>4</td>
<td>5.0</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>False</td>
<td>999999</td>
<td>498.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Non institutional archiving locations: ChemRxi...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-2781821769548802966</td>
<td>935766765288137110</td>
<td>0</td>
<td>5</td>
<td>1</td>
<td>NaN</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>48673</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/01swzsf04</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40079.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1</td>
<td>1320</td>
<td>3</td>
<td>2.0</td>
</tr>
<tr>
<td>48674</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/019whta54</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40080.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1</td>
<td>1320</td>
<td>3</td>
<td>35.0</td>
</tr>
<tr>
<td>48675</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/00vasag41</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40081.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1</td>
<td>1320</td>
<td>3</td>
<td>38.0</td>
</tr>
<tr>
<td>48676</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/05r0ap620</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40082.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1</td>
<td>1320</td>
<td>3</td>
<td>46.0</td>
</tr>
<tr>
<td>48677</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/05pmsvm27</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40083.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1</td>
<td>1320</td>
<td>3</td>
<td>43.0</td>
</tr>
</tbody>
</table>
<p>48678 rows × 19 columns</p>
</div>
```python
term_orig = term_orig.rename(columns = {'id' : 'organization'})
term_orig
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id_sherpa</th>
<th>version</th>
<th>cost_factor</th>
<th>embargo_months</th>
<th>archiving</th>
<th>licence</th>
<th>journal</th>
<th>prerequisite_funders</th>
<th>ror</th>
<th>comment</th>
<th>rp_id</th>
<th>valid_from</th>
<th>valid_until</th>
<th>id_content_hash</th>
<th>id_content_hash_licence</th>
<th>ir_archiving</th>
<th>term</th>
<th>condition_type</th>
<th>organization</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1.0</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>True</td>
<td>999999</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-5068777248818105392</td>
<td>-8194612545168817012</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>NaN</td>
</tr>
<tr>
<td>1</td>
<td>2.0</td>
<td>2</td>
<td>999999</td>
<td>12</td>
<td>True</td>
<td>999999</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-1187146317861229577</td>
<td>1080785657261440835</td>
<td>1</td>
<td>2</td>
<td>1</td>
<td>NaN</td>
</tr>
<tr>
<td>2</td>
<td>3.0</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>True</td>
<td>1</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Any Website...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-6827815856646016670</td>
<td>-4410614044147247907</td>
<td>1</td>
<td>3</td>
<td>1</td>
<td>NaN</td>
</tr>
<tr>
<td>3</td>
<td>4.0</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>True</td>
<td>2</td>
<td>532.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Institutional archiving locations: Any Website...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>5388365857945903435</td>
<td>-492868609330074007</td>
<td>1</td>
<td>4</td>
<td>1</td>
<td>NaN</td>
</tr>
<tr>
<td>4</td>
<td>5.0</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>False</td>
<td>999999</td>
<td>498.0</td>
<td>NaN</td>
<td>NaN</td>
<td>Non institutional archiving locations: ChemRxi...</td>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>-2781821769548802966</td>
<td>935766765288137110</td>
<td>0</td>
<td>5</td>
<td>1</td>
<td>NaN</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>48673</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/01swzsf04</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40079.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1</td>
<td>1320</td>
<td>3</td>
<td>2.0</td>
</tr>
<tr>
<td>48674</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/019whta54</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40080.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1</td>
<td>1320</td>
<td>3</td>
<td>35.0</td>
</tr>
<tr>
<td>48675</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/00vasag41</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40081.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1</td>
<td>1320</td>
<td>3</td>
<td>38.0</td>
</tr>
<tr>
<td>48676</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/05r0ap620</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40082.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1</td>
<td>1320</td>
<td>3</td>
<td>46.0</td>
</tr>
<tr>
<td>48677</td>
<td>NaN</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>True</td>
<td>5</td>
<td>592.0</td>
<td>NaN</td>
<td>https://ror.org/05pmsvm27</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>40083.0</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>7687377827846095855</td>
<td>2298495942200956358</td>
<td>1</td>
<td>1320</td>
<td>3</td>
<td>43.0</td>
</tr>
</tbody>
</table>
<p>48678 rows × 19 columns</p>
</div>
```python
# concat valeurs avec même id
condition_set_term_dedup_terms = term_orig[['term', 'id_content_hash']]
condition_set_term_dedup_terms_dedup = condition_set_term_dedup_terms.drop_duplicates()
condition_set_term_dedup_terms_dedup = condition_set_term_dedup_terms_dedup.loc[condition_set_term_dedup_terms_dedup['term'].notna()]
condition_set_term_dedup_terms_dedup['term'] = condition_set_term_dedup_terms_dedup['term'].astype(int)
condition_set_term_dedup_terms_dedup['term'] = condition_set_term_dedup_terms_dedup['term'].astype(str)
condition_set_term_dedup_terms_dedup = condition_set_term_dedup_terms_dedup.groupby('id_content_hash').agg({'term': lambda x: ', '.join(x)})
condition_set_term_dedup_terms_dedup
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>term</th>
</tr>
<tr>
<th>id_content_hash</th>
<th></th>
</tr>
</thead>
<tbody>
<tr>
<td>-9213354388875732238</td>
<td>271</td>
</tr>
<tr>
<td>-9200070744422558377</td>
<td>1039</td>
</tr>
<tr>
<td>-9171783117023104395</td>
<td>1175</td>
</tr>
<tr>
<td>-9134952646468948163</td>
<td>1283</td>
</tr>
<tr>
<td>-9133013648751406289</td>
<td>1106</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>9195001330432352893</td>
<td>1103</td>
</tr>
<tr>
<td>9200466168345981543</td>
<td>250</td>
</tr>
<tr>
<td>9213878808178729253</td>
<td>580</td>
</tr>
<tr>
<td>9218389208912777882</td>
<td>38</td>
</tr>
<tr>
<td>9219045216097074691</td>
<td>919</td>
</tr>
</tbody>
</table>
<p>1320 rows × 1 columns</p>
</div>
```python
# concat valeurs avec même id
condition_set_term_dedup_journals = term_orig[['journal', 'id_content_hash']]
condition_set_term_dedup_journals_dedup = condition_set_term_dedup_journals.drop_duplicates()
condition_set_term_dedup_journals_dedup = condition_set_term_dedup_journals_dedup.loc[condition_set_term_dedup_journals_dedup['journal'].notna()]
condition_set_term_dedup_journals_dedup['journal'] = condition_set_term_dedup_journals_dedup['journal'].astype(int)
condition_set_term_dedup_journals_dedup['journal'] = condition_set_term_dedup_journals_dedup['journal'].astype(str)
condition_set_term_dedup_journals_dedup = condition_set_term_dedup_journals_dedup.groupby('id_content_hash').agg({'journal': lambda x: ', '.join(x)})
condition_set_term_dedup_journals_dedup
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>journal</th>
</tr>
<tr>
<th>id_content_hash</th>
<th></th>
</tr>
</thead>
<tbody>
<tr>
<td>-9213354388875732238</td>
<td>342, 219, 18, 918, 309, 543, 642, 27, 246, 64,...</td>
</tr>
<tr>
<td>-9200070744422558377</td>
<td>427</td>
</tr>
<tr>
<td>-9171783117023104395</td>
<td>548, 240, 298, 132, 3, 516</td>
</tr>
<tr>
<td>-9134952646468948163</td>
<td>990</td>
</tr>
<tr>
<td>-9133013648751406289</td>
<td>366</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>9195001330432352893</td>
<td>687</td>
</tr>
<tr>
<td>9200466168345981543</td>
<td>230</td>
</tr>
<tr>
<td>9213878808178729253</td>
<td>722</td>
</tr>
<tr>
<td>9218389208912777882</td>
<td>199</td>
</tr>
<tr>
<td>9219045216097074691</td>
<td>190</td>
</tr>
</tbody>
</table>
<p>1320 rows × 1 columns</p>
</div>
```python
# concat valeurs avec même id
condition_set_term_dedup_organizations = term_orig[['organization', 'id_content_hash']]
condition_set_term_dedup_organizations_dedup = condition_set_term_dedup_organizations.drop_duplicates()
condition_set_term_dedup_organizations_dedup = condition_set_term_dedup_organizations_dedup.loc[condition_set_term_dedup_organizations_dedup['organization'].notna()]
condition_set_term_dedup_organizations_dedup['organization'] = condition_set_term_dedup_organizations_dedup['organization'].astype(int)
condition_set_term_dedup_organizations_dedup['organization'] = condition_set_term_dedup_organizations_dedup['organization'].astype(str)
condition_set_term_dedup_organizations_dedup = condition_set_term_dedup_organizations_dedup.groupby('id_content_hash').agg({'organization': lambda x: ', '.join(x)})
condition_set_term_dedup_organizations_dedup
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>organization</th>
</tr>
<tr>
<th>id_content_hash</th>
<th></th>
</tr>
</thead>
<tbody>
<tr>
<td>-9213354388875732238</td>
<td>75, 76, 77, 78</td>
</tr>
<tr>
<td>-9200070744422558377</td>
<td>47</td>
</tr>
<tr>
<td>-9134952646468948163</td>
<td>48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59...</td>
</tr>
<tr>
<td>-9133013648751406289</td>
<td>48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59...</td>
</tr>
<tr>
<td>-9085129519950455938</td>
<td>48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59...</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>8745253383893524719</td>
<td>48, 64, 51, 74, 68, 67, 69, 59</td>
</tr>
<tr>
<td>8913401298465203811</td>
<td>48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59...</td>
</tr>
<tr>
<td>8999447149908101495</td>
<td>48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59...</td>
</tr>
<tr>
<td>9195001330432352893</td>
<td>48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59...</td>
</tr>
<tr>
<td>9219045216097074691</td>
<td>48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59...</td>
</tr>
</tbody>
</table>
<p>277 rows × 1 columns</p>
</div>
```python
# concat valeurs avec même id : pas possible pour condition_type
condition_set_term_dedup_condition_types = term_orig[['condition_type', 'id_content_hash']]
condition_set_term_dedup_condition_types_dedup = condition_set_term_dedup_condition_types.drop_duplicates()
condition_set_term_dedup_condition_types_dedup = condition_set_term_dedup_condition_types_dedup.loc[condition_set_term_dedup_condition_types_dedup['condition_type'].notna()]
# condition_set_term_dedup_condition_types_dedup['condition_type'] = condition_set_term_dedup_condition_types_dedup['condition_type'].astype(int)
# condition_set_term_dedup_condition_types_dedup['condition_type'] = condition_set_term_dedup_condition_types_dedup['condition_type'].astype(str)
# condition_set_term_dedup_condition_types_dedup = condition_set_term_dedup_condition_types_dedup.groupby('id_content_hash').agg({'condition_type': lambda x: ', '.join(x)})
condition_set_term_dedup_condition_types_dedup
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>condition_type</th>
<th>id_content_hash</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>-5068777248818105392</td>
</tr>
<tr>
<td>1</td>
<td>1</td>
<td>-1187146317861229577</td>
</tr>
<tr>
<td>2</td>
<td>1</td>
<td>-6827815856646016670</td>
</tr>
<tr>
<td>3</td>
<td>1</td>
<td>5388365857945903435</td>
</tr>
<tr>
<td>4</td>
<td>1</td>
<td>-2781821769548802966</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>33439</td>
<td>3</td>
<td>-681455397323083870</td>
</tr>
<tr>
<td>47344</td>
<td>3</td>
<td>6747956201225830719</td>
</tr>
<tr>
<td>47362</td>
<td>1</td>
<td>6747956201225830719</td>
</tr>
<tr>
<td>47758</td>
<td>3</td>
<td>7687377827846095855</td>
</tr>
<tr>
<td>47776</td>
<td>1</td>
<td>7687377827846095855</td>
</tr>
</tbody>
</table>
<p>1533 rows × 2 columns</p>
</div>
```python
# recuperation des ids groupés
terms_export_dedup = pd.merge(terms_export_dedup, condition_set_term_dedup_terms_dedup, on='id_content_hash', how='left')
terms_export_dedup = pd.merge(terms_export_dedup, condition_set_term_dedup_journals_dedup, on='id_content_hash', how='left')
terms_export_dedup = pd.merge(terms_export_dedup, condition_set_term_dedup_organizations_dedup, on='id_content_hash', how='left')
terms_export_dedup = pd.merge(terms_export_dedup, condition_set_term_dedup_condition_types_dedup, on='id_content_hash', how='left')
terms_export_dedup
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id_sherpa</th>
<th>rp_id</th>
<th>id_content_hash</th>
<th>id_content_hash_licence</th>
<th>version</th>
<th>cost_factor</th>
<th>embargo_months</th>
<th>ir_archiving</th>
<th>licence</th>
<th>comment</th>
<th>id</th>
<th>source</th>
<th>term</th>
<th>journal</th>
<th>organization</th>
<th>condition_type</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1.0</td>
<td>NaN</td>
<td>-5068777248818105392</td>
<td>-8194612545168817012</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>1</td>
<td>999999</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>1</td>
<td></td>
<td>1</td>
<td>532, 482, 452, 663, 323, 674, 317, 154, 439, 5...</td>
<td>NaN</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>2.0</td>
<td>NaN</td>
<td>-1187146317861229577</td>
<td>1080785657261440835</td>
<td>2</td>
<td>999999</td>
<td>12</td>
<td>1</td>
<td>999999</td>
<td>Institutional archiving locations: Non-Commerc...</td>
<td>2</td>
<td></td>
<td>2</td>
<td>532, 482, 452, 663, 323, 674, 317, 154, 439, 5...</td>
<td>NaN</td>
<td>1</td>
</tr>
<tr>
<td>2</td>
<td>3.0</td>
<td>NaN</td>
<td>-6827815856646016670</td>
<td>-4410614044147247907</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Institutional archiving locations: Any Website...</td>
<td>3</td>
<td></td>
<td>3</td>
<td>532</td>
<td>NaN</td>
<td>1</td>
</tr>
<tr>
<td>3</td>
<td>4.0</td>
<td>NaN</td>
<td>5388365857945903435</td>
<td>-492868609330074007</td>
<td>3</td>
<td>355</td>
<td>0</td>
<td>1</td>
<td>2</td>
<td>Institutional archiving locations: Any Website...</td>
<td>4</td>
<td></td>
<td>4</td>
<td>532</td>
<td>NaN</td>
<td>1</td>
</tr>
<tr>
<td>4</td>
<td>5.0</td>
<td>NaN</td>
<td>-2781821769548802966</td>
<td>935766765288137110</td>
<td>1</td>
<td>999999</td>
<td>0</td>
<td>0</td>
<td>999999</td>
<td>Non institutional archiving locations: ChemRxi...</td>
<td>5</td>
<td></td>
<td>5</td>
<td>498, 70, 359, 573, 63, 66, 274, 116, 384, 163,...</td>
<td>NaN</td>
<td>1</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>1528</td>
<td>NaN</td>
<td>24845.0</td>
<td>-681455397323083870</td>
<td>5265079689140421989</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Wiley Read &amp; Publish agreement</td>
<td>1318</td>
<td></td>
<td>1318</td>
<td>942, 854, 933, 297, 130, 144, 549, 283, 512, 1...</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>3</td>
</tr>
<tr>
<td>1529</td>
<td>NaN</td>
<td>38750.0</td>
<td>6747956201225830719</td>
<td>-4648758608429098534</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Taylor and Francis Read &amp; Publish agreement</td>
<td>1319</td>
<td></td>
<td>1319</td>
<td>714, 633, 48, 704, 408, 535, 754, 581, 979</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>3</td>
</tr>
<tr>
<td>1530</td>
<td>NaN</td>
<td>38750.0</td>
<td>6747956201225830719</td>
<td>-4648758608429098534</td>
<td>3</td>
<td>581</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>Taylor and Francis Read &amp; Publish agreement</td>
<td>1319</td>
<td></td>
<td>1319</td>
<td>714, 633, 48, 704, 408, 535, 754, 581, 979</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1</td>
</tr>
<tr>
<td>1531</td>
<td>NaN</td>
<td>39164.0</td>
<td>7687377827846095855</td>
<td>2298488065455407402</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>1</td>
<td>1</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>1320</td>
<td></td>
<td>1320</td>
<td>866, 171, 186, 839, 592</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>3</td>
</tr>
<tr>
<td>1532</td>
<td>NaN</td>
<td>39164.0</td>
<td>7687377827846095855</td>
<td>2298488065455407402</td>
<td>3</td>
<td>581</td>
<td>60</td>
<td>1</td>
<td>1</td>
<td>Cambridge University Press (CUP) Read &amp; Publis...</td>
<td>1320</td>
<td></td>
<td>1320</td>
<td>866, 171, 186, 839, 592</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1</td>
</tr>
</tbody>
</table>
<p>1533 rows × 16 columns</p>
</div>
```python
condition_sets_orig = terms_export_dedup[['term', 'condition_type', 'organization', 'journal']]
condition_sets_orig
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>term</th>
<th>condition_type</th>
<th>organization</th>
<th>journal</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>1</td>
<td>NaN</td>
<td>532, 482, 452, 663, 323, 674, 317, 154, 439, 5...</td>
</tr>
<tr>
<td>1</td>
<td>2</td>
<td>1</td>
<td>NaN</td>
<td>532, 482, 452, 663, 323, 674, 317, 154, 439, 5...</td>
</tr>
<tr>
<td>2</td>
<td>3</td>
<td>1</td>
<td>NaN</td>
<td>532</td>
</tr>
<tr>
<td>3</td>
<td>4</td>
<td>1</td>
<td>NaN</td>
<td>532</td>
</tr>
<tr>
<td>4</td>
<td>5</td>
<td>1</td>
<td>NaN</td>
<td>498, 70, 359, 573, 63, 66, 274, 116, 384, 163,...</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>1528</td>
<td>1318</td>
<td>3</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>942, 854, 933, 297, 130, 144, 549, 283, 512, 1...</td>
</tr>
<tr>
<td>1529</td>
<td>1319</td>
<td>3</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>714, 633, 48, 704, 408, 535, 754, 581, 979</td>
</tr>
<tr>
<td>1530</td>
<td>1319</td>
<td>1</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>714, 633, 48, 704, 408, 535, 754, 581, 979</td>
</tr>
<tr>
<td>1531</td>
<td>1320</td>
<td>3</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>866, 171, 186, 839, 592</td>
</tr>
<tr>
<td>1532</td>
<td>1320</td>
<td>1</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>866, 171, 186, 839, 592</td>
</tr>
</tbody>
</table>
<p>1533 rows × 4 columns</p>
</div>
```python
# ajout d'un hash unique pour chaque variante
condition_sets_orig['id_term_hash'] = condition_sets_orig.apply(lambda x: hash(tuple(x[['condition_type', 'organization', 'journal']])), axis = 1)
condition_sets_orig
```
C:\Users\iriarte\AppData\Local\Continuum\anaconda3\lib\site-packages\ipykernel_launcher.py:2: SettingWithCopyWarning:
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead
See the caveats in the documentation: http://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>term</th>
<th>condition_type</th>
<th>organization</th>
<th>journal</th>
<th>id_term_hash</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>1</td>
<td>NaN</td>
<td>532, 482, 452, 663, 323, 674, 317, 154, 439, 5...</td>
<td>-5197283134070040275</td>
</tr>
<tr>
<td>1</td>
<td>2</td>
<td>1</td>
<td>NaN</td>
<td>532, 482, 452, 663, 323, 674, 317, 154, 439, 5...</td>
<td>-5197283134070040275</td>
</tr>
<tr>
<td>2</td>
<td>3</td>
<td>1</td>
<td>NaN</td>
<td>532</td>
<td>-3428409893954144223</td>
</tr>
<tr>
<td>3</td>
<td>4</td>
<td>1</td>
<td>NaN</td>
<td>532</td>
<td>-3428409893954144223</td>
</tr>
<tr>
<td>4</td>
<td>5</td>
<td>1</td>
<td>NaN</td>
<td>498, 70, 359, 573, 63, 66, 274, 116, 384, 163,...</td>
<td>5362274893926121442</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>1528</td>
<td>1318</td>
<td>3</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>942, 854, 933, 297, 130, 144, 549, 283, 512, 1...</td>
<td>-32115995447722756</td>
</tr>
<tr>
<td>1529</td>
<td>1319</td>
<td>3</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>714, 633, 48, 704, 408, 535, 754, 581, 979</td>
<td>4789694892756018439</td>
</tr>
<tr>
<td>1530</td>
<td>1319</td>
<td>1</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>714, 633, 48, 704, 408, 535, 754, 581, 979</td>
<td>7722626036678389533</td>
</tr>
<tr>
<td>1531</td>
<td>1320</td>
<td>3</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>866, 171, 186, 839, 592</td>
<td>6902392350219571553</td>
</tr>
<tr>
<td>1532</td>
<td>1320</td>
<td>1</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>866, 171, 186, 839, 592</td>
<td>4611302665250055299</td>
</tr>
</tbody>
</table>
<p>1533 rows × 5 columns</p>
</div>
```python
# grouper les termes qui ont les mêmes valeurs pour le reste
condition_sets_orig_terms = condition_sets_orig[['term', 'id_term_hash']]
condition_sets_orig_terms_dedup = condition_sets_orig_terms.drop_duplicates()
condition_sets_orig_terms_dedup = condition_sets_orig_terms_dedup.loc[condition_sets_orig_terms_dedup['term'].notna()]
condition_sets_orig_terms_dedup['term'] = condition_sets_orig_terms_dedup['term'].astype(int)
condition_sets_orig_terms_dedup['term'] = condition_sets_orig_terms_dedup['term'].astype(str)
condition_sets_orig_terms_dedup = condition_sets_orig_terms_dedup.groupby('id_term_hash').agg({'term': lambda x: ', '.join(x)})
condition_sets_orig_terms_dedup
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>term</th>
</tr>
<tr>
<th>id_term_hash</th>
<th></th>
</tr>
</thead>
<tbody>
<tr>
<td>-9221122160312283608</td>
<td>796</td>
</tr>
<tr>
<td>-9194263828544732083</td>
<td>812</td>
</tr>
<tr>
<td>-9192944961126408089</td>
<td>1246</td>
</tr>
<tr>
<td>-9191653994283170820</td>
<td>965</td>
</tr>
<tr>
<td>-9180782299480364441</td>
<td>1185</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>9197647807999611822</td>
<td>421</td>
</tr>
<tr>
<td>9200686802301911565</td>
<td>359</td>
</tr>
<tr>
<td>9203218741230767213</td>
<td>1056</td>
</tr>
<tr>
<td>9211734360905731286</td>
<td>630, 631</td>
</tr>
<tr>
<td>9214772761176685077</td>
<td>706</td>
</tr>
</tbody>
</table>
<p>1149 rows × 1 columns</p>
</div>
```python
# ajout des ids groupées
condition_sets_orig_terms = pd.merge(condition_sets_orig, condition_sets_orig_terms_dedup, on='id_term_hash', how='left')
condition_sets_orig_terms
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>term_x</th>
<th>condition_type</th>
<th>organization</th>
<th>journal</th>
<th>id_term_hash</th>
<th>term_y</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>1</td>
<td>NaN</td>
<td>532, 482, 452, 663, 323, 674, 317, 154, 439, 5...</td>
<td>-5197283134070040275</td>
<td>1, 2</td>
</tr>
<tr>
<td>1</td>
<td>2</td>
<td>1</td>
<td>NaN</td>
<td>532, 482, 452, 663, 323, 674, 317, 154, 439, 5...</td>
<td>-5197283134070040275</td>
<td>1, 2</td>
</tr>
<tr>
<td>2</td>
<td>3</td>
<td>1</td>
<td>NaN</td>
<td>532</td>
<td>-3428409893954144223</td>
<td>3, 4</td>
</tr>
<tr>
<td>3</td>
<td>4</td>
<td>1</td>
<td>NaN</td>
<td>532</td>
<td>-3428409893954144223</td>
<td>3, 4</td>
</tr>
<tr>
<td>4</td>
<td>5</td>
<td>1</td>
<td>NaN</td>
<td>498, 70, 359, 573, 63, 66, 274, 116, 384, 163,...</td>
<td>5362274893926121442</td>
<td>5, 6</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>1528</td>
<td>1318</td>
<td>3</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>942, 854, 933, 297, 130, 144, 549, 283, 512, 1...</td>
<td>-32115995447722756</td>
<td>1318</td>
</tr>
<tr>
<td>1529</td>
<td>1319</td>
<td>3</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>714, 633, 48, 704, 408, 535, 754, 581, 979</td>
<td>4789694892756018439</td>
<td>1319</td>
</tr>
<tr>
<td>1530</td>
<td>1319</td>
<td>1</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>714, 633, 48, 704, 408, 535, 754, 581, 979</td>
<td>7722626036678389533</td>
<td>1319</td>
</tr>
<tr>
<td>1531</td>
<td>1320</td>
<td>3</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>866, 171, 186, 839, 592</td>
<td>6902392350219571553</td>
<td>1320</td>
</tr>
<tr>
<td>1532</td>
<td>1320</td>
<td>1</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>866, 171, 186, 839, 592</td>
<td>4611302665250055299</td>
<td>1320</td>
</tr>
</tbody>
</table>
<p>1533 rows × 6 columns</p>
</div>
```python
# rename terms
del condition_sets_orig_terms['term_x']
condition_sets_orig_terms = condition_sets_orig_terms.rename(columns = {'term_y' : 'term'})
condition_sets_orig_terms
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>condition_type</th>
<th>organization</th>
<th>journal</th>
<th>id_term_hash</th>
<th>term</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>NaN</td>
<td>532, 482, 452, 663, 323, 674, 317, 154, 439, 5...</td>
<td>-5197283134070040275</td>
<td>1, 2</td>
</tr>
<tr>
<td>1</td>
<td>1</td>
<td>NaN</td>
<td>532, 482, 452, 663, 323, 674, 317, 154, 439, 5...</td>
<td>-5197283134070040275</td>
<td>1, 2</td>
</tr>
<tr>
<td>2</td>
<td>1</td>
<td>NaN</td>
<td>532</td>
<td>-3428409893954144223</td>
<td>3, 4</td>
</tr>
<tr>
<td>3</td>
<td>1</td>
<td>NaN</td>
<td>532</td>
<td>-3428409893954144223</td>
<td>3, 4</td>
</tr>
<tr>
<td>4</td>
<td>1</td>
<td>NaN</td>
<td>498, 70, 359, 573, 63, 66, 274, 116, 384, 163,...</td>
<td>5362274893926121442</td>
<td>5, 6</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>1528</td>
<td>3</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>942, 854, 933, 297, 130, 144, 549, 283, 512, 1...</td>
<td>-32115995447722756</td>
<td>1318</td>
</tr>
<tr>
<td>1529</td>
<td>3</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>714, 633, 48, 704, 408, 535, 754, 581, 979</td>
<td>4789694892756018439</td>
<td>1319</td>
</tr>
<tr>
<td>1530</td>
<td>1</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>714, 633, 48, 704, 408, 535, 754, 581, 979</td>
<td>7722626036678389533</td>
<td>1319</td>
</tr>
<tr>
<td>1531</td>
<td>3</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>866, 171, 186, 839, 592</td>
<td>6902392350219571553</td>
<td>1320</td>
</tr>
<tr>
<td>1532</td>
<td>1</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>866, 171, 186, 839, 592</td>
<td>4611302665250055299</td>
<td>1320</td>
</tr>
</tbody>
</table>
<p>1533 rows × 5 columns</p>
</div>
```python
# test duplicates
condition_sets_orig_terms.loc[condition_sets_orig_terms.duplicated()].sort_values(by='term')
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>condition_type</th>
<th>organization</th>
<th>journal</th>
<th>id_term_hash</th>
<th>term</th>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>1</td>
<td>NaN</td>
<td>532, 482, 452, 663, 323, 674, 317, 154, 439, 5...</td>
<td>-5197283134070040275</td>
<td>1, 2</td>
</tr>
<tr>
<td>1187</td>
<td>1</td>
<td>NaN</td>
<td>779</td>
<td>-9104022108665859378</td>
<td>1001, 1002, 1003</td>
</tr>
<tr>
<td>1188</td>
<td>1</td>
<td>NaN</td>
<td>779</td>
<td>-9104022108665859378</td>
<td>1001, 1002, 1003</td>
</tr>
<tr>
<td>1190</td>
<td>1</td>
<td>NaN</td>
<td>7, 22</td>
<td>-5795971402582868051</td>
<td>1004, 1005</td>
</tr>
<tr>
<td>1194</td>
<td>1</td>
<td>NaN</td>
<td>825</td>
<td>-2985725204066841336</td>
<td>1008, 1009</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>1161</td>
<td>1</td>
<td>NaN</td>
<td>855</td>
<td>158530994336307876</td>
<td>978, 979</td>
</tr>
<tr>
<td>1168</td>
<td>1</td>
<td>NaN</td>
<td>654</td>
<td>-5164377982436891368</td>
<td>984, 985</td>
</tr>
<tr>
<td>1179</td>
<td>1</td>
<td>NaN</td>
<td>751</td>
<td>-1857992192228010123</td>
<td>993, 994, 995</td>
</tr>
<tr>
<td>1180</td>
<td>1</td>
<td>NaN</td>
<td>751</td>
<td>-1857992192228010123</td>
<td>993, 994, 995</td>
</tr>
<tr>
<td>1182</td>
<td>1</td>
<td>NaN</td>
<td>531</td>
<td>-3353627437951234546</td>
<td>996, 997</td>
</tr>
</tbody>
</table>
<p>384 rows × 5 columns</p>
</div>
```python
condition_sets_orig_terms.loc[condition_sets_orig_terms.duplicated()].shape[0]
```
384
```python
condition_sets_orig_terms_dedup = condition_sets_orig_terms.drop_duplicates()
condition_sets_orig_terms_dedup
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>condition_type</th>
<th>organization</th>
<th>journal</th>
<th>id_term_hash</th>
<th>term</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>NaN</td>
<td>532, 482, 452, 663, 323, 674, 317, 154, 439, 5...</td>
<td>-5197283134070040275</td>
<td>1, 2</td>
</tr>
<tr>
<td>2</td>
<td>1</td>
<td>NaN</td>
<td>532</td>
<td>-3428409893954144223</td>
<td>3, 4</td>
</tr>
<tr>
<td>4</td>
<td>1</td>
<td>NaN</td>
<td>498, 70, 359, 573, 63, 66, 274, 116, 384, 163,...</td>
<td>5362274893926121442</td>
<td>5, 6</td>
</tr>
<tr>
<td>6</td>
<td>1</td>
<td>NaN</td>
<td>498</td>
<td>-713947468848485257</td>
<td>7, 8</td>
</tr>
<tr>
<td>8</td>
<td>1</td>
<td>NaN</td>
<td>789</td>
<td>-5332045039572836456</td>
<td>9, 10, 11, 12</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>1528</td>
<td>3</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>942, 854, 933, 297, 130, 144, 549, 283, 512, 1...</td>
<td>-32115995447722756</td>
<td>1318</td>
</tr>
<tr>
<td>1529</td>
<td>3</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>714, 633, 48, 704, 408, 535, 754, 581, 979</td>
<td>4789694892756018439</td>
<td>1319</td>
</tr>
<tr>
<td>1530</td>
<td>1</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>714, 633, 48, 704, 408, 535, 754, 581, 979</td>
<td>7722626036678389533</td>
<td>1319</td>
</tr>
<tr>
<td>1531</td>
<td>3</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>866, 171, 186, 839, 592</td>
<td>6902392350219571553</td>
<td>1320</td>
</tr>
<tr>
<td>1532</td>
<td>1</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>866, 171, 186, 839, 592</td>
<td>4611302665250055299</td>
<td>1320</td>
</tr>
</tbody>
</table>
<p>1149 rows × 5 columns</p>
</div>
```python
# ajout des champs manquants
condition_sets_orig_terms_dedup['comment'] = ''
```
C:\Users\iriarte\AppData\Local\Continuum\anaconda3\lib\site-packages\ipykernel_launcher.py:2: SettingWithCopyWarning:
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead
See the caveats in the documentation: http://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
```python
# remplacement des "nan"
condition_sets_orig_terms_dedup.loc[condition_sets_orig_terms_dedup['journal'].isna()]
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>condition_type</th>
<th>organization</th>
<th>journal</th>
<th>id_term_hash</th>
<th>term</th>
<th>comment</th>
</tr>
</thead>
<tbody>
</tbody>
</table>
</div>
```python
# remplacement des "nan"
condition_sets_orig_terms_dedup.loc[condition_sets_orig_terms_dedup['term'].isna()]
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>condition_type</th>
<th>organization</th>
<th>journal</th>
<th>id_term_hash</th>
<th>term</th>
<th>comment</th>
</tr>
</thead>
<tbody>
</tbody>
</table>
</div>
```python
# remplacement des "nan"
condition_sets_orig_terms_dedup.loc[condition_sets_orig_terms_dedup['condition_type'].isna()]
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>condition_type</th>
<th>organization</th>
<th>journal</th>
<th>id_term_hash</th>
<th>term</th>
<th>comment</th>
</tr>
</thead>
<tbody>
</tbody>
</table>
</div>
```python
# remplacement des "nan"
condition_sets_orig_terms_dedup.loc[condition_sets_orig_terms_dedup['organization'].isna()]
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>condition_type</th>
<th>organization</th>
<th>journal</th>
<th>id_term_hash</th>
<th>term</th>
<th>comment</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>NaN</td>
<td>532, 482, 452, 663, 323, 674, 317, 154, 439, 5...</td>
<td>-5197283134070040275</td>
<td>1, 2</td>
<td></td>
</tr>
<tr>
<td>2</td>
<td>1</td>
<td>NaN</td>
<td>532</td>
<td>-3428409893954144223</td>
<td>3, 4</td>
<td></td>
</tr>
<tr>
<td>4</td>
<td>1</td>
<td>NaN</td>
<td>498, 70, 359, 573, 63, 66, 274, 116, 384, 163,...</td>
<td>5362274893926121442</td>
<td>5, 6</td>
<td></td>
</tr>
<tr>
<td>6</td>
<td>1</td>
<td>NaN</td>
<td>498</td>
<td>-713947468848485257</td>
<td>7, 8</td>
<td></td>
</tr>
<tr>
<td>8</td>
<td>1</td>
<td>NaN</td>
<td>789</td>
<td>-5332045039572836456</td>
<td>9, 10, 11, 12</td>
<td></td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>1515</td>
<td>1</td>
<td>NaN</td>
<td>870</td>
<td>3031852869228425137</td>
<td>1306, 1307</td>
<td></td>
</tr>
<tr>
<td>1517</td>
<td>1</td>
<td>NaN</td>
<td>41</td>
<td>-7902056154606509806</td>
<td>1308, 1309</td>
<td></td>
</tr>
<tr>
<td>1519</td>
<td>1</td>
<td>NaN</td>
<td>80</td>
<td>7657867214417959485</td>
<td>1310, 1311</td>
<td></td>
</tr>
<tr>
<td>1521</td>
<td>1</td>
<td>NaN</td>
<td>533</td>
<td>7303862352984295282</td>
<td>1312, 1313</td>
<td></td>
</tr>
<tr>
<td>1523</td>
<td>1</td>
<td>NaN</td>
<td>608</td>
<td>6548018561563906677</td>
<td>1314, 1315</td>
<td></td>
</tr>
</tbody>
</table>
<p>661 rows × 6 columns</p>
</div>
```python
# remplacement des "nan"
condition_sets_orig_terms_dedup['organization'] = condition_sets_orig_terms_dedup['organization'].fillna('')
condition_sets_orig_terms_dedup
```
C:\Users\iriarte\AppData\Local\Continuum\anaconda3\lib\site-packages\ipykernel_launcher.py:2: SettingWithCopyWarning:
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead
See the caveats in the documentation: http://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>condition_type</th>
<th>organization</th>
<th>journal</th>
<th>id_term_hash</th>
<th>term</th>
<th>comment</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td></td>
<td>532, 482, 452, 663, 323, 674, 317, 154, 439, 5...</td>
<td>-5197283134070040275</td>
<td>1, 2</td>
<td></td>
</tr>
<tr>
<td>2</td>
<td>1</td>
<td></td>
<td>532</td>
<td>-3428409893954144223</td>
<td>3, 4</td>
<td></td>
</tr>
<tr>
<td>4</td>
<td>1</td>
<td></td>
<td>498, 70, 359, 573, 63, 66, 274, 116, 384, 163,...</td>
<td>5362274893926121442</td>
<td>5, 6</td>
<td></td>
</tr>
<tr>
<td>6</td>
<td>1</td>
<td></td>
<td>498</td>
<td>-713947468848485257</td>
<td>7, 8</td>
<td></td>
</tr>
<tr>
<td>8</td>
<td>1</td>
<td></td>
<td>789</td>
<td>-5332045039572836456</td>
<td>9, 10, 11, 12</td>
<td></td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>1528</td>
<td>3</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>942, 854, 933, 297, 130, 144, 549, 283, 512, 1...</td>
<td>-32115995447722756</td>
<td>1318</td>
<td></td>
</tr>
<tr>
<td>1529</td>
<td>3</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>714, 633, 48, 704, 408, 535, 754, 581, 979</td>
<td>4789694892756018439</td>
<td>1319</td>
<td></td>
</tr>
<tr>
<td>1530</td>
<td>1</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>714, 633, 48, 704, 408, 535, 754, 581, 979</td>
<td>7722626036678389533</td>
<td>1319</td>
<td></td>
</tr>
<tr>
<td>1531</td>
<td>3</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>866, 171, 186, 839, 592</td>
<td>6902392350219571553</td>
<td>1320</td>
<td></td>
</tr>
<tr>
<td>1532</td>
<td>1</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>866, 171, 186, 839, 592</td>
<td>4611302665250055299</td>
<td>1320</td>
<td></td>
</tr>
</tbody>
</table>
<p>1149 rows × 6 columns</p>
</div>
```python
# convertir l'index en id
condition_sets_orig_terms_dedup = condition_sets_orig_terms_dedup.reset_index()
# ajout de l'id avec l'index + 1
condition_sets_orig_terms_dedup['id'] = condition_sets_orig_terms_dedup['index'] + 1
del condition_sets_orig_terms_dedup['index']
condition_sets_orig_terms_dedup
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>condition_type</th>
<th>organization</th>
<th>journal</th>
<th>id_term_hash</th>
<th>term</th>
<th>comment</th>
<th>id</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td></td>
<td>532, 482, 452, 663, 323, 674, 317, 154, 439, 5...</td>
<td>-5197283134070040275</td>
<td>1, 2</td>
<td></td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>1</td>
<td></td>
<td>532</td>
<td>-3428409893954144223</td>
<td>3, 4</td>
<td></td>
<td>3</td>
</tr>
<tr>
<td>2</td>
<td>1</td>
<td></td>
<td>498, 70, 359, 573, 63, 66, 274, 116, 384, 163,...</td>
<td>5362274893926121442</td>
<td>5, 6</td>
<td></td>
<td>5</td>
</tr>
<tr>
<td>3</td>
<td>1</td>
<td></td>
<td>498</td>
<td>-713947468848485257</td>
<td>7, 8</td>
<td></td>
<td>7</td>
</tr>
<tr>
<td>4</td>
<td>1</td>
<td></td>
<td>789</td>
<td>-5332045039572836456</td>
<td>9, 10, 11, 12</td>
<td></td>
<td>9</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>1144</td>
<td>3</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>942, 854, 933, 297, 130, 144, 549, 283, 512, 1...</td>
<td>-32115995447722756</td>
<td>1318</td>
<td></td>
<td>1529</td>
</tr>
<tr>
<td>1145</td>
<td>3</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>714, 633, 48, 704, 408, 535, 754, 581, 979</td>
<td>4789694892756018439</td>
<td>1319</td>
<td></td>
<td>1530</td>
</tr>
<tr>
<td>1146</td>
<td>1</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>714, 633, 48, 704, 408, 535, 754, 581, 979</td>
<td>7722626036678389533</td>
<td>1319</td>
<td></td>
<td>1531</td>
</tr>
<tr>
<td>1147</td>
<td>3</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>866, 171, 186, 839, 592</td>
<td>6902392350219571553</td>
<td>1320</td>
<td></td>
<td>1532</td>
</tr>
<tr>
<td>1148</td>
<td>1</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>866, 171, 186, 839, 592</td>
<td>4611302665250055299</td>
<td>1320</td>
<td></td>
<td>1533</td>
</tr>
</tbody>
</table>
<p>1149 rows × 7 columns</p>
</div>
```python
# convertir l'index en id
condition_sets_orig_terms_dedup = condition_sets_orig_terms_dedup.reset_index()
# ajout de l'id avec l'index + 1
condition_sets_orig_terms_dedup['id'] = condition_sets_orig_terms_dedup['index'] + 1
del condition_sets_orig_terms_dedup['index']
condition_sets_orig_terms_dedup
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>condition_type</th>
<th>organization</th>
<th>journal</th>
<th>id_term_hash</th>
<th>term</th>
<th>comment</th>
<th>id</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td></td>
<td>532, 482, 452, 663, 323, 674, 317, 154, 439, 5...</td>
<td>-5197283134070040275</td>
<td>1, 2</td>
<td></td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>1</td>
<td></td>
<td>532</td>
<td>-3428409893954144223</td>
<td>3, 4</td>
<td></td>
<td>2</td>
</tr>
<tr>
<td>2</td>
<td>1</td>
<td></td>
<td>498, 70, 359, 573, 63, 66, 274, 116, 384, 163,...</td>
<td>5362274893926121442</td>
<td>5, 6</td>
<td></td>
<td>3</td>
</tr>
<tr>
<td>3</td>
<td>1</td>
<td></td>
<td>498</td>
<td>-713947468848485257</td>
<td>7, 8</td>
<td></td>
<td>4</td>
</tr>
<tr>
<td>4</td>
<td>1</td>
<td></td>
<td>789</td>
<td>-5332045039572836456</td>
<td>9, 10, 11, 12</td>
<td></td>
<td>5</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>1144</td>
<td>3</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>942, 854, 933, 297, 130, 144, 549, 283, 512, 1...</td>
<td>-32115995447722756</td>
<td>1318</td>
<td></td>
<td>1145</td>
</tr>
<tr>
<td>1145</td>
<td>3</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>714, 633, 48, 704, 408, 535, 754, 581, 979</td>
<td>4789694892756018439</td>
<td>1319</td>
<td></td>
<td>1146</td>
</tr>
<tr>
<td>1146</td>
<td>1</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>714, 633, 48, 704, 408, 535, 754, 581, 979</td>
<td>7722626036678389533</td>
<td>1319</td>
<td></td>
<td>1147</td>
</tr>
<tr>
<td>1147</td>
<td>3</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>866, 171, 186, 839, 592</td>
<td>6902392350219571553</td>
<td>1320</td>
<td></td>
<td>1148</td>
</tr>
<tr>
<td>1148</td>
<td>1</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>866, 171, 186, 839, 592</td>
<td>4611302665250055299</td>
<td>1320</td>
<td></td>
<td>1149</td>
</tr>
</tbody>
</table>
<p>1149 rows × 7 columns</p>
</div>
```python
# export de la table
result = condition_sets_orig_terms_dedup[['id', 'condition_type', 'organization', 'journal', 'term', 'comment']].to_json(orient='records', force_ascii=False)
parsed = json.loads(result)
with open('sample/condition_set.json', 'w', encoding='utf-8') as file:
json.dump(parsed, file, indent=2, ensure_ascii=False)
```
```python
# export excel
condition_sets_orig_terms_dedup[['id', 'condition_type', 'organization', 'journal', 'term', 'comment']].to_excel('sample/condition_set.xlsx', index=False)
```
```python
# export csv
condition_sets_orig_terms_dedup[['id', 'condition_type', 'organization', 'journal', 'term', 'comment']].to_csv('sample/condition_set.tsv', index=False)
```
## Table organization_condition_set
```python
condition_sets_orig_terms_dedup
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>condition_type</th>
<th>organization</th>
<th>journal</th>
<th>id_term_hash</th>
<th>term</th>
<th>comment</th>
<th>id</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td></td>
<td>532, 482, 452, 663, 323, 674, 317, 154, 439, 5...</td>
<td>-5197283134070040275</td>
<td>1, 2</td>
<td></td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>1</td>
<td></td>
<td>532</td>
<td>-3428409893954144223</td>
<td>3, 4</td>
<td></td>
<td>2</td>
</tr>
<tr>
<td>2</td>
<td>1</td>
<td></td>
<td>498, 70, 359, 573, 63, 66, 274, 116, 384, 163,...</td>
<td>5362274893926121442</td>
<td>5, 6</td>
<td></td>
<td>3</td>
</tr>
<tr>
<td>3</td>
<td>1</td>
<td></td>
<td>498</td>
<td>-713947468848485257</td>
<td>7, 8</td>
<td></td>
<td>4</td>
</tr>
<tr>
<td>4</td>
<td>1</td>
<td></td>
<td>789</td>
<td>-5332045039572836456</td>
<td>9, 10, 11, 12</td>
<td></td>
<td>5</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>1144</td>
<td>3</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>942, 854, 933, 297, 130, 144, 549, 283, 512, 1...</td>
<td>-32115995447722756</td>
<td>1318</td>
<td></td>
<td>1145</td>
</tr>
<tr>
<td>1145</td>
<td>3</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>714, 633, 48, 704, 408, 535, 754, 581, 979</td>
<td>4789694892756018439</td>
<td>1319</td>
<td></td>
<td>1146</td>
</tr>
<tr>
<td>1146</td>
<td>1</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>714, 633, 48, 704, 408, 535, 754, 581, 979</td>
<td>7722626036678389533</td>
<td>1319</td>
<td></td>
<td>1147</td>
</tr>
<tr>
<td>1147</td>
<td>3</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>866, 171, 186, 839, 592</td>
<td>6902392350219571553</td>
<td>1320</td>
<td></td>
<td>1148</td>
</tr>
<tr>
<td>1148</td>
<td>1</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>866, 171, 186, 839, 592</td>
<td>4611302665250055299</td>
<td>1320</td>
<td></td>
<td>1149</td>
</tr>
</tbody>
</table>
<p>1149 rows × 7 columns</p>
</div>
```python
condition_sets_orig_terms_dedup.loc[(condition_sets_orig_terms_dedup['organization'].notna()) & (condition_sets_orig_terms_dedup['organization'] != '')]
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>condition_type</th>
<th>organization</th>
<th>journal</th>
<th>id_term_hash</th>
<th>term</th>
<th>comment</th>
<th>id</th>
</tr>
</thead>
<tbody>
<tr>
<td>5</td>
<td>3</td>
<td>47</td>
<td>789</td>
<td>-6118989085408562349</td>
<td>13</td>
<td></td>
<td>6</td>
</tr>
<tr>
<td>11</td>
<td>3</td>
<td>47</td>
<td>668, 576, 371, 410, 849, 184, 670, 559, 58, 16...</td>
<td>7026376488862543796</td>
<td>22</td>
<td></td>
<td>12</td>
</tr>
<tr>
<td>12</td>
<td>1</td>
<td>47</td>
<td>668, 576, 371, 410, 849, 184, 670, 559, 58, 16...</td>
<td>8899497448130036698</td>
<td>22</td>
<td></td>
<td>13</td>
</tr>
<tr>
<td>21</td>
<td>1</td>
<td>48, 64, 51, 74, 68, 67, 69, 59, 75, 76, 77, 78</td>
<td>985, 485, 787, 415, 189, 395, 652, 83, 227, 44...</td>
<td>3530505283797139276</td>
<td>42</td>
<td></td>
<td>22</td>
</tr>
<tr>
<td>22</td>
<td>3</td>
<td>48, 64, 51, 74, 68, 67, 69, 59, 75, 76, 77, 78</td>
<td>985, 485, 787, 415, 189, 395, 652, 83, 227, 44...</td>
<td>3056402465711846666</td>
<td>42</td>
<td></td>
<td>23</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>1144</td>
<td>3</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>942, 854, 933, 297, 130, 144, 549, 283, 512, 1...</td>
<td>-32115995447722756</td>
<td>1318</td>
<td></td>
<td>1145</td>
</tr>
<tr>
<td>1145</td>
<td>3</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>714, 633, 48, 704, 408, 535, 754, 581, 979</td>
<td>4789694892756018439</td>
<td>1319</td>
<td></td>
<td>1146</td>
</tr>
<tr>
<td>1146</td>
<td>1</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>714, 633, 48, 704, 408, 535, 754, 581, 979</td>
<td>7722626036678389533</td>
<td>1319</td>
<td></td>
<td>1147</td>
</tr>
<tr>
<td>1147</td>
<td>3</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>866, 171, 186, 839, 592</td>
<td>6902392350219571553</td>
<td>1320</td>
<td></td>
<td>1148</td>
</tr>
<tr>
<td>1148</td>
<td>1</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>866, 171, 186, 839, 592</td>
<td>4611302665250055299</td>
<td>1320</td>
<td></td>
<td>1149</td>
</tr>
</tbody>
</table>
<p>488 rows × 7 columns</p>
</div>
```python
# creation du DF
# col_names = ['id',
# 'organization',
# 'condition_set',
# 'valid_from',
# 'valid_until'
# ]
# organization_condition = pd.DataFrame(columns = col_names)
organization_condition = condition_sets_orig_terms_dedup.loc[(condition_sets_orig_terms_dedup['organization'].notna()) & (condition_sets_orig_terms_dedup['organization'] != '')][['id', 'organization', 'term']]
organization_condition
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id</th>
<th>organization</th>
<th>term</th>
</tr>
</thead>
<tbody>
<tr>
<td>5</td>
<td>6</td>
<td>47</td>
<td>13</td>
</tr>
<tr>
<td>11</td>
<td>12</td>
<td>47</td>
<td>22</td>
</tr>
<tr>
<td>12</td>
<td>13</td>
<td>47</td>
<td>22</td>
</tr>
<tr>
<td>21</td>
<td>22</td>
<td>48, 64, 51, 74, 68, 67, 69, 59, 75, 76, 77, 78</td>
<td>42</td>
</tr>
<tr>
<td>22</td>
<td>23</td>
<td>48, 64, 51, 74, 68, 67, 69, 59, 75, 76, 77, 78</td>
<td>42</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>1144</td>
<td>1145</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1318</td>
</tr>
<tr>
<td>1145</td>
<td>1146</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1319</td>
</tr>
<tr>
<td>1146</td>
<td>1147</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1319</td>
</tr>
<tr>
<td>1147</td>
<td>1148</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1320</td>
</tr>
<tr>
<td>1148</td>
<td>1149</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1320</td>
</tr>
</tbody>
</table>
<p>488 rows × 3 columns</p>
</div>
```python
# extraction des terms ids
organization_condition_split = organization_condition.assign(term = organization_condition.term.str.split(',')).explode('term')
organization_condition_split
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id</th>
<th>organization</th>
<th>term</th>
</tr>
</thead>
<tbody>
<tr>
<td>5</td>
<td>6</td>
<td>47</td>
<td>13</td>
</tr>
<tr>
<td>11</td>
<td>12</td>
<td>47</td>
<td>22</td>
</tr>
<tr>
<td>12</td>
<td>13</td>
<td>47</td>
<td>22</td>
</tr>
<tr>
<td>21</td>
<td>22</td>
<td>48, 64, 51, 74, 68, 67, 69, 59, 75, 76, 77, 78</td>
<td>42</td>
</tr>
<tr>
<td>22</td>
<td>23</td>
<td>48, 64, 51, 74, 68, 67, 69, 59, 75, 76, 77, 78</td>
<td>42</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>1144</td>
<td>1145</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1318</td>
</tr>
<tr>
<td>1145</td>
<td>1146</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1319</td>
</tr>
<tr>
<td>1146</td>
<td>1147</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1319</td>
</tr>
<tr>
<td>1147</td>
<td>1148</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1320</td>
</tr>
<tr>
<td>1148</td>
<td>1149</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1320</td>
</tr>
</tbody>
</table>
<p>490 rows × 3 columns</p>
</div>
```python
organization_condition_split.loc[organization_condition_split['organization'].isna()]
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id</th>
<th>organization</th>
<th>term</th>
</tr>
</thead>
<tbody>
</tbody>
</table>
</div>
```python
organization_condition_split.loc[organization_condition_split['term'].isna()]
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id</th>
<th>organization</th>
<th>term</th>
</tr>
</thead>
<tbody>
</tbody>
</table>
</div>
```python
organization_condition_split['term'] = organization_condition_split['term'].astype(int)
organization_condition_split
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id</th>
<th>organization</th>
<th>term</th>
</tr>
</thead>
<tbody>
<tr>
<td>5</td>
<td>6</td>
<td>47</td>
<td>13</td>
</tr>
<tr>
<td>11</td>
<td>12</td>
<td>47</td>
<td>22</td>
</tr>
<tr>
<td>12</td>
<td>13</td>
<td>47</td>
<td>22</td>
</tr>
<tr>
<td>21</td>
<td>22</td>
<td>48, 64, 51, 74, 68, 67, 69, 59, 75, 76, 77, 78</td>
<td>42</td>
</tr>
<tr>
<td>22</td>
<td>23</td>
<td>48, 64, 51, 74, 68, 67, 69, 59, 75, 76, 77, 78</td>
<td>42</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>1144</td>
<td>1145</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1318</td>
</tr>
<tr>
<td>1145</td>
<td>1146</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1319</td>
</tr>
<tr>
<td>1146</td>
<td>1147</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1319</td>
</tr>
<tr>
<td>1147</td>
<td>1148</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1320</td>
</tr>
<tr>
<td>1148</td>
<td>1149</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1320</td>
</tr>
</tbody>
</table>
<p>490 rows × 3 columns</p>
</div>
```python
# ajout du ROR
terms_export_dates
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id_content_hash</th>
<th>ror</th>
<th>valid_from</th>
<th>valid_until</th>
<th>term</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>-6020029623494903364</td>
<td>https://ror.org/04d8ztx87</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1316</td>
</tr>
<tr>
<td>1</td>
<td>-6020029623494903364</td>
<td>https://ror.org/02bnkt322</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1316</td>
</tr>
<tr>
<td>2</td>
<td>-6020029623494903364</td>
<td>https://ror.org/00zg4za48</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1316</td>
</tr>
<tr>
<td>3</td>
<td>-6020029623494903364</td>
<td>https://ror.org/02s376052</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1316</td>
</tr>
<tr>
<td>4</td>
<td>-6020029623494903364</td>
<td>https://ror.org/05a28rw58</td>
<td>2020-01-01</td>
<td>2023-12-31</td>
<td>1316</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>40078</td>
<td>7687377827846095855</td>
<td>https://ror.org/01swzsf04</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1320</td>
</tr>
<tr>
<td>40079</td>
<td>7687377827846095855</td>
<td>https://ror.org/019whta54</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1320</td>
</tr>
<tr>
<td>40080</td>
<td>7687377827846095855</td>
<td>https://ror.org/00vasag41</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1320</td>
</tr>
<tr>
<td>40081</td>
<td>7687377827846095855</td>
<td>https://ror.org/05r0ap620</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1320</td>
</tr>
<tr>
<td>40082</td>
<td>7687377827846095855</td>
<td>https://ror.org/05pmsvm27</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>1320</td>
</tr>
</tbody>
</table>
<p>40083 rows × 5 columns</p>
</div>
```python
# merge pour obtenir les dates
organization_condition_split = pd.merge(organization_condition_split, terms_export_dates[['term', 'valid_from', 'valid_until']], on='term', how='left')
organization_condition_split
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id</th>
<th>organization</th>
<th>term</th>
<th>valid_from</th>
<th>valid_until</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>47</td>
<td>13</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>1</td>
<td>12</td>
<td>47</td>
<td>22</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>2</td>
<td>13</td>
<td>47</td>
<td>22</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>3</td>
<td>22</td>
<td>48, 64, 51, 74, 68, 67, 69, 59, 75, 76, 77, 78</td>
<td>42</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>4</td>
<td>23</td>
<td>48, 64, 51, 74, 68, 67, 69, 59, 75, 76, 77, 78</td>
<td>42</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>48610</td>
<td>1149</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1320</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>48611</td>
<td>1149</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1320</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>48612</td>
<td>1149</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1320</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>48613</td>
<td>1149</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1320</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>48614</td>
<td>1149</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1320</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
</tbody>
</table>
<p>48615 rows × 5 columns</p>
</div>
```python
# dédoublonage
organization_condition_split_dedup = organization_condition_split.drop_duplicates()
organization_condition_split_dedup
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id</th>
<th>organization</th>
<th>term</th>
<th>valid_from</th>
<th>valid_until</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>47</td>
<td>13</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>1</td>
<td>12</td>
<td>47</td>
<td>22</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>2</td>
<td>13</td>
<td>47</td>
<td>22</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>3</td>
<td>22</td>
<td>48, 64, 51, 74, 68, 67, 69, 59, 75, 76, 77, 78</td>
<td>42</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>4</td>
<td>23</td>
<td>48, 64, 51, 74, 68, 67, 69, 59, 75, 76, 77, 78</td>
<td>42</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>32042</td>
<td>1145</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1318</td>
<td>2021-01-01</td>
<td>2024-12-31</td>
</tr>
<tr>
<td>45947</td>
<td>1146</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1319</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>46361</td>
<td>1147</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1319</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>46775</td>
<td>1148</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1320</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>47695</td>
<td>1149</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1320</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
</tbody>
</table>
<p>490 rows × 5 columns</p>
</div>
```python
organization_condition = pd.merge(organization_condition, organization_condition_split_dedup[['id', 'valid_from', 'valid_until']], on='id', how='left')
organization_condition
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id</th>
<th>organization</th>
<th>term</th>
<th>valid_from</th>
<th>valid_until</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>47</td>
<td>13</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>1</td>
<td>12</td>
<td>47</td>
<td>22</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>2</td>
<td>13</td>
<td>47</td>
<td>22</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>3</td>
<td>22</td>
<td>48, 64, 51, 74, 68, 67, 69, 59, 75, 76, 77, 78</td>
<td>42</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>4</td>
<td>23</td>
<td>48, 64, 51, 74, 68, 67, 69, 59, 75, 76, 77, 78</td>
<td>42</td>
<td>NaN</td>
<td>NaN</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>485</td>
<td>1145</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1318</td>
<td>2021-01-01</td>
<td>2024-12-31</td>
</tr>
<tr>
<td>486</td>
<td>1146</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1319</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>487</td>
<td>1147</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1319</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>488</td>
<td>1148</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1320</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>489</td>
<td>1149</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1320</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
</tbody>
</table>
<p>490 rows × 5 columns</p>
</div>
```python
organization_condition = organization_condition.rename(columns = {'id' : 'condition_set'})
organization_condition['valid_from'] = organization_condition['valid_from'].fillna('')
organization_condition['valid_until'] = organization_condition['valid_until'].fillna('')
organization_condition
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>condition_set</th>
<th>organization</th>
<th>term</th>
<th>valid_from</th>
<th>valid_until</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>47</td>
<td>13</td>
<td></td>
<td></td>
</tr>
<tr>
<td>1</td>
<td>12</td>
<td>47</td>
<td>22</td>
<td></td>
<td></td>
</tr>
<tr>
<td>2</td>
<td>13</td>
<td>47</td>
<td>22</td>
<td></td>
<td></td>
</tr>
<tr>
<td>3</td>
<td>22</td>
<td>48, 64, 51, 74, 68, 67, 69, 59, 75, 76, 77, 78</td>
<td>42</td>
<td></td>
<td></td>
</tr>
<tr>
<td>4</td>
<td>23</td>
<td>48, 64, 51, 74, 68, 67, 69, 59, 75, 76, 77, 78</td>
<td>42</td>
<td></td>
<td></td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>485</td>
<td>1145</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1318</td>
<td>2021-01-01</td>
<td>2024-12-31</td>
</tr>
<tr>
<td>486</td>
<td>1146</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1319</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>487</td>
<td>1147</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1319</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>488</td>
<td>1148</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1320</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>489</td>
<td>1149</td>
<td>3, 4, 6, 24, 1, 5, 31, 27, 7, 8, 28, 11, 44, 1...</td>
<td>1320</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
</tbody>
</table>
<p>490 rows × 5 columns</p>
</div>
```python
# split final pour avoir une ligne par organization
organization_condition_fin = organization_condition.assign(organization = organization_condition.organization.str.split(',')).explode('organization')
organization_condition_fin
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>condition_set</th>
<th>organization</th>
<th>term</th>
<th>valid_from</th>
<th>valid_until</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>47</td>
<td>13</td>
<td></td>
<td></td>
</tr>
<tr>
<td>1</td>
<td>12</td>
<td>47</td>
<td>22</td>
<td></td>
<td></td>
</tr>
<tr>
<td>2</td>
<td>13</td>
<td>47</td>
<td>22</td>
<td></td>
<td></td>
</tr>
<tr>
<td>3</td>
<td>22</td>
<td>48</td>
<td>42</td>
<td></td>
<td></td>
</tr>
<tr>
<td>3</td>
<td>22</td>
<td>64</td>
<td>42</td>
<td></td>
<td></td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>489</td>
<td>1149</td>
<td>2</td>
<td>1320</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>489</td>
<td>1149</td>
<td>35</td>
<td>1320</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>489</td>
<td>1149</td>
<td>38</td>
<td>1320</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>489</td>
<td>1149</td>
<td>46</td>
<td>1320</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
<tr>
<td>489</td>
<td>1149</td>
<td>43</td>
<td>1320</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
</tr>
</tbody>
</table>
<p>6834 rows × 5 columns</p>
</div>
```python
# ajout de l'id avec l'index + 1
organization_condition_fin = organization_condition_fin.reset_index()
organization_condition_fin['id'] = organization_condition_fin.index + 1
del organization_condition_fin['index']
organization_condition_fin
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>condition_set</th>
<th>organization</th>
<th>term</th>
<th>valid_from</th>
<th>valid_until</th>
<th>id</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>47</td>
<td>13</td>
<td></td>
<td></td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>12</td>
<td>47</td>
<td>22</td>
<td></td>
<td></td>
<td>2</td>
</tr>
<tr>
<td>2</td>
<td>13</td>
<td>47</td>
<td>22</td>
<td></td>
<td></td>
<td>3</td>
</tr>
<tr>
<td>3</td>
<td>22</td>
<td>48</td>
<td>42</td>
<td></td>
<td></td>
<td>4</td>
</tr>
<tr>
<td>4</td>
<td>22</td>
<td>64</td>
<td>42</td>
<td></td>
<td></td>
<td>5</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>6829</td>
<td>1149</td>
<td>2</td>
<td>1320</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>6830</td>
</tr>
<tr>
<td>6830</td>
<td>1149</td>
<td>35</td>
<td>1320</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>6831</td>
</tr>
<tr>
<td>6831</td>
<td>1149</td>
<td>38</td>
<td>1320</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>6832</td>
</tr>
<tr>
<td>6832</td>
<td>1149</td>
<td>46</td>
<td>1320</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>6833</td>
</tr>
<tr>
<td>6833</td>
<td>1149</td>
<td>43</td>
<td>1320</td>
<td>2021-01-01</td>
<td>2023-12-31</td>
<td>6834</td>
</tr>
</tbody>
</table>
<p>6834 rows × 6 columns</p>
</div>
```python
# export de la table
result = organization_condition_fin[['id', 'condition_set', 'organization', 'valid_from', 'valid_until']].to_json(orient='records', force_ascii=False)
parsed = json.loads(result)
with open('sample/organization_condition.json', 'w', encoding='utf-8') as file:
json.dump(parsed, file, indent=2, ensure_ascii=False)
```
```python
# export excel
organization_condition_fin[['id', 'condition_set', 'organization', 'valid_from', 'valid_until']].to_excel('sample/organization_condition.xlsx', index=False)
```
```python
# export csv
organization_condition_fin[['id', 'condition_set', 'organization', 'valid_from', 'valid_until']].to_csv('sample/organization_condition.tsv', index=False)
```
## Table journal_condition_set
```python
# creation du DF
# col_names = ['id',
# 'journal',
# 'condition_set',
# 'valid_from',
# 'valid_until'
# ]
# journal_condition = pd.DataFrame(columns = col_names)
journal_condition = condition_sets_orig_terms_dedup.loc[(condition_sets_orig_terms_dedup['journal'].notna()) & (condition_sets_orig_terms_dedup['journal'] != '')][['id', 'journal']]
journal_condition
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>id</th>
<th>journal</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>532, 482, 452, 663, 323, 674, 317, 154, 439, 5...</td>
</tr>
<tr>
<td>1</td>
<td>2</td>
<td>532</td>
</tr>
<tr>
<td>2</td>
<td>3</td>
<td>498, 70, 359, 573, 63, 66, 274, 116, 384, 163,...</td>
</tr>
<tr>
<td>3</td>
<td>4</td>
<td>498</td>
</tr>
<tr>
<td>4</td>
<td>5</td>
<td>789</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>1144</td>
<td>1145</td>
<td>942, 854, 933, 297, 130, 144, 549, 283, 512, 1...</td>
</tr>
<tr>
<td>1145</td>
<td>1146</td>
<td>714, 633, 48, 704, 408, 535, 754, 581, 979</td>
</tr>
<tr>
<td>1146</td>
<td>1147</td>
<td>714, 633, 48, 704, 408, 535, 754, 581, 979</td>
</tr>
<tr>
<td>1147</td>
<td>1148</td>
<td>866, 171, 186, 839, 592</td>
</tr>
<tr>
<td>1148</td>
<td>1149</td>
<td>866, 171, 186, 839, 592</td>
</tr>
</tbody>
</table>
<p>1149 rows × 2 columns</p>
</div>
```python
journal_condition = journal_condition.rename(columns = {'id' : 'condition_set'})
journal_condition['valid_from'] = ''
journal_condition['valid_until'] = ''
journal_condition
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>condition_set</th>
<th>journal</th>
<th>valid_from</th>
<th>valid_until</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>532, 482, 452, 663, 323, 674, 317, 154, 439, 5...</td>
<td></td>
<td></td>
</tr>
<tr>
<td>1</td>
<td>2</td>
<td>532</td>
<td></td>
<td></td>
</tr>
<tr>
<td>2</td>
<td>3</td>
<td>498, 70, 359, 573, 63, 66, 274, 116, 384, 163,...</td>
<td></td>
<td></td>
</tr>
<tr>
<td>3</td>
<td>4</td>
<td>498</td>
<td></td>
<td></td>
</tr>
<tr>
<td>4</td>
<td>5</td>
<td>789</td>
<td></td>
<td></td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>1144</td>
<td>1145</td>
<td>942, 854, 933, 297, 130, 144, 549, 283, 512, 1...</td>
<td></td>
<td></td>
</tr>
<tr>
<td>1145</td>
<td>1146</td>
<td>714, 633, 48, 704, 408, 535, 754, 581, 979</td>
<td></td>
<td></td>
</tr>
<tr>
<td>1146</td>
<td>1147</td>
<td>714, 633, 48, 704, 408, 535, 754, 581, 979</td>
<td></td>
<td></td>
</tr>
<tr>
<td>1147</td>
<td>1148</td>
<td>866, 171, 186, 839, 592</td>
<td></td>
<td></td>
</tr>
<tr>
<td>1148</td>
<td>1149</td>
<td>866, 171, 186, 839, 592</td>
<td></td>
<td></td>
</tr>
</tbody>
</table>
<p>1149 rows × 4 columns</p>
</div>
```python
# split final pour avoir une ligne par journal
journal_condition_fin = journal_condition.assign(journal = journal_condition.journal.str.split(',')).explode('journal')
journal_condition_fin
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>condition_set</th>
<th>journal</th>
<th>valid_from</th>
<th>valid_until</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>532</td>
<td></td>
<td></td>
</tr>
<tr>
<td>0</td>
<td>1</td>
<td>482</td>
<td></td>
<td></td>
</tr>
<tr>
<td>0</td>
<td>1</td>
<td>452</td>
<td></td>
<td></td>
</tr>
<tr>
<td>0</td>
<td>1</td>
<td>663</td>
<td></td>
<td></td>
</tr>
<tr>
<td>0</td>
<td>1</td>
<td>323</td>
<td></td>
<td></td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>1148</td>
<td>1149</td>
<td>866</td>
<td></td>
<td></td>
</tr>
<tr>
<td>1148</td>
<td>1149</td>
<td>171</td>
<td></td>
<td></td>
</tr>
<tr>
<td>1148</td>
<td>1149</td>
<td>186</td>
<td></td>
<td></td>
</tr>
<tr>
<td>1148</td>
<td>1149</td>
<td>839</td>
<td></td>
<td></td>
</tr>
<tr>
<td>1148</td>
<td>1149</td>
<td>592</td>
<td></td>
<td></td>
</tr>
</tbody>
</table>
<p>3033 rows × 4 columns</p>
</div>
```python
# ajout de l'id avec l'index + 1
journal_condition_fin = journal_condition_fin.reset_index()
journal_condition_fin['id'] = journal_condition_fin.index + 1
del journal_condition_fin['index']
journal_condition_fin
```
<div>
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>condition_set</th>
<th>journal</th>
<th>valid_from</th>
<th>valid_until</th>
<th>id</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>532</td>
<td></td>
<td></td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>1</td>
<td>482</td>
<td></td>
<td></td>
<td>2</td>
</tr>
<tr>
<td>2</td>
<td>1</td>
<td>452</td>
<td></td>
<td></td>
<td>3</td>
</tr>
<tr>
<td>3</td>
<td>1</td>
<td>663</td>
<td></td>
<td></td>
<td>4</td>
</tr>
<tr>
<td>4</td>
<td>1</td>
<td>323</td>
<td></td>
<td></td>
<td>5</td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
<td>...</td>
</tr>
<tr>
<td>3028</td>
<td>1149</td>
<td>866</td>
<td></td>
<td></td>
<td>3029</td>
</tr>
<tr>
<td>3029</td>
<td>1149</td>
<td>171</td>
<td></td>
<td></td>
<td>3030</td>
</tr>
<tr>
<td>3030</td>
<td>1149</td>
<td>186</td>
<td></td>
<td></td>
<td>3031</td>
</tr>
<tr>
<td>3031</td>
<td>1149</td>
<td>839</td>
<td></td>
<td></td>
<td>3032</td>
</tr>
<tr>
<td>3032</td>
<td>1149</td>
<td>592</td>
<td></td>
<td></td>
<td>3033</td>
</tr>
</tbody>
</table>
<p>3033 rows × 5 columns</p>
</div>
```python
# export de la table
result = journal_condition_fin.to_json(orient='records', force_ascii=False)
parsed = json.loads(result)
with open('sample/journal_condition.json', 'w', encoding='utf-8') as file:
json.dump(parsed, file, indent=2, ensure_ascii=False)
```
```python
# export excel
journal_condition_fin.to_excel('sample/journal_condition.xlsx', index=False)
```
```python
# export csv
journal_condition_fin.to_csv('sample/journal_condition.tsv', index=False)
```
```python
```

Event Timeline