{ "cells": [ { "cell_type": "markdown", "source": [ "# TODO" ], "metadata": { "collapsed": false } }, { "cell_type": "markdown", "source": [ "# So far:\n", "## WOS\n", "- WOS: sample dataset (see query for details) -> only People's Republic of China /to be specified!/, ~11,000-13,000 initial records\n", "- WOS - METRIX merge --> filter for articles indexed by METRIX\n", "- Affiliation and country extraction (from addresses)\n", "- Filter for articles with 'foreign collaboration': at least two authors from different regions (EU, CH)\n", "- Fuzzy association between institute and country (Affiliation - Address (institution) - Address (country))\n", "## PATSTAT\n", "- PATSTAT scope extraction (initial: 2011-2024, has both EU and China /to be specified!/ in appln-person person map --> 'foreign collab')\n", "- I love harmonized entities, harmonized even further (sector)\n", "- Fetched the CPC description taxonomy and merged it with PATSTAT: lost around one percent of the records" ], "metadata": { "collapsed": false, "pycharm": { "name": "#%% md\n" } } }, { "cell_type": "code", "execution_count": 1, "outputs": [], "source": [ "# Patent classes" ], "metadata": { "collapsed": false, "pycharm": { "name": "#%%\n" } } }, { "cell_type": "code", "execution_count": 2, "outputs": [], "source": [ "# AI keywords PATSTAT and WOS" ], "metadata": { "collapsed": false } }, { "cell_type": "markdown", "source": [ "# What I need\n", "- List of visuals & tables /in a specified manner/ can be" ], "metadata": { "collapsed": false, "pycharm": { "name": "#%% md\n" } } }, { "cell_type": "code", "execution_count": null, "outputs": [], "source": [], "metadata": { "collapsed": false, "pycharm": { "name": "#%%\n" } } } ], "metadata": { "kernelspec": { "display_name": "Python 3", "language": "python", "name": "python3" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 2 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python",
"pygments_lexer": "ipython2", "version": "2.7.6" } }, "nbformat": 4, "nbformat_minor": 0 }