{"id":1786301,"date":"2022-12-26T14:50:50","date_gmt":"2022-12-26T19:50:50","guid":{"rendered":"https:\/\/www.analyticsvidhya.com\/?p=100235"},"modified":"2022-12-26T14:50:50","modified_gmt":"2022-12-26T19:50:50","slug":"crafting-serverless-etl-pipeline-using-aws-glue-and-pyspark","status":"publish","type":"station","link":"https:\/\/platodata.io\/plato-data\/crafting-serverless-etl-pipeline-using-aws-glue-and-pyspark\/","title":{"rendered":"Crafting Serverless ETL Pipeline Using AWS Glue and PySpark"},"content":{"rendered":"
\n

ETL (Extract, Transform, and Load) is a very common technique in data engineering. It involves extracting the operational data from various sources, transforming it into a format suitable for business needs, and loading it into data storage systems.

Traditionally, ETL processes are run on servers, which require ongoing maintenance and manual intervention. However, with the rise of serverless technology, it is now possible to perform ETL without the need for dedicated servers. This is where AWS Glue and PySpark come into play.

AWS Glue is a fully managed ETL service from AWS that makes it easy to manipulate and move data between various data stores. It can crawl data sources, identify data types and formats, and suggest schemas, making it easy to extract, transform, and load data for analytics.

PySpark is the Python API for Apache Spark, a powerful open-source distributed computing framework widely used for big data processing.
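To give a feel for the PySpark API before bringing Glue into the picture, here is a minimal standalone sketch of a typical DataFrame transformation; the S3 paths and column names are hypothetical placeholders, not part of the pipeline built later in this article:

```python
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

# Start a Spark session (a Glue job provides one for you automatically)
spark = SparkSession.builder.appName("pyspark-intro").getOrCreate()

# Extract: read raw CSV data (placeholder path)
orders = spark.read.csv("s3://my-bucket/raw/orders.csv", header=True, inferSchema=True)

# Transform: keep completed orders and derive a new column
completed = (
    orders.filter(F.col("status") == "COMPLETED")
          .withColumn("total_with_tax", F.col("total") * 1.1)
)

# Load: write the result back out in a columnar format (placeholder path)
completed.write.mode("overwrite").parquet("s3://my-bucket/curated/orders/")
```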

How Do AWS Glue and PySpark Work?
Together, Glue and PySpark provide a powerful, serverless ETL solution that is easy to use and scalable. Here's how it works:

1. First, Glue crawls your data sources to identify the data formats and suggest a schema. You can then edit and refine the schema as needed.
2. Next, you use PySpark to write ETL scripts that extract the data from the sources, transform it according to the schema, and load it into your data warehouse or other storage systems (a skeleton of such a script is sketched right after this list).
3. The PySpark scripts are then executed by Glue, which automatically scales up or down to handle the workload. This allows you to process large amounts of data without having to worry about managing servers or infrastructure.
4. Finally, Glue also provides a rich set of tools for monitoring and managing your ETL processes, including a visual workflow editor, job scheduling, and data lineage tracking.
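To make step 2 concrete, a typical Glue PySpark job follows the skeleton below; the Data Catalog database, table name, column mappings, and target S3 path are placeholder assumptions, and the transformation shown is just an illustrative ApplyMapping step:

```python
import sys
from pyspark.context import SparkContext
from awsglue.context import GlueContext
from awsglue.job import Job
from awsglue.transforms import ApplyMapping
from awsglue.utils import getResolvedOptions

# Glue passes the job name (and any custom arguments) at runtime
args = getResolvedOptions(sys.argv, ["JOB_NAME"])

sc = SparkContext()
glue_context = GlueContext(sc)
spark = glue_context.spark_session
job = Job(glue_context)
job.init(args["JOB_NAME"], args)

# 1. Extract: read a table the crawler registered in the Data Catalog (placeholder names)
source = glue_context.create_dynamic_frame.from_catalog(
    database="my_database",
    table_name="my_source_table",
)

# 2. Transform: rename and cast columns according to the refined schema
mapped = ApplyMapping.apply(
    frame=source,
    mappings=[
        ("id", "string", "order_id", "string"),
        ("amount", "string", "amount", "double"),
    ],
)

# 3. Load: write the result to the target location (placeholder path)
glue_context.write_dynamic_frame.from_options(
    frame=mapped,
    connection_type="s3",
    connection_options={"path": "s3://my-data-lake/curated/orders/"},
    format="parquet",
)

job.commit()
```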

The Use Case

In this use case, we will develop a sample data pipeline (Glue job) using the AWS TypeScript SDK. The job will read data from a DynamoDB table, perform some data transformation using PySpark, and write it into an S3 bucket in CSV format. DynamoDB is a fully managed, easily scalable NoSQL database service offered by AWS that is used in many applications, while S3 is AWS's general-purpose object storage offering.

For simplicity, we can consider this a use case for moving application or transactional data to the data lake.
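The heart of that Glue job, once the boilerplate from the earlier skeleton is in place, could look roughly like the sketch below; the DynamoDB table name, the dropped and filtered columns, the read-throughput setting, and the output S3 path are all placeholder assumptions:

```python
from pyspark.sql import functions as F

# Extract: read the DynamoDB table through Glue's DynamoDB connector (placeholder name/settings)
dyf = glue_context.create_dynamic_frame.from_options(
    connection_type="dynamodb",
    connection_options={
        "dynamodb.input.tableName": "orders",
        "dynamodb.throughput.read.percent": "0.5",
    },
)

# Transform: convert to a Spark DataFrame and apply ordinary PySpark transformations
df = dyf.toDF()
df = df.drop("internal_notes").filter(F.col("order_status") == "COMPLETED")

# Load: write the result to S3 in CSV format with a header row (placeholder path)
(
    df.coalesce(1)
      .write.mode("overwrite")
      .option("header", "true")
      .csv("s3://my-data-lake/exports/orders/")
)
```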


Project Structure