{"id":1002646,"date":"2021-07-30T12:35:05","date_gmt":"2021-07-30T16:35:05","guid":{"rendered":"http:\/\/875c638a55c13712f0f4f9a72b0735e1aea3d0fe"},"modified":"2021-07-30T12:35:05","modified_gmt":"2021-07-30T16:35:05","slug":"analyze-customer-churn-probability-using-call-transcription-and-customer-profiles-with-amazon-sagemaker","status":"publish","type":"station","link":"https:\/\/platodata.io\/plato-data\/analyze-customer-churn-probability-using-call-transcription-and-customer-profiles-with-amazon-sagemaker\/","title":{"rendered":"Analyze customer churn probability using call transcription and customer profiles with Amazon SageMaker"},"content":{"rendered":"\n<p>Regardless of the industry or product, customers are the most important component in a business\u2019s success and growth. Businesses go to great lengths to acquire customers and, more importantly, to retain the ones they already have. Customer satisfaction links directly to revenue growth, business credibility, and reputation. These are all key factors in a sustainable and long-term business growth strategy.<\/p>\n<p>Given the marketing and operational costs of customer acquisition and satisfaction, and how costly losing a customer to a competitor can be, it\u2019s generally less costly to retain existing customers than to acquire new ones. Therefore, it\u2019s crucial for businesses to understand why and when a customer might stop using their services or switch to a competitor, so they can take proactive measures by providing incentives or offering upgrades for new packages that could encourage the customer to stay with the business.<\/p>\n<p>Customer service interactions provide invaluable insight into the customer\u2019s opinion about the business and its services, and can be used, in addition to other quantitative factors, to enable the business to better understand the sentiment and trends of customer conversations and to identify crucial company and product feedback. 
Customer churn prediction using machine learning (ML) techniques can be a powerful tool for customer service and care.<\/p>\n<p>In this post, we walk you through the process of training and deploying a churn prediction model on <a href=\"https:\/\/aws.amazon.com\/sagemaker\/\" target=\"_blank\" rel=\"noopener noreferrer\">Amazon SageMaker<\/a> that uses <a href=\"https:\/\/huggingface.co\/models\" target=\"_blank\" rel=\"noopener noreferrer\">Hugging Face Transformers<\/a> to find useful signals in customer-agent call transcriptions. In addition to textual inputs, we show you how to incorporate other types of data, such as numerical and categorical features in order to predict customer churn.<\/p>\n<h2>Prerequisites<\/h2>\n<p>To try out the solution in your own account, make sure that you have the following in place:<\/p>\n<p><a href=\"https:\/\/wordpress-1016567-4521551.cloudwaysapps.com\/wp-content\/uploads\/2021\/08\/analyze-customer-churn-probability-using-call-transcription-and-customer-profiles-with-amazon-sagemaker.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-26575\" src=\"https:\/\/wordpress-1016567-4521551.cloudwaysapps.com\/wp-content\/uploads\/2021\/08\/analyze-customer-churn-probability-using-call-transcription-and-customer-profiles-with-amazon-sagemaker.png\" alt width=\"1392\" height=\"636\"><\/a>The JumpStart solution launch creates the resources properly set up and configured to successfully run the solution.<\/p>\n<h2>Architecture overview<\/h2>\n<p>In this solution, we focus on SageMaker components. We use SageMaker training jobs to train the churn prediction model and a SageMaker endpoint to deploy the model. 
We use <a href=\"https:\/\/aws.amazon.com\/s3\/\" target=\"_blank\" rel=\"noopener noreferrer\">Amazon Simple Storage Service<\/a> (Amazon S3) to store the training data and model artifacts, and <a href=\"https:\/\/aws.amazon.com\/cloudwatch\/\" target=\"_blank\" rel=\"noopener noreferrer\">Amazon CloudWatch<\/a> to log training and endpoint outputs. The following figure illustrates the architecture for the solution.<\/p>\n<p><a href=\"https:\/\/wordpress-1016567-4521551.cloudwaysapps.com\/wp-content\/uploads\/2021\/08\/analyze-customer-churn-probability-using-call-transcription-and-customer-profiles-with-amazon-sagemaker-1.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-26576\" src=\"https:\/\/wordpress-1016567-4521551.cloudwaysapps.com\/wp-content\/uploads\/2021\/08\/analyze-customer-churn-probability-using-call-transcription-and-customer-profiles-with-amazon-sagemaker-1.png\" alt width=\"1333\" height=\"769\"><\/a><\/p>\n<h2>Exploring the data<\/h2>\n<p>In this post, we use a mobile operator\u2019s historical records of which customers ended up churning and which continued using the service. The data also includes transcriptions of the latest phone call conversations between the customer and the agent (which could also be the streaming transcription as the call is happening). We can use this historical information to train an ML classifier model, which we can then use to predict the probability of customer churn based on the customer\u2019s profile information and the content of the phone call transcription. We create a SageMaker endpoint to make real-time predictions using the model and provide more insight to customer service agents as they handle customer phone calls.<\/p>\n<p>The dataset we use is synthetically generated and available under the CC BY 4.0 license. 
The data used to generate the numerical and categorical features is based on the public dataset <a href=\"https:\/\/www.kdd.org\/kdd-cup\/view\/kdd-cup-2009\" target=\"_blank\" rel=\"noopener noreferrer\">KDD Cup 2009: Customer relationship prediction<\/a>. We have generated over 50,000 samples and randomly split the data into 45,000 samples for training and 5,000 samples for testing. In addition, the phone conversation transcripts were synthetically generated using the GPT2 (<a href=\"https:\/\/cdn.openai.com\/research-covers\/language-unsupervised\/language_understanding_paper.pdf\" target=\"_blank\" rel=\"noopener noreferrer\">Generative Pre-trained Transformer 2<\/a>) algorithm. The data is hosted on Amazon S3.<\/p>\n<p>For more details on customer churn classification models, including step-by-step instructions on how to build a binary classifier using similar data, see the blog post <a href=\"https:\/\/aws.amazon.com\/blogs\/machine-learning\/predicting-customer-churn-with-amazon-machine-learning\/\" target=\"_blank\" rel=\"noopener noreferrer\">Predicting Customer Churn with Amazon Machine Learning<\/a>. That post focuses on binary classification using tabular data. 
This post approaches the problem from a different perspective and brings in natural language processing (NLP) by processing the content of agent-customer phone conversations.<\/p>\n<p>The following are the attributes (features) of the customer profiles dataset:<\/p>\n<ul>\n<li><strong>CustServ Calls<\/strong> \u2013 The number of calls placed to customer service<\/li>\n<li><strong>State <\/strong>\u2013 The US state in which the customer resides, indicated by a two-letter abbreviation; for example, OH or NJ<\/li>\n<li><strong>VMail Message <\/strong>\u2013 The average number of voice mail messages per month<\/li>\n<li><strong>Account Length <\/strong>\u2013 The number of days that this account has been active<\/li>\n<li><strong>Day Mins, Day Calls, Day Charge <\/strong>\u2013 The minutes used, number of calls placed, and billed cost for calls placed during the day<\/li>\n<li><strong>Eve Mins, Eve Calls, Eve Charge <\/strong>\u2013 The minutes used, number of calls placed, and billed cost for calls placed during the evening<\/li>\n<li><strong>Night Mins, Night Calls, Night Charge <\/strong>\u2013 The minutes used, number of calls placed, and billed cost for calls placed during nighttime<\/li>\n<li><strong>Intl Mins, Intl Calls, Intl Charge <\/strong>\u2013 The minutes used, number of calls placed, and billed cost for international calls<\/li>\n<li><strong>Location <\/strong>\u2013 Whether the customer is located in urban, suburban, rural, or other areas<\/li>\n<li><strong>Plan <\/strong>\u2013 The plan category<\/li>\n<li><strong>Limit <\/strong>\u2013 Limited or unlimited plan type<\/li>\n<li><strong>Text <\/strong>\u2013 The synthetic GPT-2 generated transcription of the customer-agent phone conversation<\/li>\n<li><strong>Y <\/strong>\u2013 Whether the customer left the service (true\/false)<\/li>\n<\/ul>\n<p>The last attribute, <code>Y<\/code>, is known as the <em>target feature<\/em>, or the feature we want the ML model to predict. Because the target feature is binary (true\/false), this is a binary classification problem. 
The model we train later in this post also predicts the probability of churn.<\/p>\n<p>We don\u2019t go over exploratory data analysis in this post. For more details, see <a href=\"https:\/\/aws.amazon.com\/blogs\/machine-learning\/predicting-customer-churn-with-amazon-machine-learning\/\" target=\"_blank\" rel=\"noopener noreferrer\">Predicting Customer Churn with Amazon Machine Learning<\/a> and the <a href=\"https:\/\/github.com\/aws\/amazon-sagemaker-examples\/blob\/master\/introduction_to_applying_machine_learning\/xgboost_customer_churn\/xgboost_customer_churn.ipynb\" target=\"_blank\" rel=\"noopener noreferrer\">Customer Churn Prediction with XGBoost<\/a> sample notebook.<\/p>\n<p>The training script lets the ML practitioner pick and choose the features used in training. In this post, we don\u2019t use all the features; we focus on the maturity of the customer\u2019s account, the number of times the customer has contacted customer service, the type of plan they have, and the transcription of the latest phone call. You can use additional features in training by listing them in the hyperparameters, as we show in the next section.<\/p>\n<p>The transcription of the customer-agent phone call in the <code>text<\/code> column is synthetic text generated by ML models using the GPT2 algorithm. Its purpose is to show how you can apply this solution to real-world customer service phone conversations. GPT2 is an unsupervised transformer language model developed by <a href=\"https:\/\/openai.com\/\" target=\"_blank\" rel=\"noopener noreferrer\">OpenAI<\/a>. It\u2019s a powerful generative NLP model that excels in processing long-range dependencies, and is pre-trained on a diverse corpus of text. 
For more details on how to generate text using GPT2, see <a href=\"https:\/\/aws.amazon.com\/blogs\/awsmarketplace\/experimenting-with-gpt-2-xl-machine-learning-model-package-on-amazon-sagemaker\/\" target=\"_blank\" rel=\"noopener noreferrer\">Experimenting with GPT-2 XL machine learning model package on Amazon SageMaker<\/a> and the <a href=\"https:\/\/github.com\/aws\/amazon-sagemaker-examples\/blob\/master\/aws_marketplace\/using_model_packages\/creative-writing-using-gpt-2-text-generation\/creative-writing-using-gpt-2-text-generation.ipynb\" target=\"_blank\" rel=\"noopener noreferrer\">Creative Writing using GPT2 Text Generation<\/a> example notebook.<\/p>\n<h2>Train the model<\/h2>\n<p>For this post, we use the <a href=\"https:\/\/sagemaker.readthedocs.io\/en\/stable\/frameworks\/pytorch\/sagemaker.pytorch.html#sagemaker.pytorch.estimator.PyTorch\" target=\"_blank\" rel=\"noopener noreferrer\">SageMaker PyTorch Estimator<\/a> to build a SageMaker estimator using an Amazon-built Docker container that runs functions defined in the supplied <code>entry_point<\/code> Python script within a SageMaker training job. The training job is started by calling <code>.fit()<\/code> on this estimator. Later, we deploy the model by calling the <code>.deploy()<\/code> method on the estimator. Visit <a href=\"https:\/\/sagemaker.readthedocs.io\/en\/stable\/\" target=\"_blank\" rel=\"noopener noreferrer\">Amazon SageMaker Python SDK<\/a> technical documentation for more details on preparing PyTorch scripts for SageMaker training and using the PyTorch Estimator.<\/p>\n<p>Also, visit <a href=\"https:\/\/github.com\/aws\/deep-learning-containers\/blob\/master\/available_images.md\" target=\"_blank\" rel=\"noopener noreferrer\">Available Deep Learning Containers Images<\/a> on GitHub to get a list of supported PyTorch versions. At the time of this writing, the latest version available is PyTorch 1.8.1 with Python version 3.6. 
You can update the framework version to the latest supported version by changing the <code>framework_version<\/code> parameter in the PyTorch Estimator. You can also use <a href=\"https:\/\/sagemaker.readthedocs.io\/en\/stable\/api\/utility\/image_uris.html\" target=\"_blank\" rel=\"noopener noreferrer\">SageMaker utility API image URIs<\/a> to get the latest list of supported versions.<\/p>\n<p>The hyperparameters dictionary defines which features we want to use for training and also the number of trees in the forest (<code>n-estimators<\/code>) for the model. You can add any other hyperparameters for the <a href=\"https:\/\/scikit-learn.org\/stable\/modules\/generated\/sklearn.ensemble.RandomForestClassifier.html#sklearn.ensemble.RandomForestClassifier\" target=\"_blank\" rel=\"noopener noreferrer\">RandomForestClassifier<\/a>; however, you also need to revise your custom training script to receive these parameters as arguments (<a href=\"https:\/\/docs.python.org\/3\/library\/argparse.html\" target=\"_blank\" rel=\"noopener noreferrer\">using the argparse library<\/a>) and pass them to your model. 
See the following code:<\/p>\n<div class=\"hide-language\" readability=\"28\">\n<pre><code class=\"lang-python\">from sagemaker.pytorch import PyTorch\n\n# Features used for training and the number of trees in the random forest\nhyperparameters = {\n    \"n-estimators\": 100,\n    \"numerical-feature-names\": \"CustServ Calls,Account Length\",\n    \"categorical-feature-names\": \"plan,limit\",\n    \"textual-feature-names\": \"text\",\n    \"label-name\": \"y\"\n}\n\n# iam_role, base_job_name, and sagemaker_session must be defined beforehand\nestimator = PyTorch(\n    framework_version='1.8.1',\n    py_version='py3',\n    entry_point='entry_point.py',\n    source_dir='path\/to\/source\/directory',\n    hyperparameters=hyperparameters,\n    role=iam_role,\n    instance_count=1,\n    instance_type='ml.p3.2xlarge',\n    output_path='s3:\/\/path\/to\/output\/location',\n    code_location='s3:\/\/path\/to\/code\/location',\n    base_job_name=base_job_name,\n    sagemaker_session=sagemaker_session,\n    volume_size=30\n)<\/code><\/pre>\n<\/p><\/div>\n<p>If you launched the SageMaker JumpStart solution in your account, the custom scripts are available in your Studio files. We use the <code>entry_point.py<\/code> script. This script receives lists of numerical, categorical, and textual features along with the target label, and trains a <a href=\"https:\/\/scikit-learn.org\/stable\/modules\/generated\/sklearn.ensemble.RandomForestClassifier.html#sklearn-ensemble-randomforestclassifier\" target=\"_blank\" rel=\"noopener noreferrer\">SKLearn RandomForestClassifier<\/a> on the data. However, the key here is processing the features before using them in the classifier, especially the call transcription. 
The following figure shows this process, which imputes missing values in numerical features with the mean, applies one-hot encoding to categorical features, and applies transformer-based sentence embeddings to textual features.<\/p>\n<p><a href=\"https:\/\/wordpress-1016567-4521551.cloudwaysapps.com\/wp-content\/uploads\/2021\/08\/analyze-customer-churn-probability-using-call-transcription-and-customer-profiles-with-amazon-sagemaker-2.png\"><img loading=\"lazy\" decoding=\"async\" class=\"size-full wp-image-26587 aligncenter\" src=\"https:\/\/wordpress-1016567-4521551.cloudwaysapps.com\/wp-content\/uploads\/2021\/08\/analyze-customer-churn-probability-using-call-transcription-and-customer-profiles-with-amazon-sagemaker-2.png\" alt width=\"400\" height=\"215\"><\/a><\/p>\n<p>The purpose of the script presented in this post is to provide an example of how you can develop your own custom feature transformation pipeline. You can apply other transformations to the data based on your specific use case and the nature of your dataset, and make the pipeline as complex or as simple as you want. For example, depending on the nature of your dataset and the results of the exploratory data analysis, you may want to consider normalization, log transformation, or dropping records with null values. For a more complete list of feature transformation techniques, visit <a href=\"https:\/\/scikit-learn.org\/stable\/data_transforms.html\" target=\"_blank\" rel=\"noopener noreferrer\">SKLearn Dataset Transformations<\/a>.<\/p>\n<p>The following code snippet shows you how to instantiate these transformers for numerical and categorical features, and how to apply them to your dataset. 
You can find more details on how this is done in the <code>entry_point.py<\/code> script that the JumpStart solution places in your files.<\/p>\n<div class=\"hide-language\" readability=\"17\">\n<pre><code class=\"lang-python\">from pathlib import Path\n\nimport joblib\nimport numpy as np\nfrom sklearn.impute import SimpleImputer\nfrom sklearn.preprocessing import OneHotEncoder\n\n# Instantiate transformers\nnumerical_transformer = SimpleImputer(missing_values=np.nan, strategy='mean', add_indicator=True)\ncategorical_transformer = OneHotEncoder(handle_unknown=\"ignore\")\n\n# Train transformers on data, and store transformers for future use by predict function\nnumerical_transformer.fit(numerical_features)\njoblib.dump(numerical_transformer, Path(args.model_dir, \"numerical_transformer.joblib\"))\n\ncategorical_transformer.fit(categorical_features)\njoblib.dump(categorical_transformer, Path(args.model_dir, \"categorical_transformer.joblib\"))\n\n# Transform the data\nnumerical_features = numerical_transformer.transform(numerical_features)\ncategorical_features = categorical_transformer.transform(categorical_features)<\/code><\/pre>\n<\/p><\/div>\n<p>Now let\u2019s focus on the textual data. We use <a href=\"https:\/\/huggingface.co\/sentence-transformers\" target=\"_blank\" rel=\"noopener noreferrer\">Hugging Face sentence transformers<\/a>, which you can use for sentence embedding generation. They come with pre-trained models that you can use out of the box based on your use case. 
In this post, we use the <a href=\"https:\/\/huggingface.co\/sentence-transformers\/bert-base-nli-cls-token\" target=\"_blank\" rel=\"noopener noreferrer\">bert-base-nli-cls-token<\/a> model, which is described in <a href=\"https:\/\/arxiv.org\/abs\/1908.10084\" target=\"_blank\" rel=\"noopener noreferrer\">Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks<\/a>.<\/p>\n<p>Recently, SageMaker introduced new <a href=\"https:\/\/github.com\/aws\/deep-learning-containers\/blob\/master\/available_images.md#huggingface-training-containers\" target=\"_blank\" rel=\"noopener noreferrer\">Hugging Face Deep Learning Containers (DLCs)<\/a> that enable you to train, fine-tune, and run inference using Hugging Face models for NLP on SageMaker. In this post, we use the PyTorch container and a custom training script. For this purpose, in our training script, we define a <code>BertEncoder<\/code> class based on Hugging Face <code>SentenceTransformer<\/code> and define the pre-trained model as <code>bert-base-nli-cls-token<\/code>, as shown in the following code. Wrapping the model in a class lets us apply it to the dataset in the same way as the other dataset transformers, by calling the <code>.transform()<\/code> method. The benefit of using Hugging Face pre-trained models is that you don\u2019t need to do additional training to be able to use the model. 
However, you can still fine-tune the models with custom data, as described in <a href=\"https:\/\/huggingface.co\/transformers\/training.html\" target=\"_blank\" rel=\"noopener noreferrer\">Fine-tuning a pretrained model<\/a>.<\/p>\n<div class=\"hide-language\" readability=\"15\">\n<pre><code class=\"lang-python\">from sentence_transformers import SentenceTransformer\nfrom sklearn.base import BaseEstimator, TransformerMixin\n\n\n# Define a class for BertEncoder\nclass BertEncoder(BaseEstimator, TransformerMixin):\n    def __init__(self, model_name='bert-base-nli-cls-token'):\n        self.model = SentenceTransformer(model_name)\n        self.model.parallel_tokenization = False\n\n    def fit(self, X, y=None):\n        # Nothing to fit: the model is pre-trained\n        return self\n\n    def transform(self, X):\n        # Encode each sample into a sentence embedding\n        output = []\n        for sample in X:\n            encodings = self.model.encode(sample)\n            output.append(encodings)\n        return output\n\n\n# Instantiate the class\ntextual_transformer = BertEncoder()\n\n# Apply the transformation to textual features\ntextual_features = textual_transformer.transform(textual_features)<\/code><\/pre>\n<\/p><\/div>\n<p>Now that the dataset is processed and ready to be consumed by an ML model, we can train any classifier model to predict whether a customer will churn. In addition to predicting the class (0\/1 or true\/false) for customer churn, these models also generate the probability of each class, meaning the probability of a customer churning. This is particularly useful for customer service teams for strategizing the incentives or upgrades they can offer to the customer based on how likely the customer is to cancel the service or subscription. In this post, we use the <a href=\"https:\/\/scikit-learn.org\/stable\/modules\/generated\/sklearn.ensemble.RandomForestClassifier.html#sklearn-ensemble-randomforestclassifier\" target=\"_blank\" rel=\"noopener noreferrer\">SKLearn RandomForestClassifier<\/a> model. You can choose from many hyperparameters for this model and also optimize the hyperparameters for a more accurate model prediction by using strategies like grid search, random search, and Bayesian search. 
SageMaker <a href=\"https:\/\/docs.aws.amazon.com\/sagemaker\/latest\/dg\/automatic-model-tuning-how-it-works.html\" target=\"_blank\" rel=\"noopener noreferrer\">automatic hyperparameter tuning <\/a>can be a powerful tool for this purpose.<\/p>\n<p>Training the model in <code>entry_point.py<\/code> is handled by the <code>train_fn()<\/code> function in the custom script. This function is called when the <code>.fit()<\/code> method is applied to the estimator. This function also stores the trained model and trained data transformers on Amazon S3. These files are used later by <code>model_fn()<\/code> to load the model for inference purposes.<\/p>\n<p><code>train_fn()<\/code> also includes evaluation of the trained model, and provides accuracy scores for the model for both train and test datasets. This helps you better evaluate model performance. Because this is a classification problem, we recommend including other metrics in your evaluation script, for example <a href=\"https:\/\/scikit-learn.org\/stable\/modules\/generated\/sklearn.metrics.f1_score.html#sklearn-metrics-f1-score\" target=\"_blank\" rel=\"noopener noreferrer\">F1 score<\/a>, <a href=\"https:\/\/scikit-learn.org\/stable\/modules\/generated\/sklearn.metrics.roc_auc_score.html#sklearn-metrics-roc-auc-score\" target=\"_blank\" rel=\"noopener noreferrer\">ROC AUC score<\/a>, and <a href=\"https:\/\/scikit-learn.org\/stable\/modules\/generated\/sklearn.metrics.recall_score.html#sklearn-metrics-recall-score\" target=\"_blank\" rel=\"noopener noreferrer\">recall score<\/a>, the same way we added accuracy scores. These are printed as the training progresses. Because we\u2019re using synthetic data for training the model in this example notebook, especially for the agent-customer call transcription, we\u2019re not expecting to see high-performing models with regards to classification metrics, and therefore we\u2019re not focusing on these metrics in this example. 
However, when you use your own data, you should consider how each classification metric could impact the applicability of the model to your use case. Training this model on 45,000 samples on an ml.p3.2xlarge instance takes about 30 minutes.<\/p>\n<div class=\"hide-language\" readability=\"9\">\n<pre><code class=\"lang-python\">estimator.fit({\n    'train': 's3:\/\/path\/to\/your\/train.jsonl',\n    'test': 's3:\/\/path\/to\/your\/test.jsonl'\n})<\/code><\/pre>\n<\/p><\/div>\n<p>When you\u2019re comfortable with the performance of your model, you can move to the next step, which is deploying your model for real-time inference.<\/p>\n<h2>Deploy the model<\/h2>\n<p>When the training is complete, you can <a href=\"https:\/\/docs.aws.amazon.com\/sagemaker\/latest\/dg\/how-it-works-deployment.html#how-it-works-hosting\" target=\"_blank\" rel=\"noopener noreferrer\">deploy the model as a SageMaker hosted endpoint<\/a> for real-time inference, or use the model for offline batch inference, using <a href=\"https:\/\/docs.aws.amazon.com\/sagemaker\/latest\/dg\/how-it-works-batch.html\" target=\"_blank\" rel=\"noopener noreferrer\">SageMaker batch transform<\/a>. 
The task of performing inference (either real time or batch) is handled by four main functions in the custom script:<\/p>\n<ul>\n<li><code>input_fn()<\/code> processes the input data<\/li>\n<li><code>model_fn()<\/code> loads the trained model artifacts from Amazon S3<\/li>\n<li><code>predict_fn()<\/code> makes predictions<\/li>\n<li><code>output_fn()<\/code> prepares the model output<\/li>\n<\/ul>\n<p>The following diagram illustrates this process.<\/p>\n<p><a href=\"https:\/\/wordpress-1016567-4521551.cloudwaysapps.com\/wp-content\/uploads\/2021\/08\/analyze-customer-churn-probability-using-call-transcription-and-customer-profiles-with-amazon-sagemaker-3.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-26578\" src=\"https:\/\/wordpress-1016567-4521551.cloudwaysapps.com\/wp-content\/uploads\/2021\/08\/analyze-customer-churn-probability-using-call-transcription-and-customer-profiles-with-amazon-sagemaker-3.png\" alt width=\"1276\" height=\"533\"><\/a><\/p>\n<p>The following script is a snippet of the <code>entry_point.py<\/code> script, and shows how the four functions work together to perform inference:<\/p>\n<div class=\"hide-language\" readability=\"36\">\n<pre><code class=\"lang-python\">import json\nfrom pathlib import Path\n\nimport joblib\nimport numpy as np\n\n# load_feature_names(), extract_features(), and BertEncoder are defined elsewhere in entry_point.py\n\n\n# Model function to load the trained model and trained transformers from S3\ndef model_fn(model_dir):\n    print('loading feature_names')\n    numerical_feature_names, categorical_feature_names, textual_feature_names = load_feature_names(Path(model_dir, \"feature_names.json\"))\n    print('loading numerical_transformer')\n    numerical_transformer = joblib.load(Path(model_dir, \"numerical_transformer.joblib\"))\n    print('loading categorical_transformer')\n    categorical_transformer = joblib.load(Path(model_dir, \"categorical_transformer.joblib\"))\n    print('loading textual_transformer')\n    textual_transformer = BertEncoder()\n    classifier = joblib.load(Path(model_dir, \"classifier.joblib\"))\n    model_assets = {\n        'numerical_feature_names': numerical_feature_names,\n        'numerical_transformer': numerical_transformer,\n        'categorical_feature_names': categorical_feature_names,\n        'categorical_transformer': categorical_transformer,\n        'textual_feature_names': textual_feature_names,\n        'textual_transformer': textual_transformer,\n        'classifier': classifier\n    }\n    return model_assets\n\n\n# Input Preparation Function to receive the request body and ensure proper format\ndef input_fn(request_body_str, request_content_type):\n    assert (\n        request_content_type == \"application\/json\"\n    ), \"content_type must be 'application\/json'\"\n    request_body = json.loads(request_body_str)\n    return request_body\n\n\n# Predict function to make inference\ndef predict_fn(request, model_assets):\n    print('making batch')\n    request = [request]\n    print('extracting features')\n    numerical_features, categorical_features, textual_features = extract_features(\n        request,\n        model_assets['numerical_feature_names'],\n        model_assets['categorical_feature_names'],\n        model_assets['textual_feature_names']\n    )\n    print('transforming numerical_features')\n    numerical_features = model_assets['numerical_transformer'].transform(numerical_features)\n    print('transforming categorical_features')\n    categorical_features = model_assets['categorical_transformer'].transform(categorical_features)\n    print('transforming textual_features')\n    textual_features = model_assets['textual_transformer'].transform(textual_features)\n    # Concatenate Features\n    print('concatenating features')\n    categorical_features = categorical_features.toarray()\n    textual_features = np.array(textual_features)\n    textual_features = textual_features.reshape(textual_features.shape[0], -1)\n    features = np.concatenate([\n        numerical_features,\n        categorical_features,\n        textual_features\n    ], axis=1)\n    print('predicting using model')\n    prediction = model_assets['classifier'].predict_proba(features)\n    # Probability of the positive (churn) class for the single record in the batch\n    probability = prediction[0][1].tolist()\n    output = {\n        'probability': probability\n    }\n    return output\n\n\n# Output function to prepare the output\ndef output_fn(prediction, response_content_type):\n    assert (\n        response_content_type == \"application\/json\"\n    ), \"accept must be 'application\/json'\"\n    response_body_str = json.dumps(prediction)\n    return response_body_str<\/code><\/pre>\n<\/p><\/div>\n<p>When the training is complete, we deploy the model by calling the <code>.deploy()<\/code> method on the estimator and specifying the number and type of instances we want to attach to the endpoint; SageMaker manages the infrastructure on your behalf. When calling the endpoint from the notebook, we use a SageMaker SDK <a href=\"https:\/\/sagemaker.readthedocs.io\/en\/stable\/predictors.html\" target=\"_blank\" rel=\"noopener noreferrer\">predictor<\/a>. The predictor sends data to an endpoint (as part of a request), and interprets the response. See the following code:<\/p>\n<div class=\"hide-language\" readability=\"11\">\n<pre><code class=\"lang-python\">from sagemaker.deserializers import JSONDeserializer\nfrom sagemaker.serializers import JSONSerializer\n\n# Deploy the predictor\npredictor = estimator.deploy(\n    endpoint_name=endpoint_name,\n    instance_type='ml.p3.2xlarge',\n    initial_instance_count=1\n)\n\n# Send and receive JSON payloads\npredictor.serializer = JSONSerializer()\npredictor.deserializer = JSONDeserializer()<\/code><\/pre>\n<\/p><\/div>\n<p>This deploys the model as an endpoint predictor. After deployment is complete, we can use the predictor to make predictions on sample data. Let\u2019s determine the probability of churn for a hypothetical customer:<\/p>\n<div class=\"hide-language\" readability=\"15\">\n<pre><code class=\"lang-python\">data = {\n    \"CustServ Calls\": 10.0,\n    \"Account Length\": 66,\n    \"plan\": \"B\",\n    \"limit\": \"limited\",\n    \"text\": \"Well, I've been dealing with TelCom for three months now and I am quite happy with your service\"\n}\n\nresponse = predictor.predict(data=data)\n\nprint(\"{:.2%} probability of churn\".format(response['probability']))<\/code><\/pre>\n<\/p><\/div>\n<p>In this case, the probability of churn is about 31%. For the same customer, we change the transcript to \u201cI have been using your service for 6 months and I am disappointed in your customer service.\u201d The probability of churn increases to over 46%. 
This demonstrates that a change in the customer\u2019s sentiment affects the probability of churn.<\/p>\n<h2>Clean up<\/h2>\n<p>To clean up the resources and stop incurring charges in your account, you can delete the endpoint:<\/p>\n<div class=\"hide-language\" readability=\"7\">\n<pre><code class=\"lang-python\">predictor.delete_endpoint()<\/code><\/pre>\n<\/p><\/div>\n<h2>Extensions<\/h2>\n<p>As we explained earlier, you can use additional features in training and also incorporate more feature transformers in the feature engineering pipeline, which can help improve model performance.<\/p>\n<p>In addition, now that you have a working endpoint that is performing real-time inference, you can use it for your applications or website. However, your SageMaker endpoint is still not public facing, so you need to build an API Gateway to allow external traffic to your SageMaker endpoint. <a href=\"https:\/\/aws.amazon.com\/api-gateway\/\" target=\"_blank\" rel=\"noopener noreferrer\">Amazon API Gateway<\/a> is a fully managed service that makes it easy for developers to create, publish, maintain, monitor, and secure APIs at any scale. You can use API Gateway to present an external-facing, single point of entry for SageMaker endpoints, and provide security, throttling, authentication, firewall as provided by <a href=\"https:\/\/aws.amazon.com\/waf\/\" target=\"_blank\" rel=\"noopener noreferrer\">AWS WAF<\/a>, and more. 
With API Gateway mapping templates, you can invoke your SageMaker endpoint with a REST API request and receive an API response back without needing any intermediate <a href=\"https:\/\/aws.amazon.com\/lambda\/\" target=\"_blank\" rel=\"noopener noreferrer\">AWS Lambda<\/a> functions, thereby improving the performance and cost-effectiveness of your applications.<\/p>\n<p>To create an API Gateway and use it to perform real-time inference with your SageMaker endpoint (see the following architecture), you can follow the instructions outlined in <a href=\"https:\/\/aws.amazon.com\/blogs\/machine-learning\/creating-a-machine-learning-powered-rest-api-with-amazon-api-gateway-mapping-templates-and-amazon-sagemaker\/\" target=\"_blank\" rel=\"noopener noreferrer\">Creating a machine learning-powered REST API with Amazon API Gateway mapping templates and Amazon SageMaker<\/a>.<\/p>\n<p><a href=\"https:\/\/wordpress-1016567-4521551.cloudwaysapps.com\/wp-content\/uploads\/2021\/08\/analyze-customer-churn-probability-using-call-transcription-and-customer-profiles-with-amazon-sagemaker-4.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-26648\" src=\"https:\/\/wordpress-1016567-4521551.cloudwaysapps.com\/wp-content\/uploads\/2021\/08\/analyze-customer-churn-probability-using-call-transcription-and-customer-profiles-with-amazon-sagemaker-4.png\" alt width=\"400\" height=\"138\"><\/a><\/p>\n<p>In addition, you can use <a href=\"https:\/\/aws.amazon.com\/transcribe\/\" target=\"_blank\" rel=\"noopener noreferrer\">Amazon Transcribe<\/a> to generate transcriptions of recorded customer-agent conversations and use them for training purposes, and also use <a href=\"https:\/\/docs.aws.amazon.com\/transcribe\/latest\/dg\/streaming.html\" target=\"_blank\" rel=\"noopener noreferrer\">Amazon Transcribe streaming<\/a> to send the conversation audio stream and receive a stream of text in real time. 
You can use this text stream to add a real-time speech-to-text capability to your applications and also send that text to the endpoint and provide customer churn insights to your customer service agents in real time.<\/p>\n<h2>Conclusions<\/h2>\n<p>In this post, we explained an end-to-end solution for creating a customer churn prediction model based on customer profiles and customer-agent call transcriptions. The solution included training a PyTorch model with a custom script and creating an endpoint for real-time model hosting. We also explained how you can create a public-facing API Gateway that can be securely used in your mobile applications or website. In addition, we explained how you can use Amazon Transcribe for batch or real-time transcription of customer-agent conversations, which you can use for training of your model or real-time inference.<\/p>\n<p>For more SageMaker examples, visit the <a href=\"https:\/\/github.com\/aws\/amazon-sagemaker-examples\" target=\"_blank\" rel=\"noopener noreferrer\">Amazon SageMaker Examples<\/a> GitHub repo. For more PyTorch BYO script examples, visit the following <a href=\"https:\/\/github.com\/aws\/amazon-sagemaker-examples\/tree\/35e2faf7d1cc48ccedf0b2ede1da9987a18727a5\/advanced_functionality\/mxnet_mnist_byom\" target=\"_blank\" rel=\"noopener noreferrer\">GitHub repository<\/a>. For more SageMaker Python examples for MXNet, TensorFlow, and PyTorch, visit the <a href=\"https:\/\/github.com\/aws\/amazon-sagemaker-examples\/tree\/35e2faf7d1cc48ccedf0b2ede1da9987a18727a5\/sagemaker-python-sdk\" target=\"_blank\" rel=\"noopener noreferrer\">Amazon SageMaker Pre-Built Framework Containers and the Python SDK<\/a> GitHub repo. 
Additional information about SageMaker is available in the <a href=\"https:\/\/docs.aws.amazon.com\/sagemaker\/index.html\" target=\"_blank\" rel=\"noopener noreferrer\">technical documentation<\/a>.<\/p>\n<hr>\n<h3>About the Author<\/h3>\n<p><strong><a href=\"https:\/\/wordpress-1016567-4521551.cloudwaysapps.com\/wp-content\/uploads\/2021\/08\/analyze-customer-churn-probability-using-call-transcription-and-customer-profiles-with-amazon-sagemaker.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"size-full wp-image-12606 alignleft\" src=\"https:\/\/wordpress-1016567-4521551.cloudwaysapps.com\/wp-content\/uploads\/2021\/08\/analyze-customer-churn-probability-using-call-transcription-and-customer-profiles-with-amazon-sagemaker.jpg\" alt width=\"100\" height=\"134\"><\/a>Nick Minaie<\/strong> is a Sr AI\/ML Specialist Solutions Architect with AWS, helping customers on their journey to well-architected machine learning solutions at scale. In his spare time, Nick enjoys family time, abstract painting, and exploring nature.<\/p>\n<p><strong><a href=\"https:\/\/wordpress-1016567-4521551.cloudwaysapps.com\/wp-content\/uploads\/2021\/08\/analyze-customer-churn-probability-using-call-transcription-and-customer-profiles-with-amazon-sagemaker-5.png\"><img loading=\"lazy\" decoding=\"async\" class=\"size-full wp-image-26580 alignleft\" src=\"https:\/\/wordpress-1016567-4521551.cloudwaysapps.com\/wp-content\/uploads\/2021\/08\/analyze-customer-churn-probability-using-call-transcription-and-customer-profiles-with-amazon-sagemaker-5.png\" alt width=\"100\" height=\"96\"><\/a> Ehsan M. Kermani<\/strong> is a Machine Learning Engineer in the AWS ML Automation Services group. 
He helps customers through their MLOps journey by providing his expertise in Software Engineering best practices to solve customers\u2019 end-to-end Machine Learning tasks from infrastructure to deployment.<\/p>\n<p><strong><a href=\"https:\/\/wordpress-1016567-4521551.cloudwaysapps.com\/wp-content\/uploads\/2021\/08\/analyze-customer-churn-probability-using-call-transcription-and-customer-profiles-with-amazon-sagemaker-1.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-26584 size-full alignleft\" src=\"https:\/\/wordpress-1016567-4521551.cloudwaysapps.com\/wp-content\/uploads\/2021\/08\/analyze-customer-churn-probability-using-call-transcription-and-customer-profiles-with-amazon-sagemaker-1.jpg\" alt width=\"100\" height=\"75\"><\/a>Dr. Li Zhang<\/strong> is a Principal Product Manager-Technical for Amazon SageMaker JumpStart and Amazon SageMaker built-in algorithms, a service that helps data scientists and machine learning practitioners get started with training and deploying their models, and uses reinforcement learning with Amazon SageMaker. 
His past work as a principal research staff member and master inventor at IBM Research has won the test of time paper award at IEEE INFOCOM.<\/p>\n<p> <a href=\"https:\/\/aws.amazon.com\/blogs\/machine-learning\/analyze-customer-churn-probability-using-call-transcription-and-customer-profiles-with-amazon-sagemaker\/\">Source: https:\/\/aws.amazon.com\/blogs\/machine-learning\/analyze-customer-churn-probability-using-call-transcription-and-customer-profiles-with-amazon-sagemaker\/<\/a><\/p>\n","protected":false},"author":1,"featured_media":1002647,"template":"","meta":{"_eb_attr":"","type":"","auto_type":false,"post":"","stream":"","stream_url":"","waveform_data":[],"duration":0,"start":0,"end":0,"bpm":0,"downloadable":false,"download_url":"","purchase_title":"","purchase_url":"","post-count-all":0,"like_count":0,"download_count":0,"editor_note":"","copyright":"","captions":[],"sources":[]},"genre":[10305],"station_tag":[43552,3758,3785,3759,3629,4262,4263,5590,3942,4133,4339,4044,3761,4735,16282,15796,18054,4045,4604,4896,3681,4243,6009,4523,4554,4050,5717,4139,4140,5355,4265,4791,4526,4053,4054,3886,4868,4147,4400,5253,6052,3720,5644,3830,5619,6981,7141,6978,4863,5340,4058,4152,7647,4059,3833,3642,6543,3792,4156,15624,4248,3896,7810,44455,6163,5201,4445,4741,4920,4473,3729,4382,4175,4068,4069,4177,5175,4070,4691,3950,5646,4311,4979,3847,3731,3732,11956,3694,4185,3650,4568,4078,27708,4186,3803,3953,4009,3908,3909,4080,3954,3653,4537,4596,3805,4572,3703,3734,4010,5230,4200,3911,4965,4088,4089,3737,4451,475,4093,37578,3961,5903,4094,43589,3660,3706,43581,16265,16762,3855,4280,15633,4731,3662,3663,44231,4321,4857,4102,4750,7671,4018,5125,4323,7934,3939,4682,4827,4215,5553,15313,24360,5947,7118,4109,43704,3710,4112,3776,4260,4433,3868,3921,5149,21084,4117,4580,6196,3811,4220,3812,3975,3976,4461,3779,4282,4462,4223,4392,4860,3668,4285,3780,4901,4032,4261,3928,4412,4414,3929,4034,3670,5033,34331,4675,4126,16761,4128,4361,3671,5207,7291,7308,6377,4129,3674,5101,7031,3984,3
781,4589,4396,3816,3935,467,4927],"artist":[10682],"mood":[],"activity":[],"_links":{"self":[{"href":"https:\/\/platodata.io\/wp-json\/wp\/v2\/station\/1002646"}],"collection":[{"href":"https:\/\/platodata.io\/wp-json\/wp\/v2\/station"}],"about":[{"href":"https:\/\/platodata.io\/wp-json\/wp\/v2\/types\/station"}],"author":[{"embeddable":true,"href":"https:\/\/platodata.io\/wp-json\/wp\/v2\/users\/1"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/platodata.io\/wp-json\/"}],"wp:attachment":[{"href":"https:\/\/platodata.io\/wp-json\/wp\/v2\/media?parent=1002646"}],"wp:term":[{"taxonomy":"genre","embeddable":true,"href":"https:\/\/platodata.io\/wp-json\/wp\/v2\/genre?post=1002646"},{"taxonomy":"station_tag","embeddable":true,"href":"https:\/\/platodata.io\/wp-json\/wp\/v2\/station_tag?post=1002646"},{"taxonomy":"artist","embeddable":true,"href":"https:\/\/platodata.io\/wp-json\/wp\/v2\/artist?post=1002646"},{"taxonomy":"mood","embeddable":true,"href":"https:\/\/platodata.io\/wp-json\/wp\/v2\/mood?post=1002646"},{"taxonomy":"activity","embeddable":true,"href":"https:\/\/platodata.io\/wp-json\/wp\/v2\/activity?post=1002646"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}