Predicting Time to Merge of a Pull Request

One of the machine learning explorations within the OpenShift CI Analysis project is predicting the time to merge of a pull request (see this issue for more details). In a previous notebook we showed how to access the PR data for the openshift/origin repo, and then performed initial data analysis and feature engineering on it. We also split the time_to_merge values for the PRs into the following 10 discrete, equally populated bins, so that the task becomes a classification problem (a rough sketch of this binning follows the list):

Class 0: < 3 hrs
Class 1: < 6 hrs
Class 2: < 15 hrs
Class 3: < 24 hrs / 1 day
Class 4: < 36 hrs / 1.5 days
Class 5: < 60 hrs / 2.5 days
Class 6: < 112 hrs / ~4.5 days
Class 7: < 190 hrs / ~8 days
Class 8: < 462 hrs / ~19 days
Class 9: > 462 hrs
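As a rough illustration (not the exact code from the previous notebook), equal-frequency binning like this can be produced with pandas.qcut. The dataframe and column names below (`pr_df`, `time_to_merge`) are assumptions standing in for the engineered dataset:

```python
import pandas as pd

# Hypothetical sketch: split time_to_merge (in hours) into 10 equally
# populated bins and use the bin index (0-9) as the class label.
pr_df["ttm_class"], bin_edges = pd.qcut(
    pr_df["time_to_merge"], q=10, labels=False, retbins=True
)
print(bin_edges)  # approximate hour thresholds for the 10 classes listed above
```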

In this notebook, we will train a machine learning model to classify the time_to_merge values for PRs into one of these 10 bins (or "classes"), using the features engineered from the raw PR data.

Scale data
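A minimal sketch of this step, assuming the train/test split from the previous notebook is available as `X_train` and `X_test` (names are illustrative):

```python
from sklearn.preprocessing import StandardScaler

# Fit the scaler on the training features only, then apply the same
# transformation to the test features to avoid data leakage.
scaler = StandardScaler().fit(X_train)
X_train_scaled = scaler.transform(X_train)
X_test_scaled = scaler.transform(X_test)
```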

Define Training and Evaluation Pipeline

Here, we will define a function to train a given classifier on the training set and then evaluate it on the test set.
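The exact helper used in the notebook is not reproduced here; the following is a sketch of what such a function could look like, assuming scaled feature arrays and integer class labels:

```python
from sklearn.metrics import classification_report, confusion_matrix

def train_evaluate(clf, X_train, y_train, X_test, y_test):
    """Fit the classifier on the training set and report test-set metrics."""
    clf.fit(X_train, y_train)
    preds = clf.predict(X_test)
    print(classification_report(y_test, preds, zero_division=0))
    print(confusion_matrix(y_test, preds))
    return clf
```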

Define Models and Parameters

Next, we will define and initialize the classifiers that we will be exploring for the time-to-merge prediction task.

Gaussian Naive Bayes
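A sketch with default settings (the project's actual configuration may differ):

```python
from sklearn.naive_bayes import GaussianNB

gnb = GaussianNB()
```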

SVM
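A sketch; the kernel and regularization strength shown are illustrative defaults, not the tuned values:

```python
from sklearn.svm import SVC

# RBF kernel with default regularization, as a placeholder configuration.
svc = SVC(kernel="rbf", C=1.0)
```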

Random Forest
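A sketch with illustrative hyperparameters:

```python
from sklearn.ensemble import RandomForestClassifier

# Number of trees and random seed are placeholders, not the project's values.
rf = RandomForestClassifier(n_estimators=200, random_state=42)
```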

XGBoost
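A sketch of a multi-class XGBoost setup; the actual hyperparameters used in the notebook may differ:

```python
from xgboost import XGBClassifier

# Illustrative configuration for the 10-class time_to_merge problem.
xgb = XGBClassifier(objective="multi:softmax", eval_metric="mlogloss")
```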

Compare Model Results

Finally, we will train all of the classifiers defined above and evaluate their performance.

Train using all features

First, let's train the classifiers using all of the engineered features as input.
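Using the names assumed in the sketches above, the training loop could look roughly like this:

```python
# Train and evaluate each classifier on the scaled feature set.
classifiers = {
    "Gaussian Naive Bayes": gnb,
    "SVM": svc,
    "Random Forest": rf,
    "XGBoost": xgb,
}

for name, clf in classifiers.items():
    print(f"===== {name} =====")
    train_evaluate(clf, X_train_scaled, y_train, X_test_scaled, y_test)
```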

Based on the results above, all of the models outperform a random guess. The XGBoost classifier performs best, followed closely by random forest. The Naive Bayes and SVM models appear heavily biased towards a few classes. In contrast, the random forest and XGBoost models appear less biased, and their misclassifications tend to fall within the neighboring classes of the ordinal target.

Note that for model deployment (which is the eventual goal), we will also need to include any scaler or preprocessor objects, because the input to the inference service will be raw, unscaled data. We plan to address this by using an sklearn Pipeline object to package the preprocessor(s) and the model as one "combined" model. Since an XGBoost model baked into an sklearn Pipeline might be complicated to serve on a Seldon sklearn server, and since random forest performs almost as well as XGBoost, we will save the random forest as the "best" model here. In the step below, we create a copy of the model so that we can save it to S3 later on and use it for model deployment.

Train using pruned features

In the previous notebook we performed some feature engineering and pruned the number of features down to 96. However, further pruning the features based on the importances assigned to them by the models might yield more generalizable and accurate models. So in this section, we will explore using Recursive Feature Elimination (RFE) to rank the features in terms of their importance, and recursively select the best subsets to train our models with.
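A sketch of the RFE step, assuming the scaled feature arrays from above and keeping the top 20 features (the cutoff explored later in this notebook); the estimator and its hyperparameters are illustrative:

```python
from sklearn.feature_selection import RFE
from sklearn.ensemble import RandomForestClassifier

# Rank features by recursively eliminating the least important ones, using
# a random forest's feature_importances_ as the ranking criterion.
selector = RFE(
    estimator=RandomForestClassifier(n_estimators=200, random_state=42),
    n_features_to_select=20,
)
selector.fit(X_train_scaled, y_train)
X_train_rfe = selector.transform(X_train_scaled)
X_test_rfe = selector.transform(X_test_scaled)
```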

From the confusion matrices above, we can conclude that the models perform slightly better when trained using all the features, instead of using only the RFE-pruned subset.

Create sklearn Pipeline

Here, we will create an sklearn pipeline consisting of two steps: scaling of the input features, followed by the classifier itself. We will then save this pipeline as a model.joblib file on S3 for serving the model pipeline using the Seldon Sklearn Server.
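A sketch of this packaging step, using the same illustrative hyperparameters as above and fitting on the raw (unscaled) training data so the pipeline handles scaling at inference time:

```python
import joblib
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.ensemble import RandomForestClassifier

# Bundle the scaler and the random forest classifier into one object,
# then serialize it in the format expected by the Seldon sklearn server.
pipeline = Pipeline([
    ("scaler", StandardScaler()),
    ("classifier", RandomForestClassifier(n_estimators=200, random_state=42)),
])
pipeline.fit(X_train, y_train)
joblib.dump(pipeline, "model.joblib")
```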

Write Model to S3
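A hypothetical upload sketch using boto3; the endpoint, bucket, object key, and credential handling below are placeholders and will differ in the actual project setup:

```python
import os
import boto3

# Placeholder S3 configuration read from environment variables.
s3 = boto3.client(
    "s3",
    endpoint_url=os.getenv("S3_ENDPOINT_URL"),
    aws_access_key_id=os.getenv("AWS_ACCESS_KEY_ID"),
    aws_secret_access_key=os.getenv("AWS_SECRET_ACCESS_KEY"),
)
# Hypothetical bucket and key for the serialized pipeline.
s3.upload_file("model.joblib", os.getenv("S3_BUCKET"), "ttm-model/model.joblib")
```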

Conclusion

In this notebook, we explored various vanilla classifiers, namely Naive Bayes, SVM, Random Forest, and XGBoost. The XGBoost classifier was able to predict the classes with a weighted average f1 score of 0.21 and an accuracy of 22% when trained using all the available features. Additionally, all of the models perform better when trained using all available features than when trained using only the top 20 features determined using RFE.

Even though all models outperform the baseline (random guess), we believe there is still some room for improvement. Since the target variable of the github PR dataset is an ordinal variable, an ordinal classifier could perform better than the models trained in this notebook. We will explore this idea in a future notebook.

As the immediate next step, we will deploy the best model from this notebook as an inference service using Seldon.