Dataset research report
Swe Gym research report
A reproducible data report with schema notes, generated chart evidence, suggested follow-up questions, and export-ready Helix queries.
Executive Summary
SWE-Gym contains 2438 instances sourced from 11 Python repos, following SWE-Bench data collection procedure. Get started at project page github.com/SWE-Gym/SWE-Gym
Research Context
Swe Gym: 500 rows by 11 columns. These exploratory charts are generated automatically from the data - open the dataset in Helix to ask your own questions.
Data Profile
Chart Evidence
These views are generated from the dataset profile. Each chart is paired with a Helix query so it can be opened, adjusted, and exported.
Follow-Up Queries
Preview Rows
| # | instance_idtext | hints_texttext | patchtext | test_patchtext | created_attext | problem_statementtext | repotext | base_committext |
|---|---|---|---|---|---|---|---|---|
| 1 | getmoto__moto-7365 | diff --git a/moto/dynamodb/models/dynamo_type.py b/moto/dynamodb/models/dynamo_type.py --- a/moto/dynamodb/models/dynamo_type.py +++ b/moto… | diff --git a/tests/test_dynamodb/test_dynamodb_update_expressions.py b/tests/test_dynamodb/test_dynamodb_update_expressions.py --- a/tests/… | 2024-02-19 20:29:03 | DynamoDB's `update_item` performs floating-point arithmetic with mock table created via `boto3` When using `moto.mock_aws` to create a `pyt… | getmoto/moto | 7f6c9cb1deafb280fe7fcc7551c38e397f11a706 | |
| 2 | getmoto__moto-6920 | Hi @MacHu-GWU, that attribute should be calculated inside the `LayerVersion`-class: https://github.com/getmoto/moto/blob/368fa07ec35aa6806… | diff --git a/moto/awslambda/models.py b/moto/awslambda/models.py --- a/moto/awslambda/models.py +++ b/moto/awslambda/models.py @@ -371,6 +3… | diff --git a/tests/test_awslambda/test_lambda_layers.py b/tests/test_awslambda/test_lambda_layers.py --- a/tests/test_awslambda/test_lambda… | 2023-10-15 20:33:23 | Lambda publish_layer_version function failed due to the wrong implementation ## Reporting Bugs When you run ``publish_layer_version`` … | getmoto/moto | 2021e564fafcdaa701b53de49bd580c8691a5fcc |
| 3 | getmoto__moto-5876 | All good @JorisLimousin - every enhancement is useful! hi, I am interested in fixing this issue. it will be a great opportunity to fix this… | diff --git a/moto/cognitoidp/exceptions.py b/moto/cognitoidp/exceptions.py --- a/moto/cognitoidp/exceptions.py +++ b/moto/cognitoidp/except… | diff --git a/tests/test_cognitoidp/test_cognitoidp_exceptions.py b/tests/test_cognitoidp/test_cognitoidp_exceptions.py --- a/tests/test_cog… | 2023-01-24 23:37:57 | Cognito - No validation that there isn't already an existing user with the same username in admin_update_user_attributes Hi, Sorry for t… | getmoto/moto | 6d41ad72e09b49f61e54d47880f8a65026e7c0e4 |
| 4 | getmoto__moto-5085 | Hi @dkatzbuc, thanks for raising this - doesn't look like this behaviour is implemented yet. Marking it as an enhancement. | diff --git a/moto/core/responses.py b/moto/core/responses.py --- a/moto/core/responses.py +++ b/moto/core/responses.py @@ -725,20 +725,6 @@… | diff --git a/tests/test_ec2/test_instances.py b/tests/test_ec2/test_instances.py --- a/tests/test_ec2/test_instances.py +++ b/tests/test_ec… | 2022-05-01 18:07:16 | When creating ec2 instances from launch template via run_instances, the instances aren't tagged I'm using moto in pytest. I have created a … | getmoto/moto | 6b70cd1b6b1cf493b66b6fcaaea9d1041331e836 |
| 5 | getmoto__moto-6709 | The Dynamo item has `software`, but the query looks for `packages` - could that be the problem? Note that I haven't verified this in Mot… | diff --git a/moto/dynamodb/models/__init__.py b/moto/dynamodb/models/__init__.py --- a/moto/dynamodb/models/__init__.py +++ b/moto/dynamodb… | diff --git a/tests/test_dynamodb/models/test_item.py b/tests/test_dynamodb/models/test_item.py --- a/tests/test_dynamodb/models/test_item.p… | 2023-08-21 18:57:36 | DynamoDB: special characters in get_item() projection expression not handled correctly Hi! I have a nested attribute inside a dynamodb t… | getmoto/moto | 78c518ddc832a30e1cf20015bc5c3b1850a1c797 |
| 6 | getmoto__moto-7082 | diff --git a/moto/logs/exceptions.py b/moto/logs/exceptions.py --- a/moto/logs/exceptions.py +++ b/moto/logs/exceptions.py @@ -11,7 +11,8 @… | diff --git a/tests/test_logs/test_export_tasks.py b/tests/test_logs/test_export_tasks.py new file mode 100644 --- /dev/null +++ b/tests/tes… | 2023-11-30 22:25:53 | Logs: create_export_task() currently does not export any data to S3 Two reasons: - the `filter_log_events` does not check any streams (a… | getmoto/moto | 8a16a6a86286983ea0c60591edbee729219b729f |
Data Dictionary
- instance_id text
- hints_text text
- patch text
- test_patch text
- created_at text
- problem_statement text
- repo categorical
- base_commit text
- version categorical
- PASS_TO_PASS mixed
- FAIL_TO_PASS mixed
Method And Limits
- Load the catalog entry and preview rows from the processed dataset file.
- Infer numeric, categorical, time, and location fields from real columns.
- Generate a small set of defensive Plotly chart specifications from that profile.
- Expose each chart idea as a query link so the report can be rerun or exported in Helix.
This report is intentionally reproducible. It uses the local catalog metadata and generated chart specifications rather than claiming external conclusions beyond the dataset.