Swe Gym research report

Executive Summary

SWE-Gym contains 2438 instances sourced from 11 Python repos, following SWE-Bench data collection procedure. Get started at project page github.com/SWE-Gym/SWE-Gym

Finding 1The dataset has 500 rows available in the catalog.

Finding 2The catalog exposes 11 documented or inferred columns.

Finding 3Helix has 4 ready query prompts for this dataset.

Finding 4This report includes 1 generated chart views.

Research Context

Swe Gym: 500 rows by 11 columns. These exploratory charts are generated automatically from the data - open the dataset in Helix to ask your own questions.

Data Profile

Rows500

Columns11

Categorical cols2

Chart Evidence

These views are generated from the dataset profile. Each chart is paired with a Helix query so it can be opened, adjusted, and exported.

Chart 1

repo by record count

Most common repo values across records.

file_download Open and export this chart

Follow-Up Queries

Preview Rows

#	instance_idtext	hints_texttext	patchtext	test_patchtext	created_attext	problem_statementtext	repotext	base_committext
1	getmoto__moto-7365		diff --git a/moto/dynamodb/models/dynamo_type.py b/moto/dynamodb/models/dynamo_type.py --- a/moto/dynamodb/models/dynamo_type.py +++ b/moto…	diff --git a/tests/test_dynamodb/test_dynamodb_update_expressions.py b/tests/test_dynamodb/test_dynamodb_update_expressions.py --- a/tests/…	2024-02-19 20:29:03	DynamoDB's `update_item` performs floating-point arithmetic with mock table created via `boto3` When using `moto.mock_aws` to create a `pyt…	getmoto/moto	7f6c9cb1deafb280fe7fcc7551c38e397f11a706
2	getmoto__moto-6920	Hi @MacHu-GWU, that attribute should be calculated inside the `LayerVersion`-class: https://github.com/getmoto/moto/blob/368fa07ec35aa6806…	diff --git a/moto/awslambda/models.py b/moto/awslambda/models.py --- a/moto/awslambda/models.py +++ b/moto/awslambda/models.py @@ -371,6 +3…	diff --git a/tests/test_awslambda/test_lambda_layers.py b/tests/test_awslambda/test_lambda_layers.py --- a/tests/test_awslambda/test_lambda…	2023-10-15 20:33:23	Lambda publish_layer_version function failed due to the wrong implementation ## Reporting Bugs When you run ``publish_layer_version`` …	getmoto/moto	2021e564fafcdaa701b53de49bd580c8691a5fcc
3	getmoto__moto-5876	All good @JorisLimousin - every enhancement is useful! hi, I am interested in fixing this issue. it will be a great opportunity to fix this…	diff --git a/moto/cognitoidp/exceptions.py b/moto/cognitoidp/exceptions.py --- a/moto/cognitoidp/exceptions.py +++ b/moto/cognitoidp/except…	diff --git a/tests/test_cognitoidp/test_cognitoidp_exceptions.py b/tests/test_cognitoidp/test_cognitoidp_exceptions.py --- a/tests/test_cog…	2023-01-24 23:37:57	Cognito - No validation that there isn't already an existing user with the same username in admin_update_user_attributes Hi, Sorry for t…	getmoto/moto	6d41ad72e09b49f61e54d47880f8a65026e7c0e4
4	getmoto__moto-5085	Hi @dkatzbuc, thanks for raising this - doesn't look like this behaviour is implemented yet. Marking it as an enhancement.	diff --git a/moto/core/responses.py b/moto/core/responses.py --- a/moto/core/responses.py +++ b/moto/core/responses.py @@ -725,20 +725,6 @@…	diff --git a/tests/test_ec2/test_instances.py b/tests/test_ec2/test_instances.py --- a/tests/test_ec2/test_instances.py +++ b/tests/test_ec…	2022-05-01 18:07:16	When creating ec2 instances from launch template via run_instances, the instances aren't tagged I'm using moto in pytest. I have created a …	getmoto/moto	6b70cd1b6b1cf493b66b6fcaaea9d1041331e836
5	getmoto__moto-6709	The Dynamo item has `software`, but the query looks for `packages` - could that be the problem? Note that I haven't verified this in Mot…	diff --git a/moto/dynamodb/models/__init__.py b/moto/dynamodb/models/__init__.py --- a/moto/dynamodb/models/__init__.py +++ b/moto/dynamodb…	diff --git a/tests/test_dynamodb/models/test_item.py b/tests/test_dynamodb/models/test_item.py --- a/tests/test_dynamodb/models/test_item.p…	2023-08-21 18:57:36	DynamoDB: special characters in get_item() projection expression not handled correctly Hi! I have a nested attribute inside a dynamodb t…	getmoto/moto	78c518ddc832a30e1cf20015bc5c3b1850a1c797
6	getmoto__moto-7082		diff --git a/moto/logs/exceptions.py b/moto/logs/exceptions.py --- a/moto/logs/exceptions.py +++ b/moto/logs/exceptions.py @@ -11,7 +11,8 @…	diff --git a/tests/test_logs/test_export_tasks.py b/tests/test_logs/test_export_tasks.py new file mode 100644 --- /dev/null +++ b/tests/tes…	2023-11-30 22:25:53	Logs: create_export_task() currently does not export any data to S3 Two reasons: - the `filter_log_events` does not check any streams (a…	getmoto/moto	8a16a6a86286983ea0c60591edbee729219b729f

Data Dictionary

instance_id text
hints_text text
patch text
test_patch text
created_at text
problem_statement text
repo categorical
base_commit text
version categorical
PASS_TO_PASS mixed
FAIL_TO_PASS mixed

Method And Limits

Load the catalog entry and preview rows from the processed dataset file.
Infer numeric, categorical, time, and location fields from real columns.
Generate a small set of defensive Plotly chart specifications from that profile.
Expose each chart idea as a query link so the report can be rerun or exported in Helix.

This report is intentionally reproducible. It uses the local catalog metadata and generated chart specifications rather than claiming external conclusions beyond the dataset.