@raprasad
Created April 29, 2019 16:06
ISI error with DA_college_debt
  • Dataset: DA_college_debt

(1) Initial request

{
    "userAgent": "TwoRavens",
    "version": "2019.2.27",
    "timeBound": 4,
    "priority": 1,
    "allowedValueTypes": [
        "DATASET_URI",
        "CSV_URI"
    ],
    "problem": {
        "problem": {
            "id": "problem 22",
            "version": "2.0",
            "name": "",
            "description": "The task here is to predict the median debt-to-earnings ratio of colleges across the US. The supplied dataset consists of the College Scorecards dataset which make it easier for students to search for a college that is a good fit for them. They can use the College Scorecard to find out more about a college's affordability and value so they can make more informed decisions about which college to attend, including factors such as median debt-to-earning ratio of graduates. This is a regression problem.",
            "taskType": "REGRESSION",
            "taskSubtype": "NONE",
            "performanceMetrics": [
                {
                    "metric": "ROOT_MEAN_SQUARED_ERROR"
                }
            ]
        },
        "inputs": [
            {
                "datasetId": "DA_college_debt_dataset_TRAIN",
                "targets": [
                    {
                        "resourceId": "0",
                        "columnIndex": 18,
                        "columnName": "DEBT_EARNINGS_RATIO"
                    }
                ]
            }
        ]
    },
    "template": {
        "inputs": [],
        "outputs": [],
        "steps": []
    },
    "inputs": [
        {
            "dataset_uri": "file:///ravens_volume/test_data/DA_college_debt/TRAIN/dataset_TRAIN/datasetDoc.json"
        }
    ]
}

(2) Response

{
    "searchId": "M4OKNB4BLL9TF2POPT6EV6"
}

(3) GetSearchSolutionsResults Request

{
    "searchId": "M4OKNB4BLL9TF2POPT6EV6"
}

(4a) Error returned via gRPC

<_Rendezvous of RPC that terminated with (StatusCode.UNKNOWN, Exception iterating responses: 'dimension')>
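
The gRPC status detail is just the quoted word 'dimension' because that is how Python renders a KeyError as a string. A minimal sketch of that behavior follows; the message format is copied from the log line in (4b), and the empty dict is only a stand-in for the metadata object, not the real DSBox or gRPC code.

```python
# Stand-in for a metadata mapping that lacks the 'dimension' key.
metadata = {}

try:
    metadata["dimension"]
except KeyError as exc:
    # str() of a KeyError is the repr of the missing key, quotes included.
    message = "Exception iterating responses: {}".format(exc)

print(message)
```

So the client-side error carries only the missing key name; the traceback in the TA2 terminal is needed to see where the lookup failed.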

(4b) Error seen in TA2 Terminal

2019-04-29 16:03:25,497 [INFO] dsbox.server.ta2_servicer --     | Init invoked
2019-04-29 16:03:25,497 [INFO] dsbox.server.ta2_servicer --     \_____________
Namespace(debug_volume_map=[], load_pipeline=None, port=45042)
Root logger level 20
  d3m_run: ta2ta3
  input_dir: /ravens_volume/test_data/DA_college_debt
  problem_schema: /ravens_volume/test_data/DA_college_debt/TRAIN/problem_TRAIN/problemDoc.json
  output_dir: /ravens_volume/test_output/DA_college_debt
  local_dir: /ravens_volume/test_output/DA_college_debt/local_dir
  static_dir: /ravens_volume/test_output/DA_college_debt/static_dir
  cpu: 1
  ram: 1048576000
  timeout: 600
  timeout_search: 480
  search_method: parallel

2019-04-29 16:03:25,608 [INFO] dsbox.JobManager.DistributedJobManager -- <ForkProcess(ForkPoolWorker-3, started daemon)> > worker process started
2019-04-29 16:03:25,609 [INFO] dsbox.JobManager.DistributedJobManager -- <ForkProcess(ForkPoolWorker-3, started daemon)> > waiting on new jobs
2019-04-29 16:03:28,867 [INFO] dsbox.server.ta2_servicer --     | Hello invoked
2019-04-29 16:03:28,867 [INFO] dsbox.server.ta2_servicer --     \_____________
2019-04-29 16:03:30,655 [INFO] dsbox.server.ta2_servicer --     | SearchSolutions invoked
2019-04-29 16:03:30,655 [INFO] dsbox.server.ta2_servicer --     \_____________
2019-04-29 16:03:30,721 [INFO] dsbox.server.ta2_servicer --     | SearchSolutions returning
2019-04-29 16:03:30,722 [INFO] dsbox.server.ta2_servicer --     \_____________
2019-04-29 16:03:30,839 [INFO] dsbox.server.ta2_servicer --     | GetSearchSolutionsResults invoked with search_id: M4OKNB4BLL9TF2POPT6EV6
2019-04-29 16:03:30,840 [INFO] dsbox.server.ta2_servicer --     \_____________
04-29 16:03:30 [INFO] dsbox.controller.controller -- Top level output directory: /ravens_volume/test_output/DA_college_debt
04-29 16:03:31 [WARNING] d3m.metadata.base -- d3m.primitives.sri.graph.GraphMatchingParser: Primitive's Python path does not adhere to d3m.primitives namespace specification (see https://gitlab.com/datadrivendiscovery/d3m/issues/3). Reason: primitive family segment must match primitive's primitive family. This API will be made mandatory after January 2019 and this warning will become an exception.
04-29 16:03:31 [WARNING] d3m.metadata.base -- d3m.primitives.sri.graph.GraphMatchingParser: Primitive's Python path does not adhere to d3m.primitives namespace specification (see https://gitlab.com/datadrivendiscovery/d3m/issues/3). Reason: must have a known primitive name segment.
04-29 16:03:31 [WARNING] d3m.metadata.base -- d3m.primitives.sri.graph.GraphTransformer: Primitive's Python path does not adhere to d3m.primitives namespace specification (see https://gitlab.com/datadrivendiscovery/d3m/issues/3). Reason: primitive family segment must match primitive's primitive family. This API will be made mandatory after January 2019 and this warning will become an exception.
04-29 16:03:31 [WARNING] d3m.metadata.base -- d3m.primitives.sri.graph.GraphTransformer: Primitive's Python path does not adhere to d3m.primitives namespace specification (see https://gitlab.com/datadrivendiscovery/d3m/issues/3). Reason: must have a known primitive name segment.
04-29 16:03:31 [WARNING] d3m.metadata.base -- d3m.primitives.sri.psl.LinkPrediction: Primitive's Python path does not adhere to d3m.primitives namespace specification (see https://gitlab.com/datadrivendiscovery/d3m/issues/3). Reason: primitive family segment must match primitive's primitive family. This API will be made mandatory after January 2019 and this warning will become an exception.
04-29 16:03:31 [WARNING] d3m.metadata.base -- d3m.primitives.sri.psl.LinkPrediction: Primitive's Python path does not adhere to d3m.primitives namespace specification (see https://gitlab.com/datadrivendiscovery/d3m/issues/3). Reason: must have a known primitive name segment.
04-29 16:03:31 [WARNING] d3m.metadata.base -- d3m.primitives.sri.psl.GraphMatchingLinkPrediction: Primitive's Python path does not adhere to d3m.primitives namespace specification (see https://gitlab.com/datadrivendiscovery/d3m/issues/3). Reason: primitive family segment must match primitive's primitive family. This API will be made mandatory after January 2019 and this warning will become an exception.
04-29 16:03:31 [WARNING] d3m.metadata.base -- d3m.primitives.sri.psl.GraphMatchingLinkPrediction: Primitive's Python path does not adhere to d3m.primitives namespace specification (see https://gitlab.com/datadrivendiscovery/d3m/issues/3). Reason: must have a known primitive name segment.
04-29 16:03:31 [INFO] dsbox.template.library -- DSBoxTemplate:SRI_Mean_Baseline_Template has been added to template base.
04-29 16:03:31 [ERROR] grpc._server -- Exception iterating responses: 'dimension'
Traceback (most recent call last):
  File "/usr/local/lib/python3.6/dist-packages/grpc/_server.py", line 408, in _take_response_from_response_iterator
    return next(response_iterator), True
  File "/user_opt/dsbox/dsbox-ta2/python/dsbox/server/ta2_servicer.py", line 260, in GetSearchSolutionsResults
    status = self.controller.train()
  File "/user_opt/dsbox/dsbox-ta2/python/dsbox/controller/controller.py", line 1165, in train
    self._check_and_set_dataset_metadata()
  File "/user_opt/dsbox/dsbox-ta2/python/dsbox/controller/controller.py", line 160, in _check_and_set_dataset_metadata
    self.all_dataset.metadata.query((resource_id, ALL_ELEMENTS))['dimension']['length'] - 1,
  File "/usr/local/lib/python3.6/dist-packages/frozendict/__init__.py", line 29, in __getitem__
    return self._dict[key]
KeyError: 'dimension'
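
The traceback shows the failure is an unguarded `['dimension']['length']` lookup on the metadata returned for `(resource_id, ALL_ELEMENTS)`. As a rough sketch of a more defensive lookup, assuming the query result behaves like a read-only mapping (as the frozendict frame suggests): `last_column_index` is a hypothetical helper, and the sample dictionaries are made-up values for illustration, not the actual dataset metadata or the DSBox fix.

```python
from typing import Any, Mapping, Optional


def last_column_index(all_elements_metadata: Mapping[str, Any]) -> Optional[int]:
    """Return the index of the last column, or None when the
    'dimension' / 'length' metadata is missing.

    Hypothetical helper: it mirrors the expression in the traceback
    (length - 1) but does not raise KeyError on absent metadata.
    """
    dimension = all_elements_metadata.get("dimension")
    if dimension is None or "length" not in dimension:
        return None
    return dimension["length"] - 1


# Made-up metadata values, for illustration only.
with_dimension = {"dimension": {"length": 5}}
without_dimension = {}

print(last_column_index(with_dimension))     # 4
print(last_column_index(without_dimension))  # None
```

A caller could then log which resource is missing the metadata, or skip it, instead of letting the KeyError end the response stream. Why the metadata is absent for this dataset is not established by the log.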